Machine Learning for Population Health
AHealthcareZ - Healthcare Finance Explained
@ahealthcarez
Published: July 19, 2021
Insights
This video provides an in-depth exploration of machine learning (ML) applications in population health, demystifying the concept for a general audience. Dr. Eric Bricker begins by establishing a clear distinction between traditional, human-programmed "if-then" algorithms and true machine learning, where software autonomously learns and generates its own rules. He emphasizes that "real AI" lies in this self-learning capability. The presentation then pivots to a practical application, showcasing ClosedLoop.ai, an Austin-based startup that won the prestigious CMS AI Challenge, beating out 300 other organizations including IBM and the Mayo Clinic, for its prowess in applying ML to population health data.
The core purpose of applying machine learning in population health, as highlighted in the video, is to significantly improve the identification of high-risk individuals who are most likely to benefit from targeted interventions. This contrasts sharply with historical "predictive analytics" methods, which have proven largely ineffective in accurately pinpointing the small percentage of people (e.g., 5%) who drive a disproportionate amount of healthcare costs or adverse outcomes. ClosedLoop.ai's approach leverages ML to overcome these limitations, enabling health plans and systems to proactively intervene with individuals at high risk of complications, ER visits, or hospitalizations, thereby improving health outcomes and optimizing resource allocation.
The video further delves into three crucial aspects that make machine learning effective and trustworthy in a healthcare context. First, explainability is paramount; ML models cannot be black boxes, as healthcare professionals need to understand why a particular individual is flagged for intervention to build trust and ensure effective action. Second, addressing bias is critical, as historical health data often contains inherent biases related to demographics, income, or race, which ML models must be explicitly programmed to counteract. Third, ML's ability to handle messy data is a significant advantage, as it can infer insights (e.g., a diabetes diagnosis from insulin prescriptions, even without an ICD-10 code) in a way that traditional algorithms or human analysis often struggle with. A compelling case study of ClosedLoop.ai's COVID-19 Vulnerability Index demonstrates these principles in action, showing how ML can rapidly identify individuals at high risk of severe COVID-19 complications, leading to practical interventions like home delivery of groceries and prescriptions.
Finally, the discussion touches upon the transformative impact of the COVID-19 pandemic on ML in population health, particularly in two areas: speed and implementation. The pandemic underscored the need for rapid model development, with ClosedLoop.ai creating their COVID-19 index in a single weekend due to an existing platform. More profoundly, the video identifies the "limiting reagent" for wider ML adoption not as the software or data, but as the ability to persuade and consult with organizations to effectively integrate and apply these solutions. This highlights a critical gap in translation and implementation, emphasizing the need for strong consulting and sales capabilities to bridge the divide between advanced ML technology and its practical application in healthcare settings.
Key Takeaways:
- Machine Learning vs. Traditional Algorithms: Machine learning distinguishes itself by enabling software to autonomously learn and create its own "if-then" rules, unlike traditional algorithms that rely on human-programmed instructions. This self-learning capability is considered the hallmark of "real AI."
- Purpose of ML in Population Health: The primary goal is to accurately predict individuals who are at high risk of adverse health outcomes (e.g., hospitalizations, ER visits, complications) so that targeted, proactive interventions can be implemented.
- Ineffectiveness of Historical Predictive Analytics: Traditional methods of identifying high-risk populations have proven largely ineffective, often failing to accurately pinpoint the small percentage of individuals responsible for a majority of healthcare costs or negative outcomes.
- Importance of Explainability: For machine learning models to be trusted and utilized by healthcare professionals (nurses, physicians), their predictions cannot be black boxes. The rationale behind identifying a high-risk individual must be explainable and transparent.
- Explicitly Addressing Bias: Healthcare data inherently contains biases related to demographics, socioeconomic status, and other factors. ML models must be designed and programmed to explicitly identify and mitigate these biases to ensure equitable and accurate predictions across all populations.
- Handling Messy Data: Machine learning excels at making sense of imperfect or "messy" healthcare data. It can infer conditions or risks even when complete or perfectly coded data is absent, such as identifying diabetes from insulin prescriptions without an official ICD-10 code.
- Case Study: COVID-19 Vulnerability Index: ClosedLoop.ai developed an index using 21 measures (demographics, chronic conditions, utilization data) to predict with 80% sensitivity who would be at high risk of severe COVID-19 complications if infected.
- Practical Interventions: Based on the COVID-19 Vulnerability Index, health plans implemented real-world interventions for high-risk individuals, including arranging grocery deliveries and ensuring home delivery of prescription medications to minimize exposure risks.
- Speed of ML Development: With an established platform and framework, machine learning projects can be developed rapidly, as demonstrated by the COVID-19 index being created in a single weekend, enabling quick responses to emerging health challenges.
- The Limiting Factor is Consulting and Persuasion: The primary barrier to wider adoption and implementation of machine learning in population health is not the software, the data, or even the programming skill. Instead, it's the ability to effectively persuade and consult with organizations (health plans, hospitals, employers) on how, when, and why to apply these solutions.
- Opportunity for Consulting Firms: The identified "limiting reagent" highlights a significant opportunity for consulting and sales professionals to bridge the gap between advanced ML capabilities and organizational implementation, translating technical solutions into practical value.
Key Concepts:
- Machine Learning (ML): A subsegment of AI where computer software can create and improve algorithms on its own, learning from data without explicit programming of "if-then" statements.
- Population Health: An approach to health that aims to improve the health outcomes of a group of individuals, including the distribution of such outcomes within the group.
- Predictive Analytics: The use of data, statistical algorithms, and machine learning techniques to identify the likelihood of future outcomes based on historical data.
- Explainability (in ML): The ability to understand and interpret how a machine learning model arrives at its predictions or decisions, crucial for trust and adoption in critical fields like healthcare.
- Bias (in ML): Systematic and repeatable errors in a computer system that create unfair outcomes, such as favoring or disfavoring certain groups of people.
- Messy Data: Imperfect, incomplete, or inconsistently formatted data that often requires significant effort to clean and prepare for analysis.
Examples/Case Studies:
- ClosedLoop.ai: A company based in Austin, Texas, specializing in applying machine learning to population health data. They won the CMS AI Challenge, beating out 300 other organizations including IBM, the Mayo Clinic, and Deloitte.
- COVID-19 Vulnerability Index: A machine learning model created by ClosedLoop.ai to identify individuals at high risk of severe complications if they contracted COVID-19. This index was used by institutions like Johns Hopkins, the University of Texas Medical Branch, and Einstein.
- Interventions for High-Risk Individuals: A New York City health insurance plan used the COVID-19 Vulnerability Index to identify high-risk members and arranged for groceries and prescription medications to be delivered to their homes, helping them avoid exposure to the virus.