MIT Solve

Solution Overview & Team Lead Details

What is the name of your organization?

Institute for Technology & Global Health

What is the name of your solution?

Community Data Health Initiative

Provide a one-line summary of your solution.

AI-driven city-level data discovery and analytics for improved population health outcomes

If your solution has a website or an app, provide the links here:

N/A

What specific problem are you solving?

Cross-sector, cross-agency data harmonization has demonstrated enormous potential for population-level health outcomes, but the practice is not common in U.S States and cities, limiting the ability of healthcare providers to make the most effective decisions to protect the health of their populations. The global pandemic (COVID-19), followed by multiple disease outbreaks in communities across the US, including Polio which was thought to have been eradicated, highlighted the urgency with which targeted, hyper-local strategies are needed to prioritize resource deployment. Robust data systems and the use of advanced technologies like artificial intelligence (AI) can help in this process. Currently:

Cities have not fully explored the use of integrated, non-traditional data sources to get a complete landscape overview and which helps to identify communities with the most urgent needs. These data sources could include consumer wearables and IoT devices, sentinel data from satellites, environmental and ecological data, wastewater surveillance, cybersecurity, and data from grass roots organizations)
Local governments have not harmonized data across their several agencies (education departments, health departments, energy and the environment etc.) to establish ground truth. Accepted convention by practitioners is that this process is not possible, yet, through this initiative, we are demonstrating that it can be done.
While some cities are exploring the use of technologies (IoT devices), the haphazard approaches undertaken may provide little utility because of non-standardized approaches
Cities are not effectively building and leveraging relationships between the public, private, non-profit, and academic communities. Historical approaches have been public-private or public-academic, not the trifecta
Cities are not leveraging the power of technologies like Artificial Intelligence and Machine Learning in health, to parse multi-source, big data, to provide relevant insights for health decision and policy making
Cities are not investing sufficient resources to better understand the ethical implications for the use of novel and advanced technologies in public health

Pooling data across agencies and community programs helps paint a more nuanced and holistic picture of health determinants which helps to formulate targeted interventions that addresses the root causes of health disparities.

Cities need to become more efficient with resource allocation, early detection and intervention, improved care coordination, and enhanced research and innovation, all of which can be enabled through the applications of artificial intelligence in secure, privacy-preserving ways.

What is your solution?

Our solution has two parts:

1) The Community Data Health Initiative is currently working with cities to build internal capacity for data discovery and analytics, identify and address data and personnel siloes, and build community engagement infrastructure. Each pilot city first established working groups comprising city and county officials (mayors offices), chief data officers, public health and scientific experts, community organizations - including grassroots groups and activists, and researchers from academic institutions to a) Chart a path for data informed decision and policy making b) Understand and map risks c) Establish variables as indicators of outcomes of interest d) Outline data needs and their owners e) Establish interagency and organizational data commitments, data sharing mechanisms, and data safety, including ethical analyses f) Develop and establish evaluation methodologies and analytics, g) Create visual data systems.

We are breaking down historically accepted conventions that it is impossible to bridge interagency data systems and establish hyperlocal (city level) data sharing mechanisms. Five cities are currently piloting our initiative: Washington DC, Baton Rouge, Denver, and Baltimore.

2) The use of AI: As we set up data infrastructure, we are working with cities to address the questions of what data are needed, who owns it, how can it be shared, legal and regulatory issues, standardization and formatting issues, ethical concerns, storage and security concerns, who will have access, and how AI will be used to improve efficiencies and health outcomes.

Our cities will be using AI in the following ways:

Micro-Population Health Interventions:

We are better able to micro target subgroups within populations, who need urgent help. By analyzing comprehensive data sets, we are better able to optimize, down to street levels, who is most in need of resources. Traditionally, health interventions and programs relied almost exclusively on the combination of social factors and health records for targeting. AI makes it possible to analyze very large datasets and include other confounding factors in it's analyses including climate change effects data, traffic and pollution data, critical service gaps like housing, insurance, and transportation, and proximity to environmental hazards, to identify specific areas within communities with the most need. By omitting these other data points in historical analyses, health programs often missed the individuals who needed the most help.

Predictive Analytics for the identification and prevention of disease:

In collaboration with our partners, we are using AI to analyze historical data to identify patterns and to predict potential outbreaks within communities. Using the combination of cross-agency and shared community data sources provides a more comprehensive overview of ground conditions and the algorithms are able to predict outcomes with a higher degree of accuracy.

Over time and with more data, the solution will be able to help with:

Earlier detection of risks to population health
Discovering behavioral patterns that affects health outcomes and prescribing interventions
Monitor community health indicators
Generate more comprehensive health insights

Who does your solution serve, and in what ways will the solution impact their lives?

CDHI is currently piloting in 4 cities in the US, Including:

Baltimore, Maryland:

Baltimore: Population - 576, 498, roughly 70% minority, life expectancy differs by up to 20 years between neighborhoods, and 23% living below the poverty level.

Washington DC:

Population - 712, 816, roughly 60% minority, 27 year difference in life expectancy between neighborhoods, about 30% of black population living below poverty

Baton Rouge, Louisiana:

With a population of 222, 185, the city of Baton Rouge does not have a health department. 25% of the total population live below the poverty line, however, within minority communities, that rate increases to about 56%.

By sharing cross agency data and data from devices such as air pollution sensors, then visualizing the data by using mapping technologies, each city-level working group can get a clear overview of ground conditions, including which communities are most at risk. For example, using AI to analyze multi agency datasets and confounding factors such as areas with lowest rates of health insurance, PM 2.5 levels, life expectancy, and heart health, points to neighborhoods and specifically, which streets, are most in need and that are often overlooked in city budgets and resource prioritization.

How are you and your team well-positioned to deliver this solution?

The initiative in each pilot city is led by the active collaboration between small, local community organizations, city agencies, and governments, to identify health challenges, establish potential solutions, and work together to include all relevant stakeholders. This includes:

Students and local academic institutions
Grass roots and community organizations
Government agencies and staff
Domain experts across both the public and private sectors

The working groups convene regularly (weekly) to evaluate progress and iterate on processes to ensure compliance and progress towards established goals.

ITGH has been leading the topic group on “AI for Outbreaks'' within the joint World Health Organization and International Telecommunications Union working group on “AI for Health,” over the past three years to benchmark the use of AI and ML in disease outbreak and to develop standards for the use of the technologies in that context. The institute has also been conducting research on the use of novel technologies to improve population health outcomes and has published academic papers on related topics including policy and ethics. Lessons from these activities helps to inform the design of this initiative.

The team is international and boasts a diverse background across race, gender, nationality, and experiences including working on solutions in global and undeveloped contexts. The ITGH team has broad experience with implementing digital health solutions, working with governments across all levels - from hyperlocal to federal, and both within the US and globally. Importantly, the team is composed of domain experts who are interested in both research and practice and has multiple active collaborations with leading academic institutions, research centers, local community groups, and multilateral organizations.

Which dimension of the Challenge does your solution most closely address?

Collecting, analyzing, curating, and making sense of big data to ensure high-quality inputs, outputs, and insights.
Using data sharing and interoperability of systems.

In what city, town, or region is your solution team headquartered?

Cambridge, MA, USA

What is your solution’s stage of development?

Pilot: An organization testing a product, service, or business model with a small number of users

In which of the following areas do you most need partners or support?

Monitoring & Evaluation (e.g. collecting/using data, measuring impact)
Product / Service Distribution (e.g. delivery, logistics, expanding client base)
Technology (e.g. software or hardware, web development/design)

Who is the Team Lead for your solution?

Khahlil Louisy

More About Your Solution

What makes your solution innovative?

There are several distinguishing factors that make this solution innovative:

A departure from traditional approaches. Health practitioners typically rely on "health data" which includes electronic health records, clinical data, hospital admissions, insurance claims, and medical histories, along with some social determinants data, in making decisions. Many more factors affect health outcomes and by including these data points in our analyses, we can get a much more comprehensive overview of what is actually taking place within communities.
Truly integrated data ecosystem: cross-agency data sharing has historically been a major problem for both policy-makers and health practitioners because there have either been limited or no mechanisms for doing so. By starting at the hyperlocal levels, we are facilitating a path to bridging the divide and with representatives from all relevant agencies and groups.
Data Fusion and Analysis: The machine learning algorithms enables identifying complex patterns, correlations, and trends that otherwise would not be apparent through traditional methods of analyses
Analytics for Proactive Interventions: The predictive capacity of AI enables the identification of health risks and trends before they escalate, giving health professionals sufficient time to devise containment strategies.
True cross-sector collaborations outside of emergencies: many health departments, agencies, and community groups work in silos. This initiative establishes new pathways for active collaborations and engagements across these groups, which could lead to more cohesive and comprehensive strategies and interventions targeting health disparities.

The solution is comprehensive in scope, addressing the challenge of multi-stakeholder inclusion in the decision making process - from small, grass roots organizations all they way up to mayors offices, and the sharing of the data that they collect.

Through the initiative, the combination of these stakeholders decide together on which health challenges to focus on within their communities and the relevant data needed to address them. Importantly, the support of the mayors offices helps to facilitate the mechanisms through which data sharing is done.

Incorporating technologies like AI and ML in multi-sourced city-level data and using visualizations from mapping technologies to generate unique insights is novel. We expect this model to be replicated by other cities and indeed, it is already generating strong interest.

How does your solution address or plan to address UN Sustainable Development Goal 3 for Good Health and Well-Being?

Our solution contributes to the UN Sustainable Development Goal 3 in three different ways:

Universal Health Coverage:

By using AI to identify health disparities and areas or groups with the most need, we can optimize resource allocation and develop targeted interventions, to move closer to achieving universal health coverage i.e. ensuring that all members of a given population have equal access to the healthcare that they need.

Health Security and reducing inequality:

The capability AI provides for early threat detection, which improves response, helps in the prevention of disease. Further, by analyzing larger, yet community-specific datasets, helps identify and address health inequities which ultimately reduces health disparities.

Data-Driven Decision Making and Capacity Building:

Designing informed policies and interventions helps to enhance the effectiveness of health systems. By building teams across sectors and agencies, we are helping to address efficiency issues and establish broader support in the delivery of health services, building capacity in the process.

Describe the AI components and underlying data that powers your solution.

For a comprehensive system that enables AI to accurately analyze, predict, and enhance health outcomes, we are using the following features and elements:

Big Data and Sources: Combination of data across multiple agencies, departments, organizations, groups, and individuals, including behavior, environmental, and social determinants data.

Integration and Interoperability: facilitating aggregation and integration of the diverse datasets

Machine Learning Algorithms: Used for predictive analytics, classification and clustering, and Natural Language Processing for extracting insights from the unstructured data.

Deep-Learning Models: Including convoluted neural networks and recurrent neural networks with image recognition and time-series analyses, while feature learning helps extract relevant features from the complex datasets

Continuous Learning and Improvement Mechanisms: Feedback loops for incorporating feedback mechanisms for continuous learning.

We are looking into how differential privacy and federated learning can be incorporated into these systems for privacy and protection.

How are you ensuring ethical and responsible use of AI in your work? How are you addressing or mitigating potential risks in your solution?

Data governance and compliance, security, transparency and explainability, community engagement and participation, and bias mitigation are all key features of this bold undertaking.

Teams across each pilot city are diverse and representative, adopt ethical guidelines and adhere to standards. Anonymization and deidentification are central to all data collection practices and robust practices are in place to do so.

ITGH has published extensively on the ethics of AI and it's applications to health and is well positioned to guide implementations and develop best-practice guidelines with partners. These include:

What are your impact goals for the next year and the next five years, and how will you achieve them?

Goals for the next year with four pilot cities include:

1. Create models for cities and counties to address health issues in a cross-agency manner, including housing, code enforcement, infrastructure investment and public health.
2. Identify means of testing and sensing, that produce the most deleterious effects on residents.
3. Suggest intervention strategies that combine analytics, IoT information and community generated data.
4. Design for solutions for local officials, and in particular members of the African American Mayors Association (partners)
5. Develop visualized spatial narratives that bring urgent attention to the problem and show where interventions will make the most difference.
6. Develop communications strategies that incorporate community sentiments in identifying neighborhood concerns related to environment and health.

Through the achievement of these goals, we anticipate members of the communities most impacted through omission in resource distribution, to begin experiencing change.

Goals for the next five years include significant expansion of pilot cities, while continuing to iterate on existing processes and achieving growth in initial pilot locations

Your Team

Your Operational Plan & Funding

What is your operational model and plan?

The initiative is being designed and implemented with a core diverse group of partners - the Institute for Technology and Global Health (ITGH), the Data-Smart City Solutions lab at the Harvard Kennedy School, the Environmental Defense Fund (EDF), ESRI, the African American Mayor's Association, and public health researchers from Johns Hopkins and Howard University. Pilot cities identify and bring to the table all relevant agencies, community organizations, individuals, and organizations whose inclusion are vital.

Operationally, ITGH's role in the initiative is solution design, facilitation, analytics, and evaluation. Further, we structure research to assess the impact of the initiative and prepare cases for relevant stakeholders. The Environmental Defense Fund manages the inclusion and support from subject matter experts. The Data Smart City Solutions team helps to identify key stakeholders and establishes relationships. The African American Mayors Association establishes and manages relationships with mayors and the inclusion of HBCUs. ESRI helps with data, mapping, and visualization needs, technologists and researchers help with the development and structure of the technological tools. Implementations are managed by city leadership

Project managers and associates lead weekly meetings with pilot cities to facilitate the movement of the initiative forward. Convenings are held in the initial phase of the initiative to:

Conduct needs assessments and defining goals
Stakeholder engagement
Agree on data needs, collection methodologies, and integration
Data governance and security
Technology infrastructure needs
Community engagement and education
Implementation planning
Oversight and governance
Public advocacy
Scaling and replicability
Monitoring and evaluation
Capacity building

Because buy-in is at all levels of the city, from advocacy groups to the mayors' offices, access to tools and resources needed to implement the solution is more easily available.

Once the solution has been fully implemented and pilot cities take full ownership, our role will evolve to consultants to ensure continued progress and to help shape future iterations.

What is your plan for becoming financially sustainable?

The initial phases of the initiative in each city is currently supported by grants and funding Robert Wood Johnson Foundation.

Because we want cities to take full ownership of the program after it has been launched and fully integrated into their ecosystem, further funding is built into their operational budgets. Project partners take on a different role beyond that point.

We anticipate that funding to scale the initiative to other cities (and there is a pipeline of cities that have expressed interest), will continue to come from grants. However, we also anticipate long-term maintenance and consulting contracts with local-government to continue evolving the solution within their cities.

What are your current operating costs, and what are your projected operating costs for the next year? Please include human capital estimates.

ITGH's portion of operational costs primarily consists of travel and lodging to the pilot cities (approximately $27,000) and human capital in the form of fellowships ($35,000), for an approximate total of $62,000.

We expect this figure to remain the same over the next year.

Applicants can request and receive funding at a minimum of 50k and maximum of $100k. How much funding are you seeking to continue your work in 2024, and how did you select this number? What would you use this funding for? Funding is limited; please consider carefully the right amount to request.

ITGH is seeking funding of $62, 000k for this effort, which will include support for :

- Travel for team members to pilot cities

- Continuous convening and engagements with communities

- human capital in the form of fellowships

The Cure Residency will provide winners with seed funding, mentorship, lab space, mentorship, educational programming, and networking opportunities. How do you imagine this opportunity will help support your work? Which aspects of the Cure Residency would you be most excited about?

The 1 year residency within an ecosystem of other practitioners and experts who are aligned on a shared mission to improve population level health outcomes is incredibly exciting. Novel and innovative solutions do not often emerge from silos and cross pollination of work can lead to powerful breakthroughs in solving complex challenges.

ITGH is a team of junior practitioners and researchers and the opportunity for funding to execute our mission and mentorship from experienced experts and executives would be invaluable. Because we are now actively engaged with New York City's health authority, having a residency in the city would be exciting and help with efficiency.