Diabetes Risk Predictor

1 in 10 Americans has diabetes. What if we could predict it earlier?
Healthcare ML Python R scikit-learn

The Problem

Diabetes affects 37 million Americans and costs the healthcare system $327 billion annually. The tragedy is that Type 2 diabetes is often preventable — if caught early. But current screening relies on patients showing symptoms, which means many are diagnosed too late for lifestyle interventions to be effective.

The question: Can we use readily available patient data (BMI, blood pressure, age, family history) to flag at-risk individuals before symptoms appear?

The Approach

📊
EDA
Feature analysis
🔧
Engineer
Clinical features
🧰
Model
LR + RF + XGB
💡
Interpret
Feature importance

Key Results

85%
Accuracy
3
Models Compared
High
Sensitivity

Business Value

Healthcare Cost Reduction: Early intervention for pre-diabetic patients saves an estimated $3,000–$5,000 per patient per year in avoided complications. Screening Scale: An automated risk model can screen thousands of patients per hour vs. manual clinical assessment. Explainability: Feature importance analysis means clinicians trust and adopt the tool.

Tech Stack

Python R scikit-learn Pandas Matplotlib / Seaborn XGBoost
← Back to all projects