This project focuses on applying Machine Learning models for predictive healthcare analytics.
The goal is to assist in early diagnosis and risk prediction using structured medical datasets.
Healthcare providers face challenges in predicting diseases early due to:
- Large volumes of patient data
- Noisy & imbalanced datasets
- The need for interpretable predictions
This project applies ML models to predict outcomes, compare performance, and generate visual + PDF reports for decision support.
- Source: Public healthcare dataset (e.g., Pima Indians Diabetes Dataset)
- Features: Patient health attributes (glucose, BMI, blood pressure, etc.)
- Target: Binary classification (disease present / not present)
-
Clone the repo:
git clone https://github.com/akshataundri/healthcare-predictive-analytics.git cd healthcare-predictive-analytics -
Create environment & install dependencies:
python -m venv venv source venv/bin/activate # Linux/Mac venv\Scripts\activate # Windows pip install -r requirements.txt
-
Run training:
python src/train.py
-
Generate evaluation report:
python src/evaluate.py
🚀 Future Work
Hyperparameter tuning
Integration with real-time healthcare systems
Deployment as an API for hospitals/clinics
Explainable AI (feature importance, SHAP values)