Skip to content

Commit 7e85e51

Browse files
authored
Merge pull request #1358 from LitZeus/add-diabetes-detection
Added Multimodal Diabetes Detection using 3 ensemble models
2 parents 4fd2cae + 9299d67 commit 7e85e51

File tree

7 files changed

+11100
-0
lines changed

7 files changed

+11100
-0
lines changed
Lines changed: 83 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,83 @@
1+
# Multimodal Diabetes Detection Model- Using 3 Ensemble Models
2+
3+
## Table of Contents
4+
- [Overview](#overview)
5+
- [Use Case](#use-case)
6+
- [Benefits](#benefits)
7+
- [Dataset](#dataset)
8+
- [Models Used](#models-used)
9+
- [Usage](#usage)
10+
- [Model Performance Metrics](#model-performance-metrics)
11+
- [Future Work](#future-work)
12+
- [Contribution Guidelines](#contribution-guidelines)
13+
- [Contact Information](#contact-information)
14+
15+
## Overview
16+
17+
This project aims to develop a diabetes detection system using the Pima Indians Diabetes dataset. It leverages three ensemble models: Stacking, Soft Voting, and Hard-Soft Voting to improve the accuracy and reliability of predictions.
18+
19+
## Use Case
20+
21+
The implemented feature enhances the project by:
22+
- Improving predictive accuracy for diabetes detection.
23+
- Providing more reliable and robust predictions.
24+
- Facilitating better decision-making in healthcare applications.
25+
26+
## Benefits
27+
28+
The benefits of this feature include:
29+
- Enhanced model robustness through ensemble learning.
30+
- Increased reliability in diabetes detection.
31+
- Contribution to improved patient outcomes in healthcare applications.
32+
- A valuable resource for the data science and machine learning community.
33+
34+
## Dataset
35+
36+
The project utilizes the Pima Indians Diabetes dataset from Kaggle. Below are the paths to the dataset files:
37+
38+
- `kaggle/input/pima-indians-diabetes-database/diabetes.csv`
39+
- `kaggle/input/pimaindiansdiabetesdata/pima-indians-diabetes.data.csv`
40+
- `kaggle/input/diabetes-data-set/diabetes.csv`
41+
- `kaggle/input/pima-diabetes/pimaindians-diabetes.data.csv`
42+
- `kaggle/input/pimaindiansdiabetescsv/pima-indians-diabetes.csv`
43+
44+
## Models Used
45+
46+
1. **Stacking Model:** Combines multiple classifiers with a meta-learner for final predictions.
47+
2. **Soft Voting Model:** Averages the predicted probabilities from different models.
48+
3. **Hard-Soft Voting Model:** Integrates both hard voting (majority class) and soft voting (probability-based).
49+
50+
## Usage
51+
52+
1. Open the Jupyter notebook provided in this project.
53+
2. Load the dataset.
54+
3. Preprocess the data as needed.
55+
4. Train the ensemble models.
56+
5. Evaluate the model performance.
57+
58+
## Model Performance Metrics
59+
60+
The models are evaluated using the following metrics:
61+
- Accuracy
62+
- Precision
63+
- Recall
64+
- F1 Score
65+
66+
## Future Work
67+
68+
- Explore additional features for better model performance.
69+
- Implement more advanced ensemble techniques.
70+
- Conduct a comparative analysis with other machine learning algorithms.
71+
72+
## Contribution Guidelines
73+
74+
Contributions are welcome! Please follow these steps to contribute:
75+
1. Fork the repository.
76+
2. Create a new branch.
77+
3. Make your changes.
78+
4. Submit a pull request.
79+
80+
## Contact Information
81+
82+
For questions or feedback, please reach out to [email protected]
83+

Detection Models/MultiModel-Diabetes-Detection/diabetes-three-ensemble-models.ipynb

Lines changed: 7175 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)