This project implements various machine learning classification approaches to predict heart disease using a heart disease dataset.
The project implements and compares five different classification approaches:
- Logistic Regression
- Decision Tree
- Random Forest
- Stochastic Gradient Descent (SGD)
- Support Vector Machine (SVM)
- Dataset preprocessing and cleaning
- Implementation of multiple classification algorithms
- Hyperparameter tuning using GridSearchCV
- Feature importance analysis and feature selection experiments
- Model performance comparison and evaluation
heart.csv: The dataset used for analysisEDA_Data_preeprocessing.ipynb: Jupyter notebook containing all analysis and model implementationspyproject.toml: Project dependencies and configuration
- Python 3.x
- Required packages are listed in pyproject.toml
- Dataset description and feature explanation
- Data preprocessing and cleaning
- Model implementation and training
- Hyperparameter tuning using GridSearchCV
- Feature selection analysis and model re-evaluation