Titanic ML is a machine learning project aimed at predicting the survival of passengers on the Titanic using various machine learning models. The project includes data ingestion, transformation, model training, and evaluation.
To set up the project locally, follow these steps:
- Clone the repository:
git clone https://github.com/8BitButter/Titanic---Machine-Learning-from-Disaster
- Navigate to the project directory:
cd Titanic_Project
- Install the required dependencies:
pip install -r requirements.txt
To use the project, follow these steps:
-
Ensure you have a MySQL database set up with the necessary data.
-
Create a
.env
file in the root directory with the following content:host=your_host user=your_user password=your_password db=your_database
- Data ingestion from a MySQL database.
- Data transformation using custom pipelines.
- Model training with various algorithms including Random Forest, Decision Tree, Gradient Boosting,Linear Regression, XGBoost, CatBoost, and AdaBoost.
- Model evaluation and selection based on performance metrics.
- Saving the best model for future predictions.