
Interactive Cancer Risk Predictor (ML & DL)

An interactive web application for breast cancer risk prediction using Machine Learning and Deep Learning models. Built with Streamlit and tracked using MLflow.

Features

  • Upload CSV files for batch predictions
  • Visualize model predictions with SHAP values
  • Adjust confidence thresholds for classification (see the sketch below)
  • Compare performance between ML and DL models
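Below is a minimal, hypothetical Streamlit sketch of how the CSV upload and confidence-threshold features might fit together; the model path, column handling, and widget labels are assumptions, not the app's actual code:

   import pickle
   import pandas as pd
   import streamlit as st

   # Load one of the pickled models (path assumed)
   with open("models/random_forest_model.pkl", "rb") as f:
       model = pickle.load(f)

   st.title("Breast Cancer Risk Predictor")

   # Confidence threshold used to turn predicted probabilities into class labels
   threshold = st.slider("Confidence threshold", 0.0, 1.0, 0.5)

   # Batch predictions from an uploaded CSV of feature columns
   uploaded = st.file_uploader("Upload a CSV file", type="csv")
   if uploaded is not None:
       features = pd.read_csv(uploaded)
       proba = model.predict_proba(features)[:, 1]
       features["risk_probability"] = proba
       features["predicted_class"] = (proba >= threshold).astype(int)
       st.dataframe(features)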

Project Structure

├── streamlit_app.py               # Streamlit web application
├── models/
│   ├── random_forest_model.pkl    # Random Forest model
│   ├── deep_learning_model.h5     # Deep Learning model
│   ├── stacking_model.pkl         # Stacking model
│   ├── bagging_model.pkl          # Bagging model
│   └── svm_model.pkl              # SVM model
├── mlruns/                        # MLflow tracking directory
├── Dockerfile                     # Docker configuration for containerization
├── requirements.txt               # Python dependencies
├── .gitignore                     # Git ignore file
└── README.md                      # Project documentation

Installation

  1. Clone the repository:

    git clone https://github.com/sivkri/interactive-cancer-risk-predictor-ml-dl.git
    cd interactive-cancer-risk-predictor-ml-dl
  2. Install dependencies:

    pip install -r requirements.txt

Usage

  1. Run the Streamlit application:
    streamlit run streamlit_app.py
  2. Open the provided local URL in your browser to interact with the application.

Docker Setup

To run the application in a Docker container:

  1. Build the Docker image (image names must be lowercase):
       docker build -t cancer-risk-predictor .
  2. Run the Docker container:
       docker run -p 8501:8501 cancer-risk-predictor
  3. Access the application at http://localhost:8501

MLflow Guide

MLflow is used to track experiments, log parameters and metrics, and save trained models.

🔹 1. Start the MLflow UI

   mlflow ui

This will launch the MLflow Tracking UI at http://localhost:5000.

🔹 2. Track a Model Run in Python

   import mlflow
   import mlflow.sklearn

   # rf_model is assumed to be a trained scikit-learn estimator (e.g. a fitted RandomForestClassifier)
   with mlflow.start_run():
       mlflow.log_param("model_type", "RandomForest")
       mlflow.log_param("n_estimators", 100)
       mlflow.log_metric("accuracy", 0.95)
       mlflow.sklearn.log_model(rf_model, "model")

This will:

  • Log hyperparameters (model_type, n_estimators)
  • Log a performance metric (accuracy)
  • Save the trained model as an artifact

🔹 3. View Your Runs in the MLflow UI

Visit http://localhost:5000 to explore:

  • Runs and experiments
  • Parameters and metrics
  • Downloadable models

🔹 4. Track Multiple Experiments

To create a named experiment:

   mlflow.set_experiment("CancerRiskPrediction")

To log to this experiment:

   with mlflow.start_run(run_name="RandomForest_Trial_1"): ...
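Putting the two snippets together, a minimal self-contained sketch might look like this (the parameter and metric values are placeholders):

   import mlflow

   mlflow.set_experiment("CancerRiskPrediction")

   with mlflow.start_run(run_name="RandomForest_Trial_1"):
       mlflow.log_param("n_estimators", 100)
       mlflow.log_metric("accuracy", 0.95)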

🔹 5. Log Custom Artifacts (like plots, confusion matrices)

   import matplotlib.pyplot as plt
   import mlflow

   plt.plot([0, 1], [0.5, 0.9])
   plt.savefig("plot.png")
   mlflow.log_artifact("plot.png")
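For a confusion matrix, one option is to build the figure with scikit-learn and log it directly via mlflow.log_figure; the labels below are placeholders, not project data:

   import mlflow
   import matplotlib.pyplot as plt
   from sklearn.metrics import ConfusionMatrixDisplay

   # Placeholder labels; substitute the test labels and model predictions
   y_true = [0, 1, 1, 0, 1]
   y_pred = [0, 1, 0, 0, 1]

   with mlflow.start_run(run_name="confusion_matrix_example"):
       disp = ConfusionMatrixDisplay.from_predictions(y_true, y_pred)
       mlflow.log_figure(disp.figure_, "confusion_matrix.png")
       plt.close(disp.figure_)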

MLflow Files

Experiment logs are stored under the mlruns/ directory. Each run contains its logged metrics, parameters, models, and artifacts.
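A logged model can also be loaded back programmatically by its run ID (the run ID and X_test below are placeholders; copy the actual ID from the MLflow UI):

   import mlflow.sklearn

   # Replace <run_id> with the run ID shown in the MLflow UI
   model = mlflow.sklearn.load_model("runs:/<run_id>/model")
   predictions = model.predict(X_test)  # X_test: a feature matrix matching the training columns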

About

Designed for reproducible research, real-time insights, and easy deployment.
