This project builds a Machine Learning model to predict whether a tumor is malignant or benign using the Breast Cancer Wisconsin Dataset. It also includes a Plotly-based interactive dashboard for data exploration and visual insights.
- 🔍 Exploratory Data Analysis (EDA)
- 🧪 Logistic Regression Classifier
- 📈 ROC Curve, Confusion Matrix, and Classification Report
- 🖼️ Plotly/Dash Dashboard for interactive visualization
- 💾 Dataset:
Breast Cancer Wisconsin (Diagnostic)on Kaggle
Cancer-Detection/
│
├── app.py # Plotly Dashboard (Dash App)
├── playground.ipynb # Colab/EDA Notebook
├── model.pkl # Trained ML model
├── cancer\_data.csv # Dataset file
├── requirements.txt # All dependencies
└── README.md # Project info
- Data Preprocessing
- Handling missing values
- Feature scaling (StandardScaler)
- Model Training
- Logistic Regression
- Evaluation
- Accuracy, Confusion Matrix, ROC-AUC
- Dashboard
- Visualize predictions interactively
- Confusion Matrix with Heatmap
- ROC Curve
- Scatter Matrix & Feature Correlations
- Live Prediction Dashboard
Install all required libraries:
pip install -r requirements.txtOr manually install:
pip install pandas scikit-learn matplotlib seaborn plotly dashjupyter notebook playground.ipynbpython app.pyIt will start a server on http://127.0.0.1:8050.
Siddhant 📧 [shaurya.sid1729@gmail.com] 🔗 GitHub
If you like this project, consider ⭐ starring the repo!
---
Would you like me to:
- Include the **Streamlit** version?
- Add a **badge section** (like GitHub stars, license, etc.)?
- Generate a `requirements.txt` for you?
Let me know!


