Commit af585cb

committed
included deployed link, video documentation and updated github ci-cd workflow content
1 parent 7b852a6 commit af585cb

11 files changed: +41 −22 lines

Readme.md

Lines changed: 37 additions & 15 deletions
@@ -1,7 +1,9 @@
  # Sales Conversion Optimization Project 📈

- # Table of Contents 📑
+ **Deployed Application: [Sales Conversion Optimisation Web App](https://sales-conversion-optimization-mlops-project.streamlit.app/)**
+

+ # Table of Contents 📑

  1. [Project Description](#project-description) 📝
  2. [Project Structure](#project-structure) 🏗️
@@ -25,6 +27,10 @@ I've structured this project to streamline the process from data ingestion and c

  This project aims to streamline your sales conversion process, providing insights and predictions to drive impactful business decisions! 📊✨

+ **Live Demo Walkthrough**:
+ [![Live Demo Walkthrough](https://img.youtube.com/vi/PfnZFzvqHFs/0.jpg)](https://www.youtube.com/watch?v=PfnZFzvqHFs)
+
+
  <a id="project-structure"></a>
  # Project Structure 🏗️

@@ -134,7 +140,7 @@ Here's how it flows:

  1. **ci-cd.py**: Triggered to initiate the CI/CD pipeline.
  2. **steps/production_batch_data**: Accesses production batch data from the Production_data folder.
- 3. **pipelines/ci_cd_pipeline.py**: As we already discussed earlier, we conduct Data Quality, Data Drift as previously we did, if threshold fails, email reports are sent.
+ 3. **pipelines/ci_cd_pipeline.py**: Runs the Data Quality, Data Stability, Data Drift, and Model Performance validation tests described earlier; if a threshold fails, email reports are sent.
  4. **steps/predict_production_Data.py**: Uses the pre-trained best model to make predictions on new production data, then runs Model Performance validation as before; if the threshold fails, email reports are sent.

  This pipeline is crucial for maintaining a continuous and reliable deployment process. 🔁✨
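To make the threshold-then-email pattern concrete, here is a minimal, self-contained sketch. This is an illustration only: it uses a toy mean-shift drift metric and stdlib smtplib, while the actual pipeline's checks, step names, and SMTP settings may differ.

```python
import smtplib
from email.message import EmailMessage
from statistics import mean, stdev

def drift_exceeds_threshold(reference, current, threshold=2.0):
    """Flag drift when the current batch mean shifts more than
    `threshold` reference standard deviations (a toy stand-in for
    the pipeline's real data-drift tests)."""
    ref_mean, ref_std = mean(reference), stdev(reference)
    shift = abs(mean(current) - ref_mean) / (ref_std or 1.0)
    return shift > threshold

def send_email_report(subject, body, sender, recipient, password):
    """Send a plain-text failure report, as the pipeline does when a
    validation threshold fails (SMTP host here is a placeholder)."""
    msg = EmailMessage()
    msg["Subject"], msg["From"], msg["To"] = subject, sender, recipient
    msg.set_content(body)
    with smtplib.SMTP_SSL("smtp.gmail.com", 465) as server:
        server.login(sender, password)
        server.send_message(msg)

if __name__ == "__main__":
    reference = [1.0, 1.2, 0.9, 1.1, 1.0]   # training-time batch
    current = [3.0, 3.2, 2.9, 3.1]          # new production batch
    if drift_exceeds_threshold(reference, current):
        print("drift detected: an email report would be sent here")
```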
@@ -208,8 +214,8 @@ This app streamlines the process of making predictions, interpreting model outpu

  ## Interpretability Section
  - 📝 **Detailed Interpretability Report**: View global interpretability metrics.
- - 🌐 **SHAP Global Plot**: Explore SHAP values at a global level.
- - 🌍 **SHAP Local Plot**: Visualize SHAP values for user-input data.
+ - 🌐 **SHAP Global Plot**: Visualize SHAP values at a global level.
+ - 🌍 **SHAP Local Plot**: Visualize SHAP values for the user-input data in the Prediction App.

  ![SHAP Report:](assets/shap_local_plot.PNG)

@@ -275,8 +281,6 @@ This application provides an intuitive interface for users to make predictions a

  <a id="neptune.ai-dashboard"></a>
  # Neptune.ai Dashboard 🌊
- ## Utilising the Power of Neptune.ai for Enhanced Insights and Management 🚀
-
  Neptune.ai offers an intuitive dashboard for comprehensive tracking and management of experiments, model metrics, and pipeline performance. Let's dive into its features:

  1. **Visual Metrics**: Visualize model performance metrics with interactive charts and graphs for seamless analysis. 📈📊
@@ -364,23 +368,41 @@ Docker is an essential tool for packaging and distributing applications. Here's


  <a id="github-actions"></a>
- # GitHub Actions Workflow and Continuous Machine Learning (CML) Reports 📊
+ # GitHub Actions and CML Reports 📊
+
+ My project integrates GitHub Actions for Continuous Integration and Continuous Deployment (CI/CD), automating testing and deployment whenever changes are pushed to the repository.
+
+ ## Workflow Overview 🔍
+ The CI/CD pipeline runs automatically on every code push, performing the following steps:
+
+ ### Environment Setup 🛠️
+
+ 1. Checks out the latest code
+ 2. Installs all dependencies from requirements.txt
+
+ ### ZenML Configuration 📊
+
+ 1. Registers the Neptune experiment tracker
+ 2. Creates and sets the ZenML stack with Neptune integration
+
+ ### Pipeline Execution 🚀
+
+ 1. Runs the CI/CD pipeline script with secure environment variables
+ 2. Handles sensitive information (email password, API tokens) using GitHub Secrets
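The workflow steps above could be expressed in a GitHub Actions file roughly like this. This is a hedged sketch, not the repository's actual workflow: the file name, stack and tracker names, Python version, and secret names are assumptions.

```yaml
name: ci-cd

on: [push]

jobs:
  ci-cd:
    runs-on: ubuntu-latest
    steps:
      # Environment setup: checkout + dependencies
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.10"
      - name: Install dependencies
        run: pip install -r requirements.txt

      # ZenML configuration: Neptune tracker + stack (names are assumptions)
      - name: Configure ZenML stack with Neptune
        run: |
          zenml experiment-tracker register neptune_tracker --flavor=neptune
          zenml stack register neptune_stack -o default -a default -e neptune_tracker --set

      # Pipeline execution with secrets passed as environment variables
      - name: Run CI/CD pipeline
        env:
          NEPTUNE_API_TOKEN: ${{ secrets.NEPTUNE_API_TOKEN }}
          EMAIL_PASSWORD: ${{ secrets.EMAIL_PASSWORD }}
        run: python ci-cd.py
```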
- ## CML Reports Integration 🚀

- 🎯 Predictions Scatter Plot: Visualizes model predictions against actual conversions.
- 📈 Residuals Plot: Illustrates the differences between predicted and actual values.
+ ### CML Reporting 📈

- ## GitHub Actions Workflow 🛠️
+ 1. Generates visual reports using Continuous Machine Learning (CML)
+ 2. Creates prediction scatter plots (model predictions against actual conversions) and residuals plots (differences between predicted and actual values)
+ 3. Publishes results as comments directly in GitHub

- Integrated into CI/CD pipeline:
- - Automatic generation on every push event.
- - Visual insights available directly in the repository.

  ![Predictions Scatter Plot](CML_Reports/predictions_scatter_plot.png)
  ![Residuals Plot](CML_Reports/residuals_plot.png)
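The two report images could be produced by a helper along these lines. This is a sketch assuming matplotlib; the repository's actual plotting code, figure styling, and file layout may differ.

```python
import os
import matplotlib
matplotlib.use("Agg")  # headless backend, suitable for CI runners
import matplotlib.pyplot as plt

def save_cml_plots(y_true, y_pred, out_dir="CML_Reports"):
    """Save the predictions scatter plot and residuals plot that CML
    later publishes as a commit comment; returns the residuals."""
    os.makedirs(out_dir, exist_ok=True)
    residuals = [t - p for t, p in zip(y_true, y_pred)]

    # Predictions vs. actual conversions
    plt.figure()
    plt.scatter(y_true, y_pred)
    plt.xlabel("Actual conversions")
    plt.ylabel("Predicted conversions")
    plt.title("Predictions Scatter Plot")
    plt.savefig(os.path.join(out_dir, "predictions_scatter_plot.png"))
    plt.close()

    # Residuals (actual - predicted) vs. predictions
    plt.figure()
    plt.scatter(y_pred, residuals)
    plt.axhline(0.0)
    plt.xlabel("Predicted conversions")
    plt.ylabel("Residual (actual - predicted)")
    plt.title("Residuals Plot")
    plt.savefig(os.path.join(out_dir, "residuals_plot.png"))
    plt.close()
    return residuals
```

In the workflow, a CML command such as `cml comment create` can then publish a markdown report embedding these images as a commit comment.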
- 🌟 These reports enhance transparency and provide crucial insights into model performance! 🌟
+ 🌟 This CI/CD approach eliminates manual testing and deployment steps, and the CML reports provide transparent, visual feedback on model performance! 🌟


  <a id="running-the-project"></a>
Binary file changed (−48 Bytes, not shown)

run_pipeline.py

Lines changed: 4 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,11 @@
  from pipelines.training_pipeline import train_pipeline

- def run_training_pipeline(url: str):
+ def run_training_pipeline(path: str):
      """
      Runs the training pipeline.
-
-     Args:
-         url (str): URL of the dataset to be used for training.
      """
-     train_pipeline(url)
+     train_pipeline(path)

  if __name__ == "__main__":
-     dataset_url = "https://sale2.s3.us-east-2.amazonaws.com/KAG_conversion_data.csv"
-     run_training_pipeline(dataset_url)
+     dataset_path = "data/KAG_conversion_data.csv"
+     run_training_pipeline(dataset_path)
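The switch from the S3 URL to a local path suggests the ingestion step now reads the CSV from disk. A minimal sketch of such a loader, assuming pandas; `ingest_data` is an illustrative name, and the real step inside the training pipeline may differ:

```python
import pandas as pd

def ingest_data(path: str) -> pd.DataFrame:
    """Load the sales-conversion dataset from a local CSV file.
    (pandas.read_csv accepts either a local path or a URL, so the
    old S3 link would also have worked through this same call.)"""
    return pd.read_csv(path)
```

Usage would mirror the updated entry point, e.g. `df = ingest_data("data/KAG_conversion_data.csv")`.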
Binary files changed (not shown): −12 Bytes, −39 Bytes, −9 Bytes, +641 Bytes, −1.22 KB, +2.04 KB, −610 Bytes
