Skip to content

Conversation

darksapien23151
Copy link

@darksapien23151 darksapien23151 commented Jun 10, 2025

🛠 Summary

GSOC Contribution for the project - Optimized serving endpoint for classical Machine Learning models.
Current status: Finished working on Logistic Regression and KMeans.

Mentors:
Zeglarski, Milosz
Trawinski, Dariusz

@mzegla mzegla self-assigned this Jun 10, 2025
@mzegla mzegla added the GSoC Contributions that are part of Google Summer of Code projects label Jun 10, 2025
@@ -0,0 +1,2 @@
SepalLength,SepalWidth,PetalLength,PetalWidth
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess we don't need that in the repo since we have an instruction on preparing the data.

@@ -0,0 +1,508 @@
SepalLength,SepalWidth,PetalLength,PetalWidth,Species
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess we don't need that in the repo since we have an instruction on preparing the data.


output_dir = sys.argv[1]

#os.makedirs(output_dir, exist_ok=True)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's remove the comment

@mzegla mzegla requested a review from Copilot September 1, 2025 09:19
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces a complete machine learning pipeline project for the Iris dataset using OpenVINO Model Server (OVMS) with MediaPipe. The project supports both logistic regression and k-means clustering models with training and inference capabilities through client scripts.

Key changes:

  • Complete pipeline implementation with PyTorch logistic regression and scikit-learn k-means models
  • Client scripts for training and inference with flexible parameter handling
  • Docker-based deployment with Intel optimization libraries
  • Data preprocessing utilities and example datasets

Reviewed Changes

Copilot reviewed 15 out of 15 changed files in this pull request and generated 5 comments.

Show a summary per file
File Description
extras/iris_pipeline_project/pipeline/ovmsmodel.py Core OVMS Python model implementing training/inference logic for both model types
extras/iris_pipeline_project/pipeline/model.py Abstract model classes with PyTorch logistic regression and scikit-learn k-means implementations
extras/iris_pipeline_project/pipeline/graph.pbtxt MediaPipe graph configuration for the pipeline
extras/iris_pipeline_project/model_config.json OVMS model configuration
extras/iris_pipeline_project/labelmap.json Label mapping for Iris species
extras/iris_pipeline_project/kmeans_param.json K-means hyperparameters
extras/iris_pipeline_project/hyperparams.json Logistic regression hyperparameters
extras/iris_pipeline_project/data_preprocess.py Data preprocessing script for Iris dataset
extras/iris_pipeline_project/data_folder/iris_train.csv Training dataset
extras/iris_pipeline_project/data_folder/iris_test.csv Test dataset
extras/iris_pipeline_project/client/client_train.py Client script for training models
extras/iris_pipeline_project/client/client_inference.py Client script for inference
extras/iris_pipeline_project/README.md Project documentation
extras/iris_pipeline_project/Dockerfile Docker configuration with Intel optimizations
extras/iris_pipeline_project/.gitignore Git ignore file

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

import sys

if len(sys.argv) < 2:
print("Usage: python datapreprocess.py <output_directory>")
Copy link
Preview

Copilot AI Sep 1, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The filename in the usage message should match the actual filename: 'data_preprocess.py' instead of 'datapreprocess.py'.

Suggested change
print("Usage: python datapreprocess.py <output_directory>")
print("Usage: python data_preprocess.py <output_directory>")

Copilot uses AI. Check for mistakes.

@darksapien23151 darksapien23151 changed the title Logistic Regression-patch2 Optimized serving endpoint for classical ML models Sep 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
GSoC Contributions that are part of Google Summer of Code projects
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants