Skip to content

MohdMudaseer/ml_project

Repository files navigation

🎯 Student Math Score Prediction - Machine Learning Project

This project aims to predict the math score of students based on various academic and demographic features. It showcases a full machine learning workflow, from data preprocessing and EDA to feature engineering and model evaluation.

📂 Dataset Description

Column Description
genderStudent's gender (male / female)
race_ethnicityStudent group based on race/ethnicity
parental_level_of_educationParent's highest education level
lunchType of lunch (standard / free/reduced)
test_preparation_courseCompletion of test preparation course
reading_scoreReading test score
writing_scoreWriting test score
math_scoreTarget variable - score in mathematics

📝 Sample Dataset

genderrace_ethnicityparental_level_of_educationlunch test_preparation_coursereading_scorewriting_scoremath_score
femalegroup Bbachelor's degreestandardnone727472
femalegroup Csome collegestandardcompleted908869
femalegroup Bmaster's degreestandardnone959390
malegroup Aassociate's degreefree/reducednone574447
malegroup Csome collegestandardnone787576

🎯 Project Objectives

  • 📊 Conduct Exploratory Data Analysis (EDA) to understand distributions and correlations.
  • 🧹 Preprocess data with encoding, scaling, and transformation techniques.
  • 🧠 Train and evaluate multiple regression models to predict math_score.
  • 📈 Evaluate models using R², MAE, and RMSE metrics.

🧪 Machine Learning Techniques Applied

  • Categorical Encoding: Label & One-Hot Encoding
  • Feature Engineering: Derived features & normalization
  • Multiple Regression Models: Tried various ML regressors
  • Evaluation Metrics: R² Score, MAE, RMSE

📈 Key Insights

  • 📚 Students who completed the test preparation course generally scored higher.
  • ✍️ Reading and writing scores are strongly correlated with math scores.
  • 🎓 Parental education and lunch type influence student performance.

💻 Technologies Used

  • Python (Jupyter Notebook)
  • Pandas, NumPy – Data manipulation
  • Matplotlib, Seaborn – Visualizations
  • Scikit-learn – Machine learning
  • Flask – Web app deployment (optional)

▶️ How to Run

  1. Clone the repository or download the notebook file.
  2. Install dependencies:
    pip install -r requirements.txt
  3. Run the notebook step-by-step in Jupyter.
  4. Optional: Start Flask app using:
    python app.py

📬 Feedback & Contributions

Feel free to fork the repo, contribute, or share your suggestions!

📜 License

This project is licensed under the MIT License.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages