Skip to content

ravicoder01/house-price-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🏠 House Price Prediction using Linear Regression

This project predicts the sale price of houses based on features like living area, overall quality, garage details, etc., using the Kaggle House Prices Dataset.


πŸ“‚ Dataset Overview

  • train.csv: Training data (1460 rows, 81 columns)
  • test.csv: Test data without SalePrice
  • sample_submission.csv: Sample output format for submission
  • Features include:
    • GrLivArea: Above-ground living area (sq ft)
    • OverallQual: Overall material and finish quality
    • GarageCars: Number of cars in garage
    • GarageArea: Size of garage in sq ft
    • And many more...

πŸ“Š Techniques Used

  • βœ… Data cleaning (dropna())
  • βœ… Exploratory Data Analysis (EDA)
  • βœ… Outlier Detection & Removal using scatterplots
  • βœ… Feature Selection using correlation analysis
  • βœ… Scaling with StandardScaler
  • βœ… Linear Regression model building
  • βœ… Custom user input-based prediction using .predict()

πŸ” Model Evaluation

  • RMSE: 42682.00
  • RΒ² Score: 0.76

Initially, accuracy was ~60%, which improved to 76% after:

  • Better feature selection
  • Removing outliers
  • Applying scaling

πŸ”’ Input Prediction Feature

The model allows you to input:

  • Living Area
  • Overall Quality
  • Garage Info
    ➑️ and gives you a predicted house price!
🏠 Estimated Price: β‚Ή 247337.64
# house-price-prediction

πŸš€ Future Improvements
Try different regression models: Ridge, Lasso, Random Forest

Add cross-validation

Perform hyperparameter tuning

πŸ“¦ Requirements
Install using:

bash
Copy
Edit
pip install -r requirements.txt
🧠 Learning Outcome
This was my first end-to-end ML regression project where I applied:

EDA

Feature Engineering

Visualization

Modeling

Evaluation

It boosted my confidence to apply what I learned into something real.

πŸ“Œ Project Author
πŸ‘¨β€πŸ’» Ravi Roy
B.Tech CSE, IILM University
Core IIC Club Member, Ex-Intern @ Oasis Infobyte, YBI Foundation, CodeAlpha
Passionate about AI/ML & solving real-world problems 🌱

πŸ”— Kaggle Dataset
https://www.kaggle.com/competitions/house-prices-advanced-regression-techniques
🏑 House Prices - Advanced Regression Techniques

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published