This project aims to predict housing prices in Edmonton, Alberta, using various features such as property age, location, and zoning. It supports homeowners and investors in making informed property decisions through data analysis and machine learning models.
This analysis uses historical property data to identify trends and predict property values, helping homeowners find high-value neighborhoods and assisting investors with pricing insights for property investments.
- For Homeowners: Identify high-value neighborhoods suitable for long-term investment.
- For Investors: Forecast property prices based on critical features like location, zoning, and age.
Key steps included:
- Filtering to recent years (2019-2021) and focusing on residential zones.
- Conducting EDA to analyze complete records, targeting assessed value, garages, and build year.
- Engineering βAge of Propertyβ and excluding entries with missing data for location and property age.
- Dashboard: A Tableau dashboard visualizes property values, garage availability, build year, and zoning. View Dashboard
- Feature Importance: Property value showed correlations with features like garage presence, newer build years, and specific zoning types.
We tested three models to identify the best approach for accurate price predictions:
- Ridge Regression: Initial trials had low precision (RΒ² = 0.21).
- Random Forest Regressor: Achieved best results with RΒ² = 0.86 and a reduced Mean Squared Error.
- Gradient Boosting Regressor: Delivered stable performance with RΒ² = 0.72.
- The Random Forest Regressor model, with RΒ² = 0.86, provided the most accurate predictions, effectively identifying high-value properties.
To further enhance accuracy and insights:
- Integrate neighborhood crime and census data.
- Address missing values, especially for location and build year.
- Refine zoning categories to better target specific property types.
