Predicting-product-tier-Classification-

This example demonstrates the prediction of the product tier of cars sold on a website from the information contained in the columns of the data 'Items_Cars_Data.csv'. The file 'Data_description.csv' describes the columns. The entire chain of model development: data loading, derivation of new features, exploratory data analysis, preparation of data for training, model building, cross-validation, hyperparameters tuning, learning curves analysis, evaluation on the hold-out set (test data set), and feature importance analysis was covered. The algorithms include Logistic regression and Random forest. The prediction is a case of imbalanced classes. The imbalance was addressed by balancing the classes. This resulted in a significant improvement in learning and generalization, leading to more balanced accuracy and a higher F1 macro score, as well as a higher number of true positives for minority classes, as shown by the confusion matrix.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Boxplot_features.py		Boxplot_features.py
Data_Description.csv		Data_Description.csv
Encding_categorical_features.py		Encding_categorical_features.py
Items_Cars_Data.csv		Items_Cars_Data.csv
Learning_curves.py		Learning_curves.py
Permutation_feature_importance_plot.py		Permutation_feature_importance_plot.py
Plot_data_distribution.py		Plot_data_distribution.py
Predicting_product_tier_.ipynb		Predicting_product_tier_.ipynb
README.md		README.md
predicting_product_tier.py		predicting_product_tier.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting-product-tier-Classification-

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Predicting-product-tier-Classification-

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages