Skip to content

Ranketh2004/FDM-Mini-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

FDM-Mini-Project

Fundamentals of Data Mining Project

  1. Developed an end-to-end machine learning solution for predicting automobile loan defaults.
  2. Performed comprehensive data preprocessing techniques, including null value handling, dimensionality reduction, outlier treatment, class unbalancing, filling missing data, removing duplicates, and data
    normalization. Conducted data visualization to uncover key insights and trends in the dataset, aiding in feature selection and understanding data distributions. 3)Trained six machine learning models: Random Forest, CatBoost, LightGBM, GaussianNB, Logistic Regression, and XGBoost.
  3. Selected Random Forest as the best-performing model after extensive hyperparameter tuning using randomised search.
  4. Achieved improved prediction accuracy and addressed key challenges related to imbalanced data.

** We applied 3 XAI techniques to Random Forest Classifier, our final model, to gain insight into how the model understands the data and how each feature influences the model's output. They are Feature importance, Permutation Feature importance and Partial Dependancy plots

Libraries -: scikit-learn, pandas, numpy, seaborn, matplotlib, streamlit

Deployment link -: https://automobile-loan-default-prediction-system-xujqfzrxxmbapkvhz3hu.streamlit.app/

About

Automobile Loan Default Prediction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published