Welcome to the Loan Approval Predictions project! Below you'll find essential information on how to navigate through the project, understand the data, and utilize the predictive model for determining loan eligibility.
The dataset utilized for this project was obtained from Kaggle. It comprises 614 entries with 13 columns, encompassing various attributes related to loan applicants. Here's a breakdown of the columns:
- Loan_ID: Unique identifier for each loan application
- Gender: Gender of the applicant
- Married: Marital status of the applicant
- Dependents: Number of dependents the applicant has
- Education: Education level of the applicant
- Self_Employed: Whether the applicant is self-employed or not
- ApplicantIncome: Income of the applicant
- CoapplicantIncome: Income of the co-applicant
- LoanAmount: Amount of loan applied for
- Loan_Amount_Term: Term of the loan in Days
- Credit_History: Credit history of the applicant
- Property_Area: Type of property area (Rural,Urban & SubUrban)
- Loan_Status: Status of the loan (target variable)
To prepare the data for modeling, the following steps were undertaken:
- Data splitting: The dataset was divided into training and testing sets.
- Encoding and Scaling: Categorical data were encoded, and numerical features were scaled using standard scaler.
- Model Selection: Gradient Boosting was chosen as the base model for its predictive performance.
- Cross-validation and Hyperparameter Tuning: Grid search was performed to optimize model hyperparameters for better performance.
The final model was saved using a Pipeline, encompassing data preprocessing steps and the trained classifier. Additionally, ShapAnalysis was conducted to provide insights into feature importance for model explainability.
The model was deployed using Gradio, enabling users to interactively input applicant details and receive predictions regarding loan eligibility. However, currently, the deployment requires cloning the repository to your local system. Click here to access the deployed model.
Below is a screenshot of the web interface:
The project remains open for further development, particularly in the following areas:
- Data Enrichment: Sourcing real-world data that closely mirrors loan application scenarios to enhance model robustness.
- Deployment Enhancement: Implementing deployment methods that eliminate the need for local repository cloning, making the model accessible via a web interface without additional steps.
Feel free to contribute or explore these avenues for further improving the project's utility and effectiveness.
Thank you for your interest in Loan Approval Predictions!
