Credit risk scoring model using XGBoost on the German Credit Dataset, with SHAP explanations and fairness checks.
Predicts loan default probability and converts it to a traditional credit score (300–850 range). The model also generates adverse action reasons — telling the applicant why they were rejected, which lenders are legally required to do (ECOA, GDPR Article 22).
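The probability-to-score conversion follows the standard scorecard log-odds scaling. A minimal sketch, assuming illustrative calibration choices (score 600 at 30:1 good:bad odds, 20 "points to double the odds") — the project's actual anchor points may differ:

```python
import math

def pd_to_score(p_default, base_score=600, base_odds=30, pdo=20,
                floor=300, cap=850):
    """Map a default probability onto a 300-850 scorecard range.

    Log-odds scaling: a score of `base_score` corresponds to
    `base_odds`:1 good:bad odds, and every `pdo` points the
    odds double ("points to double odds"). All three values
    are hypothetical calibration choices for this sketch.
    """
    factor = pdo / math.log(2)
    offset = base_score - factor * math.log(base_odds)
    odds = (1 - p_default) / p_default  # good:bad odds
    score = offset + factor * math.log(odds)
    return int(min(cap, max(floor, round(score))))
```

Lower default probability maps to a higher score, clamped to the 300–850 band.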
German Credit Dataset from UCI — 1000 loan applications, 20 features, binary outcome. Not huge, but it's a standard benchmark and the categorical encoding is a pain to deal with (everything is coded as A11, A12, etc).
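Decoding those A-codes means hand-building a lookup per attribute from the UCI documentation. A sketch for the checking-account attribute (the raw `german.data` file has no header row, so the column name `checking_status` is a label this sketch assigns, not something in the file):

```python
import pandas as pd

# A-codes for attribute 1 (status of existing checking account),
# per the UCI dataset documentation.
CHECKING_STATUS = {
    "A11": "lt_0_dm",
    "A12": "0_to_200_dm",
    "A13": "ge_200_dm",
    "A14": "no_checking_account",
}

def decode_checking(df: pd.DataFrame) -> pd.DataFrame:
    """Replace raw A-codes with readable category labels."""
    out = df.copy()
    out["checking_status"] = out["checking_status"].map(CHECKING_STATUS)
    return out

raw = pd.DataFrame({"checking_status": ["A11", "A14"]})
print(decode_checking(raw)["checking_status"].tolist())
# the same pattern repeats for the other 12 categorical attributes
```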
| Metric | Score |
|---|---|
| AUC-ROC | 0.78 |
| Gini | 0.56 |
| KS | 0.46 |
Not amazing, but reasonable for 1000 samples with no heavy tuning.
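The three metrics are tightly related — Gini is just 2·AUC − 1 (2·0.78 − 1 = 0.56, consistent with the table), and KS is the maximum gap between the cumulative score distributions of the two classes. A pure-Python sketch (ignores ties in scores, which sklearn handles properly):

```python
def auc_gini_ks(y_true, y_prob):
    """Compute AUC-ROC, Gini, and KS from labels (1 = default) and scores.

    AUC via the Mann-Whitney rank formulation; Gini = 2*AUC - 1;
    KS = max |F_bad(t) - F_good(t)| over score thresholds t.
    """
    pairs = sorted(zip(y_prob, y_true))
    pos = sum(y_true)
    neg = len(y_true) - pos
    # Mann-Whitney U: count (good, bad) pairs ranked correctly
    seen_neg, u = 0, 0.0
    for _, y in pairs:
        if y == 0:
            seen_neg += 1
        else:
            u += seen_neg
    auc = u / (pos * neg)
    # KS: walk the sorted scores, tracking both cumulative distributions
    cum_pos = cum_neg = 0
    ks = 0.0
    for _, y in pairs:
        if y == 1:
            cum_pos += 1
        else:
            cum_neg += 1
        ks = max(ks, abs(cum_pos / pos - cum_neg / neg))
    return auc, 2 * auc - 1, ks
```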
```
pip install -r requirements.txt
python main.py
```

Or run the dashboard:

```
streamlit run streamlit_app/app.py
```

- `src/data/` — loading + preprocessing the German Credit data
- `src/features/` — feature engineering (debt ratios, stability scores, etc.)
- `src/models/` — XGBoost trainer + scorecard conversion
- `src/explainability/` — SHAP explanations + Fairlearn fairness audit
- `streamlit_app/` — interactive dashboard
- `tests/` — unit tests
Each prediction comes with SHAP values showing which features pushed the score up or down. There's also an adverse action module that maps SHAP contributions to human-readable denial reasons (e.g. "Insufficient checking account history").
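The SHAP-to-reason mapping can be sketched like this. Feature names and reason texts below are illustrative, not the project's actual mapping, and the sketch assumes the convention that a positive SHAP value pushes predicted default probability up (i.e., lowers the score):

```python
# Hypothetical reason-code table: feature -> adverse action text.
REASON_CODES = {
    "checking_status": "Insufficient checking account history",
    "credit_history": "Delinquency on prior obligations",
    "duration_months": "Loan term too long relative to profile",
    "savings_status": "Insufficient savings reserves",
}

def adverse_action_reasons(shap_contrib: dict, top_n: int = 3) -> list:
    """Reasons for the top_n features that most increased default risk."""
    harmful = [(f, v) for f, v in shap_contrib.items() if v > 0]
    harmful.sort(key=lambda fv: fv[1], reverse=True)
    return [REASON_CODES.get(f, f"Unfavorable value for {f}")
            for f, _ in harmful[:top_n]]

print(adverse_action_reasons(
    {"checking_status": 0.42, "savings_status": 0.10, "age": -0.05}))
```

Only score-lowering contributions become reasons; features that helped the applicant (negative SHAP) are excluded.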
The model gets audited across age groups and gender using Fairlearn. The audit checks the four-fifths rule (80% rule) and demographic parity. On this dataset the model passes, but barely — the age-group disparity sits close to the threshold.
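The project uses Fairlearn for the audit itself; as a plain-Python illustration of what the four-fifths rule actually computes (assuming the convention that a prediction of 0 means "approved"):

```python
def four_fifths_check(y_pred, groups, threshold=0.8):
    """Disparate-impact (80% rule) check.

    Every group's approval rate must be at least `threshold` times
    the highest group's approval rate. Convention assumed here:
    y_pred == 0 means "approved" (no predicted default).
    """
    rates = {}
    for g in set(groups):
        idx = [i for i, gg in enumerate(groups) if gg == g]
        rates[g] = sum(1 for i in idx if y_pred[i] == 0) / len(idx)
    best = max(rates.values())
    ratio = min(rates.values()) / best if best else 0.0
    return rates, ratio, ratio >= threshold
```

A ratio just above 0.8 is exactly the "passes, but barely" situation described above.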
- The German Credit dataset is from 1994 and uses Deutsche Marks. Would be better with more recent data
- I should have tried WoE (Weight of Evidence) binning — that's what actual banks use for scorecards
- The feature engineering is manual. Could try automated feature selection with Boruta or similar
- Hyperparameter tuning is basically default XGBoost params with minor tweaks
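For the WoE point above: Weight of Evidence scores each bin by ln(share of goods / share of bads), which is what makes scorecard points linear in log-odds. A hypothetical sketch (the bin definitions are illustrative; real scorecards also tune the binning itself):

```python
import math

def woe_table(values, y, bins):
    """Weight of Evidence per bin: ln(dist_good / dist_bad).

    `bins` is a list of (label, predicate) pairs; y == 1 marks a
    default ("bad"). Add-0.5 smoothing keeps empty cells from
    blowing up the log.
    """
    total_good = sum(1 for t in y if t == 0)
    total_bad = sum(1 for t in y if t == 1)
    table = {}
    for label, pred in bins:
        good = sum(1 for v, t in zip(values, y) if pred(v) and t == 0)
        bad = sum(1 for v, t in zip(values, y) if pred(v) and t == 1)
        woe = math.log(((good + 0.5) / total_good) /
                       ((bad + 0.5) / total_bad))
        table[label] = woe
    return table
```

Positive WoE marks a bin dominated by goods, negative WoE one dominated by bads; a scorecard then assigns points proportional to each bin's WoE.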
MIT