Team 76 | IMI Big Data & AI Competition 2026
Project Aegis is an end-to-end Anti-Money Laundering (AML) solution. While traditional systems look at individual transactions in isolation, Aegis uses Graph Neural Networks to analyze the social and geographical "pockets" where financial crime thrives.
Tech used: Python, PyTorch Geometric, Streamlit, Llama 3.2 (via Ollama), Scikit-Learn.
Model: Inductive GraphSAGE
We implemented a GraphSAGE (SAmple and aggreGatE) architecture. Unlike transductive models such as standard Graph Convolutional Networks (GCNs), which can only score nodes present in the training graph, GraphSAGE learns an aggregation function over node neighbourhoods.
This allows our model to score "unseen" customers or new merchants in real-time without retraining the entire graph.
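The inductive step is easy to see in isolation: a SAGE layer combines a node's own features with an aggregate (here, a mean) of its neighbours' features, so the same learned weights apply to nodes the model has never seen. A minimal pure-Python sketch with toy one-dimensional features and made-up weights (not our trained model):

```python
def sage_mean_layer(node_feat, neighbor_feats, w_self, w_neigh):
    """One GraphSAGE-style layer with mean aggregation (toy, 1-D features).

    h_v = w_self * x_v + w_neigh * mean(x_u for u in N(v))

    Because the layer learns how to combine a node with its neighbourhood
    rather than a per-node embedding, the same weights can score a
    brand-new node without retraining.
    """
    agg = sum(neighbor_feats) / len(neighbor_feats) if neighbor_feats else 0.0
    return w_self * node_feat + w_neigh * agg

# An unseen customer with feature 2.0 and three known neighbours:
score = sage_mean_layer(2.0, [1.0, 3.0, 5.0], w_self=0.5, w_neigh=0.5)
```

In the real model this happens over learned weight matrices and multi-dimensional features (PyTorch Geometric's `SAGEConv`), but the inductive property comes from exactly this structure.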
We represented Customers, Locations, and Merchant Categories as nodes, with the over 6 million transactions connecting them as edges.
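As a sketch of that graph construction, each transaction contributes edges between its customer node and the location and merchant-category nodes it touches. The field names below are illustrative, not the competition dataset's actual schema:

```python
# Hypothetical transaction records (illustrative field names).
transactions = [
    {"customer": "C1", "merchant_cat": "es_travel", "location": "L9"},
    {"customer": "C1", "merchant_cat": "es_food",   "location": "L9"},
    {"customer": "C2", "merchant_cat": "es_travel", "location": "L4"},
]

# Each transaction links its customer to the location and the
# merchant-category nodes involved, forming the edge list.
edges = []
for tx in transactions:
    edges.append((tx["customer"], tx["location"]))
    edges.append((tx["customer"], tx["merchant_cat"]))
```

Customers who share locations or merchant categories end up two hops apart, which is the "pocket" structure the GNN aggregates over.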
With labeled fraud at only ~1% in the original dataset, we used two techniques:
Data Augmentation: We integrated the BankSim agent-based simulation dataset to boost our initial fraud signal to ~6%.
Iterative Pseudo-Labeling: The model was trained on high-confidence, initially labeled cases, then allowed to label the rest of the 6-million-transaction dataset as its confidence grew, effectively turning our unlabeled data into a massive training resource.
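The pseudo-labeling loop can be sketched with scikit-learn on synthetic data. A Random Forest stands in for the GNN here, and the sizes and confidence thresholds are illustrative, not the project's actual values:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in data; the real pipeline labels ~6M transactions.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
truth = (X[:, 0] + X[:, 1] > 0).astype(int)   # hidden "fraud" rule

y = np.full(len(X), -1)      # -1 = unlabeled
y[:30] = truth[:30]          # small labeled seed, like the initial ~1-6%

clf = RandomForestClassifier(n_estimators=50, random_state=0)
for _ in range(3):           # a few pseudo-labeling rounds
    mask = y != -1
    clf.fit(X[mask], y[mask])
    proba = clf.predict_proba(X)[:, 1]
    # Adopt only high-confidence predictions as labels for the next round.
    confident = ((proba > 0.9) | (proba < 0.1)) & ~mask
    y[confident] = (proba[confident] > 0.5).astype(int)
```

Each round the model retrains on everything it has labeled so far, so the labeled set grows outward from the high-confidence seed.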
Explainability: The "Shadow" Random Forest
To solve the "Black Box" problem of Graph Networks, we built a dual-layer explanation system:
Proxy Modeling: A Random Forest model was trained to mimic the GraphSAGE's decision-making process using processed customer features.
SHAP Analysis: We used Shapley values to extract exactly which features (e.g., "Transactions in last 24h" or "Cash %") triggered the flag.
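A self-contained sketch of the shadow-model idea: fit a Random Forest to mimic the risk model's scores, then rank features by their contribution. scikit-learn's impurity-based importances are used here as a dependency-free stand-in for SHAP, and the feature names and scores are synthetic:

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(1)
feature_names = ["tx_last_24h", "cash_pct", "avg_amount"]  # illustrative
X = rng.normal(size=(500, 3))

# Stand-in for GraphSAGE risk scores: the shadow model is fitted to the
# GNN's outputs, not to ground-truth fraud labels.
gnn_scores = 0.8 * X[:, 0] + 0.2 * X[:, 1] + rng.normal(scale=0.05, size=500)

shadow = RandomForestRegressor(n_estimators=100, random_state=0)
shadow.fit(X, gnn_scores)

# Rank features by how much the shadow model relies on them.
ranked = sorted(zip(feature_names, shadow.feature_importances_),
                key=lambda p: -p[1])
```

In the actual system, SHAP values replace `feature_importances_`, giving per-customer (not just global) attributions that can be fed to the LLM.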
LLM Narratives: These SHAP values were fed into Llama 3.2:3b to generate human-readable SAR (Suspicious Activity Report) narratives and connected to possible organization types identified in our AML Knowledge Library.
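A sketch of this step against Ollama's HTTP API. The prompt template and feature names are illustrative, not the project's actual template, and generating a narrative requires a running Ollama server:

```python
import json
import urllib.request

def build_sar_prompt(shap_items):
    """Turn (feature, shap_value) pairs into an LLM prompt.

    Prompt wording is illustrative only.
    """
    lines = [f"- {name}: contribution {val:+.2f}" for name, val in shap_items]
    return ("You are an AML analyst. Draft a short SAR narrative for a "
            "customer flagged by a risk model. Top contributing features:\n"
            + "\n".join(lines))

def generate_narrative(prompt, model="llama3.2:3b",
                       url="http://localhost:11434/api/generate"):
    """Call a local Ollama server (must be running for this to work)."""
    body = json.dumps({"model": model, "prompt": prompt,
                       "stream": False}).encode()
    req = urllib.request.Request(
        body and url, data=body,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

prompt = build_sar_prompt([("tx_last_24h", 0.41), ("cash_pct", 0.33)])
```

The SHAP attributions ground the narrative in the model's actual reasons for the flag, rather than letting the LLM speculate freely.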
Example: "Customer exhibits behavior consistent with Project Guardian (Synthetic Opioids) or Project Protect (Human Trafficking). Further investigation and analysis are required to confirm these findings and determine the specific risk profile."
1. Ensure you have Python 3.10 - 3.12. Run the setup script at the top of train.ipynb and confirm that all packages are installed correctly. If not, you may need to clear the existing versions to make torch, numpy, torch-scatter, etc. compatible with each other by running the following commands:
pip uninstall torch torch-scatter torch-sparse numpy scipy -y
pip cache purge
Then run the installation cell again.
2. Simply run the rest of the cells! A fresh model will be trained and the Streamlit web app will open at localhost:8502 (the console may report localhost:8501, but the app serves on 8502). You can also run streamlit run Home.py from the root directory of the project.
If you run the Streamlit app before train.ipynb, the model and necessary assets will be downloaded automatically. But that's not nearly as cool.
- Within the web app, navigate using the side panel on the left.
- The "Knowledge Library" tab is an interactive database of the AML/TF research we conducted, with sources.
- The "Run Model" tab lets you add transactions to a customer and see the risk score the model would output, with explanations in real time.
- The "Model Output" tab shows the top K highest-risk customers from the dataset with full graph visualizations, explanations, and options to view past transactions for human analysts to review. Decisions can be exported to CSV.
We evaluated the model using two primary metrics to account for the heavy class imbalance:
- ROC-AUC: 0.996. Measures the trade-off between true positives and false positives.
- PR-AUC: 0.986. Specifically measures success in catching criminals (recall) against the accuracy of flags (precision).
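Both metrics come straight from scikit-learn; a toy sketch with synthetic labels and scores (not our results):

```python
from sklearn.metrics import roc_auc_score, average_precision_score

# Toy labels/scores just to show the metric calls.
y_true  = [0, 0, 0, 0, 1, 0, 1, 1]
y_score = [0.1, 0.2, 0.15, 0.3, 0.9, 0.4, 0.8, 0.7]

roc = roc_auc_score(y_true, y_score)
pr  = average_precision_score(y_true, y_score)  # PR-AUC
```

With a ~1% positive rate, a model can reach high ROC-AUC while drowning analysts in false positives; PR-AUC penalizes exactly that, which is why we report both.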
Memory Efficiency: The models were trained on lab machines that have a 9.5GB disk quota. To address this, we implemented Gzip compression for all CSV assets and a custom file-flattening script to manage the 213MB model artifacts efficiently.
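The CSV compression piece needs only the standard library; a sketch of writing and reading a gzipped CSV without ever keeping an uncompressed copy on disk (the file name and rows are illustrative):

```python
import csv
import gzip
import os
import tempfile

rows = [["customer", "amount"], ["C1", "120.50"], ["C2", "18.00"]]
path = os.path.join(tempfile.gettempdir(), "transactions.csv.gz")

# Stream the CSV straight through gzip on write...
with gzip.open(path, "wt", newline="") as f:
    csv.writer(f).writerows(rows)

# ...and decompress transparently on read.
with gzip.open(path, "rt", newline="") as f:
    back = list(csv.reader(f))
```

pandas can read such files directly (`pd.read_csv(path, compression="gzip")`), so downstream code is unchanged.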
Inductive Inference: The Streamlit app runs inference on the CPU using pre-computed embeddings, making it fast even on low-spec hardware (i5/8GB RAM).
In AML, individual behaviour is easy to hide, while relationships are not. Creating a robust startup script is genuinely hard: getting things to work across different machines and Python versions took a lot of time and effort, especially with the torch-sparse library. Research is incredibly important; roughly 90% of our total competition time was spent looking at different approaches to the problem, watching YouTube videos, and gradually piecing an idea together.
