A production-structured pipeline for detecting abnormal behavioral and risk patterns in historical data, with automated natural-language report generation via local LLMs (DeepSeek/Qwen via Ollama) and RAG-based contextual enrichment.
Qwen is recommended for memory-constrained machines: it is small and reliable. DeepSeek is the better choice when memory is not a limitation.
```
anomaly-risk-system/
├── config.py                  # Central configuration (paths, model, thresholds)
│
├── detection/
│   ├── detector.py            # MultiClassGaussianAnomalyDetector
│   └── features.py            # Feature engineering utilities
│
├── evaluation/
│   └── metrics.py             # AUC, AP, FPR, cross-val, per-class breakdown
│
├── reporting/
│   ├── knowledge_base.py      # FAISS RAG knowledge base
│   ├── prompts.py             # LangChain prompt templates
│   └── generator.py           # AnomalyReportGenerator (LLM + RAG)
│
├── models/                    # Saved detector .joblib files
├── knowledge_base/            # Persisted FAISS index
├── reports/                   # Output JSON reports
│
└── notebook/
    └── main_pipeline.ipynb    # Orchestration notebook
```
```bash
pip install -r requirements.txt
ollama serve
ollama pull deepseek-r1:8b   # or: ollama pull qwen2.5:1.5b
```

Open `notebook/main_pipeline.ipynb` and run cells top to bottom.
The notebook covers:
- Feature engineering
- Detector training + cross-validation
- ROC-AUC / Precision-Recall / FPR evaluation
- RAG knowledge base construction
- Automated report generation
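The detector training step can be sketched as follows. This is a self-contained toy version assuming the design choices listed below (diagonal Mahalanobis distance, robust median/MAD estimators); the class name and methods are hypothetical and do not mirror the repo's `MultiClassGaussianAnomalyDetector` API:

```python
import numpy as np

class DiagonalGaussianDetector:
    """Toy per-class detector: diagonal Mahalanobis distance with robust
    (median/MAD) location and scale estimates. Illustrative API only."""

    def fit(self, X):
        self.mu_ = np.median(X, axis=0)
        mad = np.median(np.abs(X - self.mu_), axis=0)
        self.sigma_ = 1.4826 * mad + 1e-9   # MAD -> std-dev under normality
        return self

    def z_scores(self, X):
        # Per-feature Z-scores keep the explanation interpretable:
        # each feature's contribution to the anomaly score is visible.
        return (X - self.mu_) / self.sigma_

    def score(self, X):
        # Diagonal Mahalanobis distance = L2 norm of the Z-score vector.
        return np.linalg.norm(self.z_scores(X), axis=1)

rng = np.random.default_rng(0)
X_train = rng.normal(0.0, 1.0, size=(500, 4))
det = DiagonalGaussianDetector().fit(X_train)
print(det.score(np.array([[0.0, 0, 0, 0], [8.0, 8, 8, 8]])))
```

In the multi-class setting, one such model is fitted per category, and a sample is scored against the model of its own class.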
| Decision | Rationale |
|---|---|
| Diagonal Mahalanobis | Keeps per-feature Z-score explanations interpretable |
| Robust estimators | Reduces sensitivity to training-set noise |
| Empirical thresholds | Matches actual score distribution vs. theoretical chi-square |
| Per-class models | Captures category-specific normal behaviour |
| FAISS + Ollama embeddings | Fully local -- no external API calls |
| DeepSeek via Ollama | Local LLM, no data leaves the machine |
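The "empirical thresholds" choice can be illustrated with a short sketch: instead of cutting at a theoretical chi-square quantile, the cutoff is taken from a high percentile of the observed training scores. The chi-square draw below is a stand-in for real detector scores, and the 99.5 percentile is an assumed setting:

```python
import numpy as np

rng = np.random.default_rng(1)
# Stand-in for Mahalanobis scores on the training set.
train_scores = np.sqrt(rng.chisquare(df=4, size=10_000))

# Empirical threshold: a high percentile of the actual score distribution,
# so the flag rate on training-like data is controlled by construction.
threshold = np.percentile(train_scores, 99.5)

flagged = train_scores > threshold
print(flagged.mean())
```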
The per-class Gaussian detector is simple and interpretable, which makes its scores easy for the LLM to reason about. Results on the test set:
| Metric | Score |
|---|---|
| ROC-AUC | 0.976 |
| Average Precision | 0.543 |