Industrial Image Anomaly Detection

A unified comparison framework for zero and few-shot industrial image anomaly detection, enabling systematic evaluation of state-of-the-art models across multiple industrial datasets. This project implements and compares two leading approaches: AnomalyDINO (few-shot anomaly detection via large-scale foundation models) and MuSc (Multi-Scale Contrastive Learning), providing researchers and practitioners with a standardized benchmark for assessing performance in resource-constrained industrial scenarios where labeled anomaly data is scarce or unavailable.

🚀 Features

Multiple Model Support: Implementations of AnomalyDINO and MuSc anomaly detection models
Multiple Dataset Support: MVTec AD, MVTec LOCO AD, BTAD, and ViSA datasets
Flexible Configuration: Hydra-based configuration system for easy experimentation
MLflow Integration: Comprehensive experiment tracking and model management
Various Backbones: Support for DINOv2, CLIP, and other vision transformer backbones
Few-Shot Learning: Configurable few-shot learning scenarios (0, 1, 2, 4, 8, 16, full shots)
Comprehensive Metrics: Detailed evaluation metrics for different datasets
Visualization Tools: Built-in visualization utilities for results analysis

📋 Requirements

Python 3.10+
PyTorch with CUDA support
FAISS (for efficient similarity search)
MLflow (for experiment tracking)
Hydra (for configuration management)

🔧 Installation

1. Clone the Repository

git clone https://github.com/your-username/industrial-image-anomaly-detection.git
cd industrial-image-anomaly-detection

2. Setup Conda Environment

conda env update --prefix ./.conda --file environment.yaml --prune
conda activate ./.conda

3. Dataset Setup

Download the required datasets:

MVTec AD: Download from MVTec AD Website
MVTec LOCO AD: Download from MVTec LOCO AD Website
BTAD: Download from BTAD Repository
ViSA: Download from ViSA Website

Update the dataset paths in the configuration files under conf/dataset/.

🎯 Usage

Basic Training and Evaluation

Run the main script with default configuration:

python main.py

Custom Configuration

You can override any configuration parameter:

# Change model and dataset
python main.py model=musc dataset=mvtec_ad

# Modify few-shot settings
python main.py shots=4 seed=42

# Enable/disable MLflow tracking
python main.py mlflow_enable=false

Configuration Options

Models

anomalydino: AnomalyDINO model with DINOv2 backbone
musc: MuSc model with CLIP backbone

Datasets

mvtec_ad: MVTec Anomaly Detection dataset
mvtec_loco_ad: MVTec LOCO AD dataset (logical and structural)
btad: BTAD dataset
visa: ViSA dataset

Few-Shot Learning

shots: Number of reference images (0, 1, 2, 4, 8, 16, or "full")
seed: Random seed for reproducibility
sampler_type: Sampling strategy ("musc" for random, "anomalydino" for sequence)

📊 Supported Datasets

Dataset	Categories	Image Types	Anomaly Types
MVTec AD	15 categories	Industrial objects/textures	Defects, damages
MVTec LOCO AD	5 categories	Industrial objects	Logical/structural anomalies
BTAD	3 categories	Industrial products	Surface defects
ViSA	12 categories	Industrial objects	Various anomalies

🏗️ Architecture

Models

AnomalyDINO

Backbone: DINOv2 Vision Transformer
Method: Feature extraction + k-NN similarity search

MuSc (Multi-Scale Contrastive Learning)

Backbone: CLIP Vision Transformer
Method: Multi-scale feature extraction with contrastive learning
Components: LNAMD, MSM, RsCIN, MSM+

Project Structure

├── conf/                   # Hydra configuration files
│   ├── config.yaml         # Main configuration
│   ├── dataset/            # Dataset configurations
│   └── model/              # Model configurations
├── datasets/               # Dataset implementations
├── metrics/                # Metrics implementations for each dataset
├── models/                 # Model implementations
│   ├── anomalydino/        # AnomalyDINO implementation
│   ├── musc/               # MuSc implementation
│   └── backbone/           # Backbone implementations
|── notebooks/              # Jupyter notebooks for exploration and image creation
├── utils/                  # Utility functions
└── main.py                 # Main training/evaluation script

📈 Experiment Tracking

The project integrates with MLflow for comprehensive experiment tracking:

Start MLflow server:

mlflow server --host 0.0.0.0 --port 5000

Access MLflow UI: Open http://localhost:5000 in your browser
Configuration: Enable/disable MLflow tracking in conf/config.yaml:

mlflow_enable: true
mlflow_run_name: "experiment_name"

🎨 Visualization

The project includes visualization tools for:

Sample images with anomaly masks
Model predictions vs ground truth
Feature maps and attention visualizations
Quantitative results plots

Enable visualization in the configuration:

visualize: true
num_samples: 5

📊 Metrics

The project includes multiple metrics implementations for each supported dataset or model in the metrics/ directory.

Each metrics file implements a compute_metrics() function that takes ground truth and prediction arrays and returns comprehensive evaluation metrics for both image-level and pixel-level anomaly detection performance.

🔧 Development

Adding New Datasets

Create a new dataset class in datasets/
Implement the required dataset interface
Add configuration file in conf/dataset/
Create corresponding metrics implementation in metrics/

Adding New Metrics

Create a metrics file in metrics/ following the pattern metrics_<dataset_name>.py
Implement the compute_metrics(gt_sp, pr_sp, gt_px, pr_px) function
Include metrics like: AUROC, F1-Max, AP (image-level) and AUROC, F1-Max, AUPRO (pixel-level)
Reference existing implementations: metrics/anomalydino.py, metrics/musc.py, metrics/mvtec_ad.py

Adding New Models

Create a new model class in models/your_model/
Implement the required interface methods
Add configuration file in conf/model/
Update the main script imports

Adding New Backbones

Create a new backbone class in models/backbone/ inheriting from BaseBackbone
Implement the required abstract methods:
- load_pretrained_model(): Load the pretrained weights for the backbone
- extract_features(images): Extract features from input images
Consider creating model-specific variants (e.g., YourBackboneMuSc, YourBackboneAnomalyDINO)
Register the backbone in backbone_factory.py in the appropriate factory functions
Update model configurations to use the new backbone
Test compatibility with existing models (AnomalyDINO, MuSc)

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

This project incorporates code from several research works:

AnomalyDINO - Licensed under Apache 2.0
MuSc - Licensed under MIT
DINOv2 - Licensed under Apache 2.0
OpenCLIP - Various licenses

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
conf		conf
datasets		datasets
metrics		metrics
models		models
notebooks		notebooks
utils		utils
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
main.py		main.py

Folders and files

Latest commit

History

Repository files navigation

Industrial Image Anomaly Detection

🚀 Features

📋 Requirements

🔧 Installation

1. Clone the Repository

2. Setup Conda Environment

3. Dataset Setup

🎯 Usage

Basic Training and Evaluation

Custom Configuration

Configuration Options

Models

Datasets

Few-Shot Learning

📊 Supported Datasets

🏗️ Architecture

Models

AnomalyDINO

MuSc (Multi-Scale Contrastive Learning)

Project Structure

📈 Experiment Tracking

🎨 Visualization

📊 Metrics

🔧 Development

Adding New Datasets

Adding New Metrics

Adding New Models

Adding New Backbones

📄 License

🙏 Acknowledgments

📚 References

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages