This project applies deep learning to distinguish between VBF and GGF Higgs production modes using the Classification Without Labels (CWoLa) framework. The approach is inspired by the paper Classification without labels: Learning from mixed samples in high energy physics, which introduces CWoLa as a viable strategy for learning directly from mixed real data samples.
-
Download miniconda through:
# Assuming Linux system wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh # Install thorugh sh. Type 'yes' when asking automatically initialization. sh Miniconda3-latest-Linux-x86_64.sh
-
Create a virtual environment:
# Initialization conda env create -f environment.yml # Update when the environment.yml changed conda env update -f environment.yml
-
Directly activate in Jupyter, or activate/exit with:
conda activate cwola conda deactivate
-
(Optional) Create a
.env
such that VSCode can fetch the packages.PYTHONPATH=~/miniconda3/envs/cwola/lib/python3.12/site-packages
-
CNN_Baseline
: A reference implementation based on VBF vs. GGF Higgs with Full-Event Deep Learning: Towards a Decay-Agnostic Tagger. This model utilizes event-level CNNs to perform classification. -
CNN_Light
: A simplified and more lightweight version ofCNN_Baseline
, designed to reduce model complexity and training time while retaining performance.
-
ParT_Baseline
: A transformer-based architecture based on Particle Transformer for Jet Tagging. This model captures particle-level features using attention mechanisms tailored for jet tagging tasks. -
ParT_*
: A family of lighter variants derived fromParT_Baseline
, offering faster training and inference with reduced computational cost.
The data preprocessings can be implemented by the following steps:
- Check the supported methods:
- data preprocessing: Check the methods provided in the class
src.data_preprocess.MCSimData
- data augmentation: Supported functions can be found in
src.data_augment
.
- data preprocessing: Check the methods provided in the class
- Give abbreviations for the preprocessing/augmentation methods in the class
LitDataModule
../notebooks/training.ipynb
→LitDataModule
→__init__
- Determine which preprocessings/augmentations to be used through YAML files in
./config
with file namedexp_*.yml
.