omnifoldTop

Omnifold for ttbar l+jets final states.

See https://github.com/ericmetodiev/OmniFold.git for the original repo from the OmniFold authors.

Install

git clone https://github.com/zctao/omnifoldTop.git

To run in the tensorflow container:

singularity pull docker://tensorflow/tensorflow:2.7.0-gpu
singularity run --nv [--bind DATADIR] tensorflow_2.7.0-gpu.sif

To set up virtual environment:

source scripts/setup_venv.sh venv requirements.txt

Dependencies

See

requirements.txt

Run

Set up environment and Python path:

source setup.sh

The main application to run is

python scripts/unfoldv2.py -d DATA_FILES -s SIMULATION_FILES \
                  [-o OUTPUT_DIRECTORY] \
                  [-i NUMBER_OF_ITERATIONS] \
                  [--observables LIST_OF_VARRIABLES_TO_UNFOLD]

To see all argument options:

python scripts/unfoldv2.py -h

Input files are expected to be ROOT files produced by ntuplerTT. In case the DATA_FILES are actually pseudo data from MC simulations, a flag -t can be added to indicate the "MC truth" is known and can be used to evaluate the performance.

In case one wishes to reuse the previously trained results, an option --unfolded-weights PATH_TO_UNFOLDED_WEIGHTS_FILES can be used to read the event weights from the specified files and apply them directly to other variables. Or --load-models MODELS_DIR can be used to load the trained models for reweighting directly in iterations.

[NEW] Unfolding can also be run from

./run_unfold.py RUN_CONFIG

RUN_CONFIG is a JSON config file that are used to set the arguments for scripts/unfoldv2.py. If the value of an argument is a dictionary, the keys of the dictionary are used as labels to create sub-directory in OUTPUT_DIRECTORY, and unfoldings are run with all combinations of the arguments. See one example run config file in configs/run/example.json.

Other useful scripts

evaluate_systematics.py [TODO]

[DEPRECATED]

evaluateModels.py: if input data files are pseudo data, i.e. MC truth is available, evaluateModels.py can be used to reweight simulation truth directly to the pseudo data truth and compare the reweighted distribution with the actual MC truth in the pseudo data. The goal is to evaluate the performance of the classifier model without running OmniFold iterations.
```
  python3 evaluateModels.py -d DATA_FILES -s SIMULATION_FILES \
                            [-o OUTPUT_DIRECTORY] \
                            [--observables LIST_OF_VARIABLES_TO_UNFOLD]
```
To see all argument options:
```
  python3 evaluateModels.py -h
```

scripts/rootReader.py: convert root files into npz files containing numpy structure arrays.

  python3 scripts/rootReader.py ROOT_FILES \
                                --truth-level parton|particle \
                                [-o OUTPUT_NAME]

scripts/makeRunScript.py: generate a bash script to run multiple unfolding iterations with different parameters given a run configuration file.
```
  python3 scripts/makeRunScript.py RUNCONFIG [-o OUTPUT_NAME]
```
An example of the run config file is provided in configs/run/basic_tests.json.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

omnifoldTop

Install

Dependencies

Run

Other useful scripts

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 553 Commits
configs		configs
python		python
scripts		scripts
test		test
.gitignore		.gitignore
README.md		README.md
evaluateModels.py		evaluateModels.py
replot_unfold.py		replot_unfold.py
requirements.txt		requirements.txt
requirements_cedar.txt		requirements_cedar.txt
run_unfold.py		run_unfold.py
setup.sh		setup.sh
setup_cedar.sh		setup_cedar.sh

Folders and files

Latest commit

History

Repository files navigation

omnifoldTop

Install

Dependencies

Run

Other useful scripts

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages