Skip to content

Package description

Gian Michele Innocenti edited this page Apr 10, 2019 · 15 revisions

Analysis package description:

This package is meant to run fast parallel analysis and machine learning optimization using modern servers with Python and Pandas. In order to start your analysis you need a list of unmerged flat ROOT TTrees for data and MC. For full compatibility, it is recommended to produce your TTrees using the same format presented in https://github.com/ginnocen/ALICETreeCreator.

  • Tree to Pandas DataFrame conversion
  • Skimming : dataframes are selected according to good run list, event selection, minimum pT
  • Optimization:
    • a subset of the MC and data are merged and used to optimise the selection strategy. For the ML optimization, the signal is taken from MC and the background from data side-bands.
    • Trained models are saved and made available for analysis
  • Model testing on data/MC:
    • The unmerged dataframes are processed. Candidates are selected according to standard analysis cuts or according to a loose cut on ML probability
    • Merged dataframes are created with candidates selected by the standard analysis or the ML probability
  • Invariant mass and efficiency building:
    • On the small skimmed dataset, invariant mass spectra and efficiency plots are created and stored in the same ROOT format as the regular task output (AnalysisResults.root)

Clone this wiki locally