diff --git a/_gsocorgs/2025/anl.md b/_gsocorgs/2025/anl.md
new file mode 100644
index 000000000..0c6df330f
--- /dev/null
+++ b/_gsocorgs/2025/anl.md
@@ -0,0 +1,11 @@
+---
+title: "ANL"
+author: "Maciej Szymanski"
+layout: default
+organization: ANL
+logo: ANL-logo.png
+description: |
+ Argonne National Lab (ANL) is a multidisciplinary science and engineering research center, where talented scientists and engineers work together to answer the biggest questions facing humanity, from how to obtain affordable clean energy to protecting ourselves and our environment.
+---
+
+{% include gsoc_proposal.ext %}
diff --git a/_gsocprojects/2025/project_ATLAS.md b/_gsocprojects/2025/project_ATLAS.md
new file mode 100644
index 000000000..90d3887c0
--- /dev/null
+++ b/_gsocprojects/2025/project_ATLAS.md
@@ -0,0 +1,9 @@
+---
+project: ATLAS
+layout: default
+logo: ATLAS-logo.png
+description: |
+ [ATLAS](http://atlas.cern) is one of the four major experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). It is a general-purpose particle physics experiment run by an international collaboration and is designed to exploit the full discovery potential and the huge range of physics opportunities that the LHC provides.
+---
+
+{% include gsoc_project.ext %}
diff --git a/_gsocproposals/2025/proposal_ATLAS_lossy_compression.md b/_gsocproposals/2025/proposal_ATLAS_lossy_compression.md
new file mode 100644
index 000000000..690ccf173
--- /dev/null
+++ b/_gsocproposals/2025/proposal_ATLAS_lossy_compression.md
@@ -0,0 +1,42 @@
+---
+title: Precision Recovery in Lossy-Compressed Floating Point Data for High Energy Physics
+layout: gsoc_proposal
+project: ATLAS
+year: 2025
+organization:
+ - ANL
+ - CERN
+difficulty: medium
+duration: 350
+mentor_avail: July-September
+---
+
+## Description
+
+[ATLAS](http://atlas.cern) is one of the particle physics experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). With the planned upgrade of the LHC (the so-called High Luminosity phase), allowing for even more detailed exploration of fundamental particles and forces of nature, it is expected that the recorded data rate will be up to ten times greater than today. One of the methods of addressing this storage challenge is data compression. The traditional approach involves lossless compression algorithms such as zstd and zlib. To further reduce storage footprint, methods involving lossy compression are being investigated. One of the solutions in High Energy Physics is the reduction of floating point precision, as stored precision may be higher than detector resolution. However, when reading data back, physicists may be interested in restoring the precision of the floating point numbers. This is obviously impossible in the strict sense, as the process of removing bits is irreversible. Nevertheless, given that the data volume is high, some variables are correlated, and follow specific distributions, one may consider a machine learning approach to recover the lossy-compressed floating-point data.
+
+## Task ideas
+
+ * Perform lossy compression of a data sample from the ATLAS experiment
+ * Investigate ML techniques for data recovery, prediction and upscaling
+ * Integrate the chosen technique into HEP workflows
+
+## Expected results
+
+ * Implementation of an ML-based procedure to restore precision of lossy-compressed floating-point numbers in ATLAS data
+ * Evaluation of the method's performance (decompression accuracy) and its applicability in HEP workflows
+
+## Requirements
+
+ * C++, Python, Machine Learning
+
+## Mentors
+
+ * **[Maciej Szymański](mailto:maciej.szymanski@cern.ch)**
+ * [Peter Van Gemmeren](mailto:peter.van.gemmeren@cern.ch)
+
+## Links
+
+ * [IEEE_754](https://en.wikipedia.org/wiki/IEEE_754)
+ * [Implementation of FloatCompressor in Athena](https://gitlab.cern.ch/atlas/athena/-/blob/main/Control/CxxUtils/Root/FloatCompressor.cxx)
+
diff --git a/gsoc/2025/mentors.md b/gsoc/2025/mentors.md
index a2ad09321..b4ad87139 100644
--- a/gsoc/2025/mentors.md
+++ b/gsoc/2025/mentors.md
@@ -27,6 +27,8 @@ layout: plain
 * Mayank Sharma [mayank.sharma@cern.ch](mailto:mayank.sharma@cern.ch) UMich
 * Simon Spannagel [simon.spannagel@desy.de](mailto:simon.spannagel@desy.de) DESY
 * Graeme Stewart [graeme.andrew.stewart@cern.ch](mailto:graeme.andrew.stewart@cern.ch) CERN
+* Maciej Szymański [maciej.szymanski@cern.ch](mailto:maciej.szymanski@cern.ch) ANL
+* Peter Van Gemmeren [peter.van.gemmeren@cern.ch](mailto:peter.van.gemmeren@cern.ch) ANL
 * Martin Vasilev [mvassilev@uni-plovdiv.bg](mailto:mvassilev@uni-plovdiv.bg) University of Plovdiv, BG
 * Vassil Vassilev [vvasilev@cern.ch](mailto:vvasilev@cern.ch) CompRes
 * Valentin Volkl [valentin.volkl@cern.ch](mailto:valentin.volkl@cern.ch) CERN
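
Reviewer note: the "reduction of floating point precision" mentioned in the proposal's Description can be illustrated with a minimal sketch. It assumes IEEE 754 single precision (1 sign bit, 8 exponent bits, 23 mantissa bits) and plain truncation of the low mantissa bits. The function name `truncate_mantissa` and the parameter `keep_bits` are hypothetical, chosen for illustration only; this is not the Athena `FloatCompressor` implementation, which may differ (for example in how it rounds).

```python
import struct


def truncate_mantissa(value: float, keep_bits: int) -> float:
    """Zero the low (23 - keep_bits) mantissa bits of a float32 value.

    Illustrative helper (hypothetical name); assumes IEEE 754 binary32.
    """
    if not 0 <= keep_bits <= 23:
        raise ValueError("keep_bits must be between 0 and 23")
    # Reinterpret the float32 bit pattern as an unsigned 32-bit integer.
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    drop = 23 - keep_bits
    # Mask that keeps sign, exponent and the top keep_bits of the mantissa.
    mask = (0xFFFFFFFF << drop) & 0xFFFFFFFF
    # Reinterpret the masked integer as a float32 again.
    (result,) = struct.unpack("<f", struct.pack("<I", bits & mask))
    return result


if __name__ == "__main__":
    x = 3.14159
    for keep in (23, 16, 10, 4):
        print(keep, truncate_mantissa(x, keep))
```

The zeroed trailing bits are what a subsequent lossless pass (zstd, zlib) can exploit, and they are also the information an ML-based recovery step would have to estimate. With truncation, the relative error for normal values stays below `2**-keep_bits`.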