Commit 9ee78a6

Add GSoC proposal for ATLAS lossy float compression study (#1685)
* Create proposal_ATLAS_lossy_compression.md
* Create anl.md
* Create project_ATLAS.md
* Update mentors.md
* Update mentors.md
1 parent b6fa38b commit 9ee78a6

4 files changed: 64 additions and 0 deletions

_gsocorgs/2025/anl.md

Lines changed: 11 additions & 0 deletions
@@ -0,0 +1,11 @@
+---
+title: "ANL"
+author: "Maciej Szymanski"
+layout: default
+organization: ANL
+logo: ANL-logo.png
+description: |
+Argonne National Lab (ANL) is a multidisciplinary science and engineering research center, where talented scientists and engineers work together to answer the biggest questions facing humanity, from how to obtain affordable clean energy to protecting ourselves and our environment.
+---
+
+{% include gsoc_proposal.ext %}
Lines changed: 9 additions & 0 deletions
@@ -0,0 +1,9 @@
+---
+project: ATLAS
+layout: default
+logo: ATLAS-logo.png
+description: |
+[ATLAS](http://atlas.cern) is one of the four major experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). It is a general-purpose particle physics experiment run by an international collaboration and is designed to exploit the full discovery potential and the huge range of physics opportunities that the LHC provides.
+---
+
+{% include gsoc_project.ext %}
Lines changed: 42 additions & 0 deletions
@@ -0,0 +1,42 @@
+---
+title: Precision Recovery in Lossy-Compressed Floating Point Data for High Energy Physics
+layout: gsoc_proposal
+project: ATLAS
+year: 2025
+organization:
+- ANL
+- CERN
+difficulty: medium
+duration: 350
+mentor_avail: July-September
+---
+
+## Description
+
+[ATLAS](http://atlas.cern) is one of the particle physics experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). The planned upgrade of the LHC (the High Luminosity phase) will allow even more detailed exploration of fundamental particles and forces of nature, and the recorded data rate is expected to be up to ten times greater than today. One way to address this storage challenge is data compression. The traditional approach uses lossless compression algorithms such as zstd and zlib. To reduce the storage footprint further, lossy compression methods are being investigated. One solution used in High Energy Physics (HEP) is to reduce floating-point precision, since the stored precision may be higher than the detector resolution. However, when reading data back, physicists may be interested in restoring the precision of the floating-point numbers. This is impossible in the strict sense, as removing bits is irreversible. Nevertheless, given that the data volume is high and that some variables are correlated and follow specific distributions, one may consider a machine learning (ML) approach to recover the lossy-compressed floating-point data.
+
+## Task ideas
+
+* Perform lossy compression of a data sample from the ATLAS experiment
+* Investigate ML techniques for data recovery, prediction and upscaling
+* Integrate the chosen technique into the HEP workflow
+
+## Expected results
+
+* Implementation of an ML-based procedure to restore the precision of lossy-compressed floating-point numbers in ATLAS data
+* Evaluation of the method's performance (decompression accuracy) and its applicability in the HEP workflow
+
+## Requirements
+
+* C++, Python, Machine Learning
+
+## Mentors
+
+* **[Maciej Szymański](mailto:[email protected])**
+* [Peter Van Gemmeren](mailto:[email protected])
+
+## Links
+
+* [IEEE_754](https://en.wikipedia.org/wiki/IEEE_754)
+* [Implementation of FloatCompressor in Athena](https://gitlab.cern.ch/atlas/athena/-/blob/main/Control/CxxUtils/Root/FloatCompressor.cxx)
+
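
Note on the proposal's Description: the "reduction of floating point precision" it mentions can be illustrated with a minimal sketch. The code below assumes IEEE 754 single precision (1 sign bit, 8 exponent bits, 23 mantissa bits) and plain truncation, that is, zeroing the lowest mantissa bits. The function names `truncate_mantissa` and `max_relative_error` are hypothetical and are not taken from Athena; the FloatCompressor linked in the proposal may behave differently (for example, it may round rather than truncate).

```python
import struct


def truncate_mantissa(value: float, keep_bits: int) -> float:
    """Store `value` as float32 and zero its lowest mantissa bits.

    `keep_bits` is the number of mantissa bits (0..23) that survive.
    This is an illustrative sketch, not the ATLAS implementation.
    """
    if not 0 <= keep_bits <= 23:
        raise ValueError("keep_bits must be in the range 0..23")
    # Reinterpret the float32 bit pattern as an unsigned 32-bit integer.
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    drop = 23 - keep_bits
    # Keep sign, exponent and the top `keep_bits` mantissa bits; clear the rest.
    mask = (0xFFFFFFFF << drop) & 0xFFFFFFFF
    (truncated,) = struct.unpack("<f", struct.pack("<I", bits & mask))
    return truncated


def max_relative_error(keep_bits: int) -> float:
    """Bound on the relative truncation error for normal float32 values."""
    return 2.0 ** (-keep_bits)


if __name__ == "__main__":
    pt = 45123.456  # hypothetical example value, not real ATLAS data
    exact = truncate_mantissa(pt, 23)  # float32 value with nothing dropped
    for keep in (23, 16, 10, 7):
        approx = truncate_mantissa(pt, keep)
        print(keep, approx, abs(approx - exact) / exact, max_relative_error(keep))
```

The cleared bits are all zero, which is what makes the result compress better under a lossless codec such as zstd or zlib, and also why the original value cannot be recovered exactly: every float32 in the interval between a truncated value and the next representable truncated value maps to the same output. Any recovery method, ML-based or otherwise, can only estimate where in that interval the original value was.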

gsoc/2025/mentors.md

Lines changed: 2 additions & 0 deletions
@@ -27,6 +27,8 @@ layout: plain
 * Mayank Sharma [[email protected]](mailto:[email protected]) UMich
 * Simon Spannagel [[email protected]](mailto:[email protected]) DESY
 * Graeme Stewart [[email protected]](mailto:[email protected]) CERN
+* Maciej Szymański [[email protected]](mailto:[email protected]) ANL
+* Peter Van Gemmeren [[email protected]](mailto:[email protected]) ANL
 * Martin Vasilev [[email protected]](mailto:[email protected]) University of Plovdiv, BG
 * Vassil Vassilev [[email protected]](mailto:[email protected]) CompRes
 * Valentin Volkl [[email protected]](mailto:[email protected]) CERN
