Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 11 additions & 0 deletions _gsocorgs/2025/anl.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
---
title: "ANL"
author: "Maciej Szymanski"
layout: default
organization: ANL
logo: ANL-logo.png
description: |
Argonne National Lab (ANL) is a multidisciplinary science and engineering research center, where talented scientists and engineers work together to answer the biggest questions facing humanity, from how to obtain affordable clean energy to protecting ourselves and our environment.
---

{% include gsoc_proposal.ext %}
9 changes: 9 additions & 0 deletions _gsocprojects/2025/project_ATLAS.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
---
project: ATLAS
layout: default
logo: ATLAS-logo.png
description: |
[ATLAS](http://atlas.cern) is one of the four major experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). It is a general-purpose particle physics experiment run by an international collaboration and is designed to exploit the full discovery potential and the huge range of physics opportunities that the LHC provides.
---

{% include gsoc_project.ext %}
42 changes: 42 additions & 0 deletions _gsocproposals/2025/proposal_ATLAS_lossy_compression.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
---
title: Precision Recovery in Lossy-Compressed Floating Point Data for High Energy Physics
layout: gsoc_proposal
project: ATLAS
year: 2025
organization:
- ANL
- CERN
difficulty: medium
duration: 350
mentor_avail: July-September
---

## Description

[ATLAS](http://atlas.cern) is one of the particle physics experiments at the [Large Hadron Collider](http://home.web.cern.ch/topics/large-hadron-collider) (LHC) at [CERN](http://home.cern/). With the planned upgrade of the LHC (the so-called High Luminosity phase), allowing for even more detailed exploration of fundamental particles and forces of nature, it is expected that the recorded data rate will be up to ten times greater than today. One of the methods of addressing this storage challenge is data compression. The traditional approach involves lossless compression algorithms such as zstd and zlib. To further reduce storage footprint, methods involving lossy compression are being investigated. One of the solutions in High Energy Physics is the reduction of floating point precision, as stored precision may be higher than detector resolution. However, when reading data back, physicists may be interested in restoring the precision of the floating point numbers. This is obviously impossible in the strict sense, as the process of removing bits is irreversible. Nevertheless, given that the data volume is high, some variables are correlated, and follow specific distributions, one may consider a machine learning approach to recover the lossy-compressed floating-point data.

## Task ideas

* Perform lossy compression of data sample from the ATLAS experiment
* Investigate ML techniques for data recovery, prediction and upscaling
* Integrate the chosen technique into HEP workflow

## Expected results

* Implementation of ML-based procedure to restore precision of lossy-compressed floating-point numbers in ATLAS data
* Evaluation of the method's performance (decompression accuracy) and its applicability in HEP workflow

## Requirements

* C++, Python, Machine Learning

## Mentors

* **[Maciej Szymański](mailto:[email protected])**
* [Peter Van Gemmeren](mailto:[email protected])

## Links

* [IEEE_754](https://en.wikipedia.org/wiki/IEEE_754)
* [Implementation of FloatCompressor in Athena](https://gitlab.cern.ch/atlas/athena/-/blob/main/Control/CxxUtils/Root/FloatCompressor.cxx)

2 changes: 2 additions & 0 deletions gsoc/2025/mentors.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,6 +27,8 @@ layout: plain
* Mayank Sharma [[email protected]](mailto:[email protected]) UMich
* Simon Spannagel [[email protected]](mailto:[email protected]) DESY
* Graeme Stewart [[email protected]](mailto:[email protected]) CERN
* Maciej Szymański [[email protected]](mailto:[email protected]) ANL
* Peter Van Gemmeren [[email protected]](mailto:[email protected]) ANL
* Martin Vasilev [[email protected]](mailto:[email protected]) University of Plovdiv, BG
* Vassil Vassilev [[email protected]](mailto:[email protected]) CompRes
* Valentin Volkl [[email protected]](mailto:[email protected]) CERN