Skip to content

Commit 10b8469

Browse files
authored
Merge branch 'HSF:main' into new-gsoc-project
2 parents da897ba + 955278a commit 10b8469

File tree

6 files changed

+164
-4
lines changed

6 files changed

+164
-4
lines changed

_activities/gsoc.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -15,10 +15,10 @@ and analyze petabytes of data from high-energy physics experiments, such as thos
1515
hosted at the CERN laboratory in Geneva, Switzerland.
1616
Some of the questions that we collectively ask are:
1717

18-
- what are the fundamental blocks that make up our Universe?
19-
- what is the nature of dark matter and dark energy?
20-
- what is the nature of the asymmetry between matter and antimatter?
21-
- what was early Universe like?
18+
- What are the fundamental blocks that make up our Universe?
19+
- What is the nature of dark matter and dark energy?
20+
- What is the nature of the asymmetry between matter and antimatter?
21+
- What was early Universe like?
2222

2323
To answer these questions, particle physicists build software to simulate and analyze what happens in particle physics detectors.
2424

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
---
2+
project: JuliaHEP
3+
title: JuliaHEP
4+
layout: default
5+
logo: juliahep/juliaheplogo.png
6+
description: |
7+
The [JuliaHEP](https://hepsoftwarefoundation.org/activities/juliahep.html) working group brings together a community of developers and users of Julia in Particle Physics, with the aim of improving the sharing of knowledge and expertise, as well as unify effort in developing Julia packages useful for the community.
8+
---
9+
10+
{% include gsoc_project.ext %}
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
title: Implement and improve an efficient, layered tape with prefetching capabilities
3+
layout: gsoc_proposal
4+
project: Clad
5+
year: 2025
6+
difficulty: medium
7+
duration: 350
8+
mentor_avail: June-October
9+
organization:
10+
- CompRes
11+
---
12+
13+
## Description
14+
15+
In mathematics and computer algebra, automatic differentiation (AD) is a set of techniques to numerically evaluate the derivative of a function specified by a computer program. Automatic differentiation is an alternative technique to Symbolic differentiation and Numerical differentiation (the method of finite differences). Clad is based on Clang which provides the necessary facilities for code transformation. The AD library can differentiate non-trivial functions, to find a partial derivative for trivial cases and has good unit test coverage.
16+
17+
The most heavily used entity in AD is a stack-like data structure called a tape. For example, the first-in last-out access pattern, which naturally occurs in the storage of intermediate values for reverse mode AD, lends itself towards asynchronous storage. Asynchronous prefetching of values during the reverse pass allows checkpoints deeper in the stack to be stored furthest away in the memory hierarchy. Checkpointing provides a mechanism to parallelize segments of a function that can be executed on independent cores. Inserting checkpoints in these segments using separate tapes enables keeping the memory local and not sharing memory between cores. We will research techniques for local parallelization of the gradient reverse pass, and extend it to achieve better scalability and/or lower constant overheads on CPUs and potentially accelerators. We will evaluate techniques for efficient memory use, such as multi-level checkpointing support. Combining already developed techniques will allow executing gradient segments across different cores or in heterogeneous computing systems. These techniques must be robust and user-friendly, and minimize required application code and build system changes.
18+
19+
This project aims to improve the efficiency of the clad tape and generalize it into a tool-agnostic facility that could be used outside of clad as well.
20+
21+
## Expected Results
22+
23+
* Optimize the current tape by avoiding re-allocating on resize in favor of using connected slabs of array
24+
* Enhance existing benchmarks demonstrating the efficiency of the new tape
25+
* Add the tape thread safety
26+
* Implement multilayer tape being stored in memory and on disk
27+
* [Stretch goal] Support cpu-gpu transfer of the tape
28+
* [Stretch goal] Add infrastructure to enable checkpointing offload to the new tape
29+
* [Stretch goal] Performance benchmarks
30+
31+
32+
## Requirements
33+
34+
* Automatic differentiation
35+
* C++ programming
36+
* Clang frontend
37+
38+
## Mentors
39+
* **[Vassil Vassilev](mailto:[email protected])**
40+
* [David Lange](mailto:[email protected])
41+
42+
## Links
43+
* [Repo](https://github.com/vgvassilev/clad)
Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
---
2+
title: Implement CppInterOp API exposing memory, ownership and thread safety information
3+
layout: gsoc_proposal
4+
project: Cppyy
5+
year: 2025
6+
difficulty: medium
7+
duration: 350
8+
mentor_avail: June-October
9+
organization:
10+
- CompRes
11+
---
12+
13+
## Description
14+
15+
Incremental compilation pipelines process code chunk-by-chunk by building an ever-growing translation unit. Code is then lowered into the LLVM IR and subsequently run by the LLVM JIT. Such a pipeline allows creation of efficient interpreters. The interpreter enables interactive exploration and makes the C++ language more user friendly. The incremental compilation mode is used by the interactive C++ interpreter, Cling, initially developed to enable interactive high-energy physics analysis in a C++ environment.
16+
17+
Clang and LLVM provide access to C++ from other programming languages, but currently only exposes the declared public interfaces of such C++ code even when it has parsed implementation details directly. Both the high-level and the low-level program representation has enough information to capture and expose more of such details to improve language interoperability. Examples include details of memory management, ownership transfer, thread safety, externalized side-effects, etc. For example, if memory is allocated and returned, the caller needs to take ownership; if a function is pure, it can be elided; if a call provides access to a data member, it can be reduced to an address lookup. The goal of this project is to develop API for CppInterOp which are capable of extracting and exposing such information AST or from JIT-ed code and use it in cppyy (Python-C++ language bindings) as an exemplar. If time permits, extend the work to persistify this information across translation units and use it on code compiled with Clang.
18+
19+
## Project Milestones
20+
21+
* Collect and categorize possible exposed interop information kinds
22+
* Write one or more facilities to extract necessary implementation details
23+
* Design a language-independent interface to expose this information
24+
* Integrate the work in clang-repl and Cling
25+
* Implement and demonstrate its use in cppyy as an exemplar
26+
* Present the work at the relevant meetings and conferences.
27+
28+
## Requirements
29+
30+
* C++ programming
31+
* Python programming
32+
* Knowledge of Clang and LLVM
33+
34+
## Mentors
35+
* **[Vassil Vassilev](mailto:[email protected])**
36+
* [Aaron Jomy](mailto:[email protected])
37+
* [David Lange](mailto:[email protected])
38+
39+
## Links
40+
* [Repo](https://github.com/compiler-research/CppInterOp)
Lines changed: 64 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,64 @@
1+
---
2+
title: Julia Interfaces to HepMC3
3+
layout: gsoc_proposal
4+
project: JuliaHEP
5+
year: 2025
6+
organization:
7+
- CERN
8+
difficulty: medium
9+
duration: 175
10+
mentor_avail: June-July- August
11+
---
12+
13+
## Description
14+
15+
In high-energy physics experiments at [CERN](https://home.cern/) it is necessary to simulate physics events in order to compare predicted observations with those that the LHC experiments actually observe. A key piece of the software chain used to do that is the [HepMC3](https://arxiv.org/abs/1912.08005) event record library, which encodes the output from physics event generators in a standard way, so that they can be used by downstream detector simulation and analysis codes.
16+
17+
There is now [increasing interest](https://doi.org/10.1007/s41781-023-00104-x) in using [Julia](https://julialang.org/) as a language for HEP software, as it combines the ease of programming in interactive languages, e.g., Python, with the speed of compiled language, such as C++. As part of building up the ecosystem of supporting packages for Julia in high-energy physics, developing interfaces to read, manipulated and write HepMC3 event records in Julia is the aim of this project.
18+
19+
## Task ideas
20+
21+
This project would develop a wrapper library for [HepMC3](https://gitlab.cern.ch/hepmc/HepMC3) allowing the HepMC3 data objects and methods, in C++, to be called from Julia.
22+
23+
It would utilise the general underlying wrapper interfaces in [CxxWrap](https://github.com/JuliaInterop/CxxWrap.jl) and the automated wrapper code generator [WrapIt!](https://github.com/grasph/wrapit) to allow for as easy and maintainable an interface as possible.
24+
25+
A key outcome would be a set of unit tests and examples, based on the HepMC3 ones, demonstrating how to use the library and proving that the code is correct.
26+
27+
## Expected results and milestones
28+
29+
- Reading of HepMC3 event files
30+
- Particularly the ASCII format will be targeted first
31+
- Access to event data structures
32+
- Access to particle properties
33+
- Navigation of the event and the vertices between parent and child particles
34+
- Access to run information
35+
- Update of HepMC3 data structures
36+
- Creation of new HepMC3 events
37+
- Re-serialisation of these events to file
38+
- Initially ASCII
39+
- Documentation and examples on how to use the Julia interfaces
40+
- HepMC3.jl package registered in the Julia general registry
41+
- Extension of serialisation to ROOT format (stretch goal)
42+
43+
## Requirements
44+
45+
- Programming experience in C++
46+
- Prior experience in Julia (very advantageous)
47+
- A background understanding of high-energy physics (advantageous)
48+
49+
## Evaluation Exercise
50+
51+
TBD
52+
53+
## Mentors
54+
55+
- **[Graeme Stewart](mailto:[email protected])**
56+
- [Mateusz Fila](mailto:[email protected])
57+
58+
## Links
59+
60+
- [Julia Programming Language](https://julialang.org/)
61+
- [JuliaHEP HSF Group](https://hepsoftwarefoundation.org/workinggroups/juliahep.html)
62+
- [HepMC3 Repository](https://gitlab.cern.ch/hepmc/HepMC3)
63+
- [CxxWrap](https://github.com/JuliaInterop/CxxWrap.jl)
64+
- [WrapIt!](https://github.com/grasph/wrapit)

gsoc/2025/mentors.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,15 +9,18 @@ layout: plain
99
* Martin Barisits [[email protected]](mailto:[email protected]) CERN
1010
* Andy Buckley [[email protected]](mailto:[email protected]) UofGlasgow
1111
* Vipul Cariappa [[email protected]](mailto:[email protected]) CompRes
12+
* Mateusz Fila [[email protected]](mailto:[email protected]) CERN
1213
* Chris Gutschow [[email protected]](mailto:[email protected]) UCLondon
1314
* Aaron Jomy [[email protected]](mailto:[email protected]) CERN/CompRes
15+
* Christina Koutsou [[email protected]](mailto:@[email protected]) CompRes
1416
* Stephan Lachnit [[email protected]](mailto:[email protected]) DESY
1517
* David Lange [[email protected]](mailto:[email protected]) CompRes
1618
* Serguei Linev [[email protected]](mailto:[email protected]) GSI
1719
* Giacomo Parolini [[email protected]](mailto:[email protected]) CERN
1820
* Alexander Penev [[email protected]](mailto:[email protected]) CompRes/University of Plovdiv, BG
1921
* Mayank Sharma [[email protected]](mailto:[email protected]) UMich
2022
* Simon Spannagel [[email protected]](mailto:[email protected]) DESY
23+
* Graeme Stewart [[email protected]](mailto:[email protected]) CERN
2124
* Martin Vasilev [[email protected]](mailto:[email protected]) University of Plovdiv, BG
2225
* Vassil Vassilev [[email protected]](mailto:[email protected]) CompRes
2326
* Valentin Volkl [[email protected]](mailto:[email protected]) CERN

0 commit comments

Comments
 (0)