-
Notifications
You must be signed in to change notification settings - Fork 351
Added project on green software by Uni Manchester / SMARTHEP #1681
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 10 commits
Commits
Show all changes
12 commits
Select commit
Hold shift + click to select a range
e7e114d
Create proposal_SMARTHEP_GreenSoftware
caterina-doglioni 36b9674
Rename proposal_SMARTHEP_GreenSoftware to proposal_SMARTHEP_GreenSoft…
caterina-doglioni 974c812
Update mentors.md
caterina-doglioni 35e7927
Create umanchester.md
caterina-doglioni ea59ffa
Update mentors.md
caterina-doglioni a211000
Update proposal_SMARTHEP_GreenSoftware.md
caterina-doglioni cfa8d3e
Update umanchester.md
caterina-doglioni 6f2f32b
Update proposal_SMARTHEP_GreenSoftware.md
caterina-doglioni d153e31
Update umanchester.md
caterina-doglioni 91e69ed
Update umanchester.md
caterina-doglioni 46fc683
Update proposal_SMARTHEP_GreenSoftware.md
caterina-doglioni c5ac5a9
Update mentors.md
caterina-doglioni File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
--- | ||
title: "University of Manchester" | ||
author: "Caterina Doglioni" | ||
layout: default | ||
organization: UManchester | ||
logo: UofM-logo.png | ||
description: | | ||
The [University of Manchester](<https://www.manchester.ac.uk>) is a leading UK research university. We have a large particle physics group with contributions to LHC experiments, dark matter, flavour, neutrino and muon experiments. We also carry out research into new detector technologies and new data acquisition strategies for future experiments. We are also involved in distributed computing for LHC experiments, hosting one of the largest and most successful Tier-2 distributed computing centres in the UK. | ||
--- | ||
|
||
{% include gsoc_proposal.ext %} |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
@@ -0,0 +1,67 @@ | ||||||
--- | ||||||
title: Estimating the energy cost of scientific software | ||||||
layout: gsoc_proposal | ||||||
project: SMARTHEP | ||||||
year: 2025 | ||||||
organization: | ||||||
- UManchester | ||||||
- CERN | ||||||
difficulty: medium | ||||||
duration: 350 | ||||||
mentor_avail: June-October (with 2-3 weeks mentor vacation where student will work independently with minimal guidance) | ||||||
--- | ||||||
# Description | ||||||
|
||||||
At a time where “energy crisis” is something that we hear daily, | ||||||
we can’t help but wonder whether our research software can be made more sustainable, | ||||||
and more efficient as a byproduct. | ||||||
In particular, this question arises for scientific software used in high-throughput scientific | ||||||
computing, where large datasets composed of many similar chunks are analysed with similar operations | ||||||
on each chunk of data. | ||||||
Moreover, CPU/GPU-efficient software algorithms are crucial for the real-time data selection (trigger) | ||||||
systems in LHC experiments, | ||||||
as the initial data analysis necessary to select interesting collision events | ||||||
is executed on a computing farm located at CERN that has finite CPU resources. | ||||||
|
||||||
The questions we want to start answering in this work are: | ||||||
* what is the trade off between performance of a ML algorithm and its energetic efficiency? | ||||||
* can small efficiency improvements in ML algorithms running on Large Hadron Collider data | ||||||
have a sizable energetic impact? | ||||||
* how do these energy efficiency improvements vary | ||||||
when using different computing architectures (1) and/or job submission systems (2)? | ||||||
|
||||||
## Task ideas | ||||||
|
||||||
The students in this project will use metrics from the [Green Software Foundation](<https://greensoftware.foundation>) | ||||||
and from other selected resources to estimate the energy efficiency of machine learning software from LHC experiments | ||||||
(namely, top tagging using ATLAS Open data) and from machine learning algorithms for data compression | ||||||
(there is another GSoC project developing this code, called Baler). | ||||||
This work will build on previous GSoC / Master's thesis work, and will expand these results for GPU architectures. | ||||||
If time allows, the student will then have the chance to make small changes to the code | ||||||
to make it more efficient, and evaluate possible savings. | ||||||
|
||||||
## Expected results and milestones | ||||||
|
||||||
* Understand and summarise the metrics for software energy consumption, focusing on computing resources at CERN; | ||||||
* Become familiar with running and debugging the selected software frameworks and algorithms; | ||||||
* Set up tests and visualisation for applying metrics to the selected software | ||||||
* Run tests and visualise results (preferably using a Jupyter notebook) | ||||||
* Vary platforms and job submission systems | ||||||
* Identify possible improvements, apply them, and run tests again | ||||||
|
||||||
## Requirements | ||||||
|
||||||
* Python | ||||||
* git | ||||||
* Jupyter notebooks | ||||||
* PyTorch or equivalent ML toolkit | ||||||
* Desirable: code profiling experience | ||||||
|
||||||
## Mentors | ||||||
|
||||||
* **[Caterina Doglioni](mailto:[email protected])** | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
If possible add a second mentor, even just as backup There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Thanks, done |
||||||
|
||||||
## Links | ||||||
|
||||||
* (1) [Green Software Foundation course](<https://learn.greensoftware.foundation/>) | ||||||
* (2) [Code by the previous GSoC student](<https://summerofcode.withgoogle.com/archive/2023/projects/Nks9akq7>) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,6 +11,7 @@ layout: plain | |
* Lukas Breitwieser [[email protected]](mailto:[email protected]) CERN | ||
* Andy Buckley [[email protected]](mailto:[email protected]) UofGlasgow | ||
* Vipul Cariappa [[email protected]](mailto:[email protected]) CompRes | ||
* Caterina Doglioni [[email protected]](mailto:[email protected]) UManchester | ||
* Mateusz Fila [[email protected]](mailto:[email protected]) CERN | ||
* Chris Gutschow [[email protected]](mailto:[email protected]) UCLondon | ||
* Aaron Jomy [[email protected]](mailto:[email protected]) CERN/CompRes | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To differentiate the title from last year