Skip to content

Commit 3f85ac7

Browse files
committed
Turn the Turing way into a reference, fix doc link
During review we moved our documentation onto readthedocs, so update that link. We added a reference to The Turing Way, but didn't include the proper citation. Both are fixed. We had two links to documentation. Remove one to avoid duplication.
1 parent 1d03faf commit 3f85ac7

File tree

2 files changed

+4
-3
lines changed

2 files changed

+4
-3
lines changed

docs/paper/paper.bib

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -98,4 +98,5 @@ @article{Jagannadharao2025
9898
year={2025}, title={Timeshifting strategies for carbon-efficient long-running large language model training},
9999
doi={10.1007/s11334-023-00546-x}, journal={Innovations in Systems and Software Engineering }, pages={517-531}, volume={21}}
100100
101-
101+
@book{TuringWay, author={{The Turing Way Community}}, title={{The Turing Way: A handbook for reproducible, ethical and collaborative research (1.2.3)}},
102+
year={2025}, doi={10.5281/zenodo.15213042}}

docs/paper/paper.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -73,11 +73,11 @@ The environmental impact of research computing is increasingly a topic of concer
7373

7474
The climate impact of research computing, computer science, and computational science is hard to understate. Computing impacts the environment in many ways, from water and abiotic resource usage to GHG emissions from energy consumption and electronic waste. For many researchers, the CO$_2$e (CO$_2$-equivalent, the usual metric of carbon footprint aggregating the main GHGs) cost associated with running their models is larger than that of any other aspects of their life [@PortegiesZwart_2020]. The global carbon footprint of data centres is estimated at 126 Mt CO$_2$e [@Malmodin2024], equivalent to the entire American commercial aviation sector, and individual computing projects can reach dozens, if not hundreds, of tonnes of CO$_2$e [@PortegiesZwart_2020; @GrealeyLannelongue2022; @Luccioni_Viguier_Ligozat_2022]. This is becoming clear to researchers and funders, who are exploring different approaches to reducing and accounting for the climate impact of the research they commission [@Weber_2024; @Juckes23; @Lannelongue_Fropier_Matencio_2025]. There is a clear need to provide tools for motivated researchers to minimise the detrimental contribution of their research on the climate emergency, and to educate researchers around this impact.
7575

76-
There are several approaches that have been proposed and/or implemented to reduce the carbon footprint of computing, which are summarised in The Turing Way (https://book.the-turing-way.org/ethical-research/activism/activism-env-impact) and in the CATS documentation (https://greenscheduler.github.io/cats/). One of these is time shifting the computation such that it runs when the power supplied by the local electricity grid is dominated by renewable generation such as on windy or sunny days rather than periods where legacy fossil fuel generation dominates. The potential impact of minimally-invasive time-shifting approaches has been shown to result in significantly reduced carbon footprints [27% in one AI benchmark @Dodge2022]. CATS helps researchers timeshift their own computation such that it is scheduled when the forecast carbon intensity of the power grid is minimised. This timeshifting approach has been applied in the cloud [@Wiesner2021], for large-language model training [@Jagannadharao2025], and to take account of carbon intensity when scaling Kubernetes workloads on Azure [@Norlander], but we are not aware of a similar user facing tool.
76+
There are several approaches that have been proposed and/or implemented to reduce the carbon footprint of computing, which are summarised in The Turing Way [@TuringWay] and in the CATS documentation. One of these is time shifting the computation such that it runs when the power supplied by the local electricity grid is dominated by renewable generation such as on windy or sunny days rather than periods where legacy fossil fuel generation dominates. The potential impact of minimally-invasive time-shifting approaches has been shown to result in significantly reduced carbon footprints [27% in one AI benchmark @Dodge2022]. CATS helps researchers timeshift their own computation such that it is scheduled when the forecast carbon intensity of the power grid is minimised. This timeshifting approach has been applied in the cloud [@Wiesner2021], for large-language model training [@Jagannadharao2025], and to take account of carbon intensity when scaling Kubernetes workloads on Azure [@Norlander], but we are not aware of a similar user facing tool.
7777

7878
# Approach and functionality
7979

80-
At its core CATS is an open-source (MIT licence) Python package (tested with Python 3.9-3.12) that combines data on the forecast carbon intensity of the local electricity supply with information about a proposed job's duration to assess the best start time of the computation within the validity interval of the forecast. Users typically interact with CATS via a command-line interface targeting the UNIX Shell (CATS is tested on Linux and MacOS) and the best start time can be provided in an informative format (that the user can then use with their infrastructure) or in a way that can be passed on to job schedulers to set the calculation start time. CATS is available via the Python Package Index (PyPI) and can be installed along with its handful of dependencies into a Python environment with pip. Development takes place on GitHub (https://github.com/GreenScheduler/cats) and documentation, including a fuller description of the approach, is available (https://greenscheduler.github.io/cats/).
80+
At its core CATS is an open-source (MIT licence) Python package (tested with Python 3.9-3.12) that combines data on the forecast carbon intensity of the local electricity supply with information about a proposed job's duration to assess the best start time of the computation within the validity interval of the forecast. Users typically interact with CATS via a command-line interface targeting the UNIX Shell (CATS is tested on Linux and MacOS) and the best start time can be provided in an informative format (that the user can then use with their infrastructure) or in a way that can be passed on to job schedulers to set the calculation start time. CATS is available via the Python Package Index (PyPI) and can be installed along with its handful of dependencies into a Python environment with pip. Development takes place on GitHub (https://cats.readthedocs.io/) and documentation, including a fuller description of the approach, is available (https://greenscheduler.github.io/cats/).
8181

8282
The user provides CATS with the duration of the proposed computation on the command line. They may also provide the location, the command to run and information about the scheduler. CATS then accesses a prediction of the carbon intensity of the relevant power distribution network and computes the start time that minimises the carbon intensity over the duration of the computation. An illustration of this calculation is provided in Figure 1. Once the carbon intensity minimisation has been completed, CATS can optionally submit the computation to a queueing system or make a more detailed report on the climate impact of the proposed computation.
8383

0 commit comments

Comments
 (0)