Skip to content

Commit ccea2c4

Browse files
committed
Merge branch 'main' into jupyter-update
2 parents ba052d4 + 85186ed commit ccea2c4

File tree

8 files changed

+12919
-19
lines changed

8 files changed

+12919
-19
lines changed

docs/access/jupyterlab.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,9 @@
33

44
## Access and setup
55

6-
The JupyterHub service enables the interactive execution of JupyterLab on the compute nodes of [Daint][ref-cluster-daint], [Clariden][ref-cluster-clariden] and [Santis][ref-cluster-santis].
6+
The JupyterHub service enables the interactive execution of JupyterLab on the compute nodes of [Daint][ref-cluster-daint], [Clariden][ref-cluster-clariden], [Santis][ref-cluster-santis] and [Eiger][ref-cluster-eiger].
77

8-
The service is accessed at [jupyter-daint.cscs.ch](https://jupyter-daint.cscs.ch/), [jupyter-clariden.cscs.ch](https://jupyter-clariden.cscs.ch/) and [jupyter-santis.cscs.ch](https://jupyter-clariden.cscs.ch/), respectively. As the notebook servers are executed on compute nodes, you must have a project with compute resources available on the respective cluster.
8+
The service is accessed at [jupyter-daint.cscs.ch](https://jupyter-daint.cscs.ch/), [jupyter-clariden.cscs.ch](https://jupyter-clariden.cscs.ch/), [jupyter-santis.cscs.ch](https://jupyter-clariden.cscs.ch/) and [jupyter-eiger.cscs.ch](https://jupyter-eiger.cscs.ch), respectively. As the notebook servers are executed on compute nodes, you must have a project with compute resources available on the respective cluster.
99

1010
Once logged in, you will be redirected to the JupyterHub Spawner Options form, where typical job configuration options can be selected. These options might include the type and number of compute nodes, the wall time limit, and your project account.
1111

docs/alps/hardware.md

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -78,6 +78,17 @@ Each node contains four Grace-Hopper modules and four corresponding network inte
7878
x1100c0s6b1n0
7979
```
8080

81+
??? info "Core-to-core latency"
82+
The core-to-core latency on a Grace CPU (collected using the [`core-to-core-latency`](https://github.com/nviennot/core-to-core-latency) program):
83+
84+
![](../images/alps/grace-core-to-core-latency.svg)
85+
86+
The latencies between the first cores on each of the four Grace CPUs within a node:
87+
88+
![](../images/alps/grace-core-to-core-latency-cross-socket.svg)
89+
90+
Note the significantly higher latencies compared to within a single Grace CPU.
91+
8192
[](){#ref-alps-zen2-node}
8293
### AMD Rome CPU Nodes
8394

docs/clusters/daint.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -127,7 +127,7 @@ See the Slurm documentation for instructions on how to run jobs on the [Grace-Ho
127127

128128
### FirecREST
129129

130-
Daint can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/ml/firecrest/v2` API endpoint.
130+
Daint can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/hpc/firecrest/v2` API endpoint.
131131

132132
!!! warning "The FirecREST v1 API is still available, but deprecated"
133133

docs/clusters/eiger.md

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -158,12 +158,15 @@ There are multiple [Slurm partitions][ref-slurm-partitions] on the system:
158158

159159
See the Slurm documentation for instructions on how to run jobs on the [AMD CPU nodes][ref-slurm-amdcpu].
160160

161-
### Jupyter and FirecREST
161+
### JupyterHub
162162

163-
!!! under-construction "Jupyter is not yet available"
164-
[Jupyter][ref-jupyter] has not yet been configured on `Eiger.Alps`.
163+
A [JupyterHub][ref-jupyter] service for Eiger is available at [https://jupyter-eiger.cscs.ch](https://jupyter-eiger.cscs.ch).
165164

166-
**It will be deployed as soon as possible and this documentation will be updated accordingly**
165+
### FirecREST
166+
167+
Eiger can also be accessed using [FirecREST][ref-firecrest] at the `https://api.cscs.ch/hpc/firecrest/v2` API endpoint.
168+
169+
!!! warning "The FirecREST v1 API is still available, but deprecated"
167170

168171
## Maintenance and status
169172

docs/images/alps/grace-core-to-core-latency-cross-socket.svg

Lines changed: 304 additions & 0 deletions
Loading

docs/images/alps/grace-core-to-core-latency.svg

Lines changed: 12585 additions & 0 deletions
Loading

docs/running/slurm.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -300,8 +300,10 @@ if [[ $SLURM_LOCALID -eq 0 ]]; then
300300
CUDA_VISIBLE_DEVICES=0,1,2,3 nvidia-cuda-mps-control -d
301301
fi
302302

303-
# Set CUDA device
304-
numa_nodes=$(hwloc-calc --physical --intersect NUMAnode $(hwloc-bind --get --taskset))
303+
# Set CUDA device. Disable HWLOC_KEEP_NVIDIA_GPU_NUMA_NODES to avoid GPU NUMA
304+
# nodes appearing in the list of CUDA devices. They start appearing in hwloc
305+
# version 2.11.
306+
numa_nodes=$(HWLOC_KEEP_NVIDIA_GPU_NUMA_NODES=0 hwloc-calc --physical --intersect NUMAnode $(hwloc-bind --get --taskset))
305307
export CUDA_VISIBLE_DEVICES=$numa_nodes
306308

307309
# Wait for MPS to start

docs/software/uenv/index.md

Lines changed: 5 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -52,16 +52,11 @@ $ uenv --version
5252

5353
On Alps clusters the most recent version 8.1.0 is installed.
5454

55-
??? warning "Out of date uenv version on Eiger and Balfrin"
55+
??? warning "Out of date uenv version on Balfrin"
5656

57-
The uenv tool available on Eiger and Balfrin is a different version than the one described below, and some commands will be different to those documented here.
57+
The uenv tool available on Balfrin is a different version than the one described below, and some commands will be different to those documented here.
5858

59-
!!! note
60-
This note only applies to the current `eigen.cscs.ch` deployment.
61-
62-
The new [`eiger.alps.cscs.ch`][ref-cluster-eiger] deployment has version 8.1.0 of uenv installed.
63-
64-
Please refer to `uenv --help` for the correct usage on these systems.
59+
Please refer to `uenv --help` for the correct usage on Balfrin.
6560

6661
## Finding uenv
6762

@@ -121,8 +116,8 @@ The search can be refined by providing a [label][ref-uenv-labels].
121116

122117
## Downloading uenv
123118

124-
??? note "Using uenv for the first time on Balfrin and Eiger"
125-
With the old version of uenv installed on Balfrin and Eiger, before downloading your first image, a local directory for storing the images must first be created, otherwise you will receive an error message that the repository does not exist.
119+
??? warning "Using uenv for the first time on Balfrin"
120+
With the old version of uenv installed on Balfrin, before downloading your first image, a local directory for storing the images must first be created, otherwise you will receive an error message that the repository does not exist.
126121

127122
To create a repo in the default location, use the following command:
128123

0 commit comments

Comments
 (0)