Commit 23f870b

Merge branch 'main' into slurm/zen2
2 parents 38929e7 + eee97bb commit 23f870b

2 files changed: +2 -1 lines changed

docs/alps/hardware.md

Lines changed: 1 addition & 0 deletions
@@ -57,6 +57,7 @@ There are currently five node types in Alps:
 Please [get in touch](https://github.com/eth-cscs/cscs-docs/issues) if there is information that you want to see here.
 
 There are 24 cabinets, in 4 rows with 6 cabinets per row, and each cabinet contains 112 nodes (for a total of 448 GH200):
+
 * 8 chassis per cabinet
 * 7 blades per chassis
 * 2 nodes per blade
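
The per-cabinet figures in the context lines above follow from simple multiplication. A minimal sketch in Python that checks them, assuming four GH200 modules per node (inferred from the 448 / 112 ratio in the text, not stated in this diff); all variable names are illustrative:

```python
# Counts taken from the hardware.md context lines.
CABINETS = 24            # 4 rows x 6 cabinets per row
CHASSIS_PER_CABINET = 8
BLADES_PER_CHASSIS = 7
NODES_PER_BLADE = 2

# Assumption: 4 GH200 modules per node, inferred from 448 / 112.
GH200_PER_NODE = 4

nodes_per_cabinet = CHASSIS_PER_CABINET * BLADES_PER_CHASSIS * NODES_PER_BLADE
gh200_per_cabinet = nodes_per_cabinet * GH200_PER_NODE
total_nodes = CABINETS * nodes_per_cabinet

print(nodes_per_cabinet)   # 112
print(gh200_per_cabinet)   # 448
print(total_nodes)         # 2688
```

Read this way, the 448 GH200 in the sentence is a per-cabinet count, and the 24 cabinets together would hold 2688 nodes under the same assumptions.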

docs/software/communication/nccl.md

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ While the container engine sets these automatically when using the NCCL hook, th
 Note that this option may be set to `1` by default on some Alps clusters.
 See [the Cray MPICH documentation][ref-communication-cray-mpich] for more details on GPU-aware MPI with Cray MPICH.
 
-!!! warning "`invalid usage` error with `NCCL_NET="AWS Libfabric`"
+!!! warning "`invalid usage` error with `NCCL_NET="AWS Libfabric"`"
 If you are getting error messages such as:
 ```console
 nid006352: Test NCCL failure common.cu:958 'invalid usage (run with NCCL_DEBUG=WARN for details)
