Commit 0c81d7a

Merge branch 'main' into main
2 parents b423f48 + 4dd679b

16 files changed, +292 -441 lines

.github/CODEOWNERS

Lines changed: 1 addition & 2 deletions

````diff
@@ -1,7 +1,6 @@
 * @bcumming @msimberg @RMeli
 docs/services/firecrest @jpdorsch @ekouts
-docs/software/communication @msimberg
+docs/software/communication @Madeeks @msimberg
 docs/software/devtools/linaro @jgphpc
 docs/software/prgenv/linalg.md @finkandreas @msimberg
 docs/software/sciapps/cp2k.md @abussy @RMeli
-docs/software/sciapps/lammps.md @nickjbrowning
````

docs/clusters/santis.md

Lines changed: 2 additions & 5 deletions

````diff
@@ -7,10 +7,7 @@ Santis is an Alps cluster that provides GPU accelerators and file systems design

 ### Compute nodes

-Santis consists of around 600 [Grace-Hopper nodes][ref-alps-gh200-node].
-
-!!! note
-    In late March 2025 Santis was temporarily expanded to 1233 nodes for [Gordon Bell and HPL runs][ref-gb2025].
+Santis consists of around 430 [Grace-Hopper nodes][ref-alps-gh200-node].

 The number of nodes can change when nodes are added or removed from other clusters on Alps.

@@ -19,7 +16,7 @@ You will be assigned to one of the four login nodes when you ssh onto the system

 | node type | number of nodes | total CPU sockets | total GPUs |
 |-----------|-----------------| ----------------- | ---------- |
-| [gh200][ref-alps-gh200-node] | 600 | 2,400 | 2,400 |
+| [gh200][ref-alps-gh200-node] | 430 | 1,720 | 1,720 |

 ### Storage and file systems

````

docs/guides/gb2025.md

Lines changed: 0 additions & 83 deletions
This file was deleted.

docs/guides/storage.md

Lines changed: 112 additions & 0 deletions

````diff
@@ -1,6 +1,118 @@
 [](){#ref-guides-storage}
 # Storage

+[](){#ref-guides-storage-sharing}
+## Sharing files and data
+
+Newly created user folders are not accessible by other groups or users on CSCS systems.
+Linux [Access Control Lists](https://www.redhat.com/en/blog/linux-access-control-lists) (ACLs) let you grant access to one or more groups or users.
+
+In traditional POSIX, access permissions are granted to `user/group/other` in mode `read`/`write`/`execute`.
+The permissions can be checked with the `-l` option of the `ls` command.
+For instance, if `user1` owns the folder `test`, the output would be the following:
+
+```console title="Checking POSIX permissions with ls"
+$ ls -lahd test/
+drwxr-xr-x 2 user1 csstaff 4.0K Feb 23 13:46 test/
+```
+
+ACLs are an extension of these permissions that lets you give one or more users or groups access to your data.
+The ACLs of the same `test` folder of `user1` can be shown with the command `getfacl`:
+
+```console title="Checking permissions with getfacl"
+$ getfacl test
+# file: test
+# owner: user1
+# group: csstaff
+user::rwx
+group::r-x
+other::r-x
+```
+
+The command `setfacl` is used to change the ACLs of a file or directory.
+
+To grant users or groups read/write/execute access to a file or directory, use the `-m,--modify` flag (or `-M,--modify-file` to read the ACL entries from a file).
+
+!!! example "give user2 read+write access to test"
+    Where `test` is owned by `user1`.
+    ```console
+    $ setfacl -m user:user2:rw test/
+
+    $ getfacl test/
+    # file: test
+    # owner: user1
+    # group: csstaff
+    user::rwx
+    user:user2:rw-
+    group::r-x
+    mask::rwx
+    other::r-x
+    ```
+
+The `-x,--remove` flag (or `-X,--remove-file`) removes ACL entries.
+
+!!! example "remove user2 access to test"
+    This reverts the access that was granted in the previous example.
+    ```console
+    $ setfacl -x user:user2 test/
+
+    $ getfacl test/
+    # file: test
+    # owner: user1
+    # group: csstaff
+    user::rwx
+    group::r-x
+    mask::rwx
+    other::r-x
+    ```
+
+Access rights can also be granted recursively to a folder and its children (if they exist) using the `-R,--recursive` option.
+
+!!! note
+    This applies only to existing files: files added after this call won't inherit the permissions.
+
+!!! example "recursively grant user2 access to test and its contents"
+    ```console
+    $ setfacl -Rm user:user2:rwx test
+
+    $ getfacl test/subdir
+    # file: test/subdir
+    # owner: user1
+    # group: csstaff
+    user::rwx
+    user:user2:rwx
+    group::---
+    group:csstaff:r-x
+    mask::rwx
+    other::---
+    ```
+
+To set a default so that all files and folders created inside your desired path will inherit the permissions, use the `-d,--default` option.
+
+!!! example "grant user2 default access to new files in test"
+    `user2` will have access to files created inside `test` after this call:
+
+    ```console
+    $ setfacl -dm user:user2:rw test/
+
+    $ getfacl test
+    # file: test
+    # owner: user1
+    # group: csstaff
+    user::rwx
+    group::r-x
+    mask::rwx
+    other::r-x
+    default:user::rwx
+    default:user:user2:rw-
+    default:group::r-x
+    default:mask::rwx
+    default:other::r-x
+    ```
+
+!!! info
+    For more information, read the setfacl man page: `man setfacl`.
+
 ## Many small files vs. HPC File Systems

 Workloads that read or create many small files are not well-suited to parallel file systems, which are designed for parallel and distributed I/O.
````
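The grants shown in the examples above can also be undone in one step: `setfacl` has a `-b,--remove-all` flag that strips every extended and default ACL entry at once. A minimal sketch, guarded so it is a no-op where the `acl` tools are not installed (the `test` directory is the same illustrative folder as above):

```shell
# Sketch: clear all extended ACL entries at once with setfacl -b.
mkdir -p test
if command -v setfacl >/dev/null 2>&1; then
    setfacl -m user:"$(id -un)":rwx test/ || true  # add an example ACL entry
    setfacl -b test/ || true                       # -b,--remove-all: strip all ACL entries
    getfacl --omit-header test/ 2>/dev/null        # only user::/group::/other:: entries remain
fi
```

After `-b`, plain `user/group/other` permissions are left untouched; only the extra ACL entries (and the `mask::` entry) are removed.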

docs/policies/support.md

Lines changed: 4 additions & 1 deletion

````diff
@@ -1,4 +1,5 @@
-# UserLab Support Policy
+[](){#ref-support}
+# User Support Policy

 ## 1. User Support Policy

@@ -23,6 +24,7 @@ CSCS reserves the right to decline support for requests that fall outside the sc
 Support will be focused on ensuring that the resources are used in alignment with the approved objectives and goals.
 Requests that significantly deviate from the original proposal may not be accommodated.

+[](){#ref-support-user-apps}
 ## 3. User Applications

 User applications are those brought to CSCS systems by the users, whether they are developed by the users themselves or another third-party.
@@ -32,6 +34,7 @@ CSCS will provide guidance on deploying applications on our systems, including c
 While we can assist with infrastructure-related issues, we can not configure, optimize, debug, or fix the applications themselves.
 Users are responsible for resolving application-specific issues themselves or contacting the respective developers.

+[](){#ref-support-apps}
 ## 4. Officially Supported Applications

 CSCS offers a range of officially supported applications and their respective versions and configurations, which are packaged and released by CSCS or its supply partners.
````

docs/running/slurm.md

Lines changed: 1 addition & 1 deletion

````diff
@@ -75,7 +75,7 @@ In these cases SLURM jobs must be configured to assign multiple ranks to a singl
 This is best done using [NVIDIA's Multi-Process Service (MPS)].
 To use MPS, launch your application using the following wrapper script, which will start MPS on one rank per node and assign GPUs to ranks according to the CPU mask of a rank, ensuring the closest GPU is used:

-```bash
+```bash title="mps-wrapper.sh"
 #!/bin/bash
 # Example mps-wrapper.sh usage:
 # > srun [srun args] mps-wrapper.sh [cmd] [cmd args]
````

docs/services/firecrest.md

Lines changed: 2 additions & 1 deletion

````diff
@@ -45,7 +45,8 @@ FirecREST is available for all three major [Alps platforms][ref-alps-platforms],
 <tr><th>Platform</th><th>Version</th><th>API Endpoint</th><th>Clusters</th></tr>
 <tr><td style="vertical-align: middle;" rowspan="2">HPC Platform</td><td>v1</td><td>https://api.cscs.ch/hpc/firecrest/v1</td><td style="vertical-align: middle;" rowspan="2"><a href="../../clusters/daint">Daint</a>, <a href="../../clusters/eiger">Eiger</a></td></tr>
 <tr> <td>v2</td><td>https://api.cscs.ch/hpc/firecrest/v2</td></tr>
-<tr><td>ML Platform</td><td>v1</td><td>https://api.cscs.ch/ml/firecrest/v1</td><td style="vertical-align: middle;"><a href="../../clusters/bristen">Bristen</a>, <a href="../../clusters/clariden">Clariden</a></td></tr>
+<tr><td style="vertical-align: middle;" rowspan="2">ML Platform</td><td>v1</td><td>https://api.cscs.ch/ml/firecrest/v1</td><td style="vertical-align: middle;" rowspan="2"><a href="../../clusters/bristen">Bristen</a>, <a href="../../clusters/clariden">Clariden</a></td></tr>
+<tr> <td>v2</td><td>https://api.cscs.ch/ml/firecrest/v2</td></tr>
 <tr><td style="vertical-align: middle;" rowspan="2">CW Platform</td><td>v1</td><td>https://api.cscs.ch/cw/firecrest/v1</td><td style="vertical-align: middle;" rowspan="2"><a href="../../clusters/santis">Santis</a></td></tr>
 <tr><td>v2</td><td>https://api.cscs.ch/cw/firecrest/v2</td></tr>
 </table>
````

docs/software/communication/cray-mpich.md

Lines changed: 2 additions & 0 deletions

````diff
@@ -58,12 +58,14 @@ See [this page][ref-slurm-gh200] for more information on configuring SLURM to us

 Alternatively, if you wish to not use GPU-aware MPI, either unset `MPICH_GPU_SUPPORT_ENABLED` or explicitly set it to `0` in your launch scripts.

+[](){#ref-communication-cray-mpich-known-issues}
 ## Known issues

 This section documents known issues related to Cray MPICH on Alps. Resolved issues are also listed for reference.

 ### Existing Issues

+[](){#ref-communication-cray-mpich-cache-monitor-disable}
 #### Cray MPICH hangs

 Cray MPICH may sometimes hang on larger runs.
````

docs/software/communication/libfabric.md

Lines changed: 17 additions & 0 deletions

````diff
@@ -4,4 +4,21 @@
 [Libfabric](https://ofiwg.github.io/libfabric/), or Open Fabrics Interfaces (OFI), is a low level networking library that abstracts away various networking backends.
 It is used by Cray MPICH, and can be used together with OpenMPI, NCCL, and RCCL to make use of the [Slingshot network on Alps][ref-alps-hsn].

+## Using libfabric
+
+If you are using a uenv provided by CSCS, such as [prgenv-gnu][ref-uenv-prgenv-gnu], [Cray MPICH][ref-communication-cray-mpich] is linked against libfabric and the high-speed network will be used.
+No changes are required in applications.
+
+If you are using containers, the system libfabric can be loaded into your container using the [CXI hook provided by the container engine][ref-ce-cxi-hook].
+Using the hook is essential to make full use of the Alps network.
+
+## Tuning libfabric
+
+Tuning libfabric (particularly together with [Cray MPICH][ref-communication-cray-mpich], [OpenMPI][ref-communication-openmpi], [NCCL][ref-communication-nccl], and [RCCL][ref-communication-rccl]) depends on many factors, including the application, workload, and system.
+For a comprehensive overview of libfabric options for the CXI provider (the provider for the Slingshot network), see the [`fi_cxi` man pages](https://ofiwg.github.io/libfabric/v2.1.0/man/fi_cxi.7.html).
+Note that the exact version deployed on Alps may differ, and not all options may be applicable on Alps.
+
+See the [Cray MPICH known issues page][ref-communication-cray-mpich-known-issues] for issues when using Cray MPICH together with libfabric.
+
 !!! todo
+    More options?
````
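A quick way to check which libfabric providers are actually visible on a node is the `fi_info` utility that ships with libfabric. A minimal sketch, assuming `fi_info` is available in your environment (e.g. via a uenv); the output varies by node:

```shell
# Query libfabric for the CXI (Slingshot) provider; print a fallback
# message where fi_info or the provider is unavailable.
if command -v fi_info >/dev/null 2>&1; then
    fi_info -p cxi 2>/dev/null || echo "CXI provider not available on this node"
else
    echo "fi_info not found in this environment"
fi
```

On an Alps compute node the CXI provider should be listed; on a laptop or login node without Slingshot NICs it typically is not.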

docs/software/communication/nccl.md

Lines changed: 75 additions & 4 deletions

````diff
@@ -4,7 +4,78 @@
 [NCCL](https://developer.nvidia.com/nccl) is an optimized inter-GPU communication library for NVIDIA GPUs.
 It is commonly used in machine learning frameworks, but traditional scientific applications can also benefit from NCCL.

-!!! todo
-    - high level description
-    - libfabric/aws-ofi-nccl plugin
-    - configuration options
+## Using NCCL
+
+To use the Slingshot network on Alps, the [`aws-ofi-nccl`](https://github.com/aws/aws-ofi-nccl) plugin must be used.
+With the container engine, the [AWS OFI NCCL hook][ref-ce-aws-ofi-hook] can be used to load the plugin into the container and configure NCCL to use it.
+
+Most uenvs, like [`prgenv-gnu`][ref-uenv-prgenv-gnu], also contain the NCCL plugin.
+When using e.g. the `default` view of `prgenv-gnu`, the `aws-ofi-nccl` plugin will be available in the environment.
+Alternatively, loading the `aws-ofi-nccl` module with the `modules` view also makes the plugin available in the environment.
+The environment variables described below must be set to ensure that NCCL uses the plugin.
+
+While the container engine sets these automatically when using the NCCL hook, the following environment variables should always be set for correctness and optimal performance when using NCCL:
+
+```bash
+export NCCL_NET="AWS Libfabric" # (1)!
+export NCCL_NET_GDR_LEVEL=PHB # (2)!
+export FI_CXI_DEFAULT_CQ_SIZE=131072 # (3)!
+export FI_CXI_DEFAULT_TX_SIZE=32768
+export FI_CXI_DISABLE_HOST_REGISTER=1
+export FI_CXI_RX_MATCH_MODE=software
+export FI_MR_CACHE_MONITOR=userfaultfd
+export MPICH_GPU_SUPPORT_ENABLED=0 # (4)!
+```
+
+1. This forces NCCL to use the libfabric plugin, enabling full use of the Slingshot network. If the plugin cannot be found, applications will fail to start. With the default value, applications would instead fall back to e.g. TCP, which would be significantly slower than with the plugin. [More information about `NCCL_NET`](https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/env.html#nccl-net).
+2. Use GPU Direct RDMA when the GPU and NIC are on the same NUMA node. [More information about `NCCL_NET_GDR_LEVEL`](https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/env.html#nccl-net-gdr-level-formerly-nccl-ib-gdr-level).
+3. This and the other `FI` (libfabric) environment variables have been found to give the best performance on the Alps network across a wide range of applications. Specific applications may perform better with other values.
+4. Disable GPU-aware MPI explicitly, to avoid potential deadlocks between MPI and NCCL.
+
+!!! warning "Using NCCL with uenvs"
+    The environment variables listed above are not set automatically when using uenvs.
+
+!!! warning "GPU-aware MPI with NCCL"
+    Using GPU-aware MPI together with NCCL [can easily lead to deadlocks](https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/mpi.html#inter-gpu-communication-with-cuda-aware-mpi).
+    Unless care is taken to ensure that the two methods of communication are not used concurrently, we recommend not using GPU-aware MPI with NCCL.
+    To disable GPU-aware MPI with Cray MPICH, explicitly set `MPICH_GPU_SUPPORT_ENABLED=0`.
+    Note that this option may be set to `1` by default on some Alps clusters.
+    See [the Cray MPICH documentation][ref-communication-cray-mpich] for more details on GPU-aware MPI with Cray MPICH.
+
+!!! warning "`invalid usage` error with `NCCL_NET="AWS Libfabric"`"
+    If you are getting error messages such as:
+    ```console
+    nid006352: Test NCCL failure common.cu:958 'invalid usage (run with NCCL_DEBUG=WARN for details)
+    ```
+    this may be due to the plugin not being found by NCCL.
+    If this is the case, running the application with the recommended `NCCL_DEBUG=WARN` should print something similar to the following:
+    ```console
+    nid006352:34157:34217 [1] net.cc:626 NCCL WARN Error: network AWS Libfabric not found.
+    ```
+    When using uenvs like `prgenv-gnu`, make sure you are either using the `default` view, which loads `aws-ofi-nccl` automatically, or, if using the `modules` view, load the `aws-ofi-nccl` module with `module load aws-ofi-nccl`.
+    If the plugin is found correctly, running the application with `NCCL_DEBUG=INFO` should print:
+    ```console
+    nid006352:34610:34631 [0] NCCL INFO Using network AWS Libfabric
+    ```
+
+!!! warning "Do not use `NCCL_NET_PLUGIN="ofi"` with uenvs"
+    NCCL has an alternative way of specifying which plugin to use: `NCCL_NET_PLUGIN`.
+    When using uenvs, do not set `NCCL_NET_PLUGIN="ofi"` instead of, or in addition to, `NCCL_NET="AWS Libfabric"`.
+    If you do, your application will fail to start since NCCL will:
+
+    1. fail to find the plugin because of the name of the shared library in the uenv, and
+    2. prefer `NCCL_NET_PLUGIN` over `NCCL_NET`, so it will fail to find the plugin even if `NCCL_NET="AWS Libfabric"` is correctly set.
+
+    When both environment variables are set, the error message, with `NCCL_DEBUG=WARN`, will look similar to when the plugin isn't available:
+    ```console
+    nid006365:179857:179897 [1] net.cc:626 NCCL WARN Error: network AWS Libfabric not found.
+    ```
+
+    With `NCCL_DEBUG=INFO`, NCCL will print:
+    ```console
+    nid006365:180142:180163 [0] NCCL INFO NET/Plugin: Could not find: ofi libnccl-net-ofi.so. Using internal network plugin.
+    ...
+    nid006365:180142:180163 [0] net.cc:626 NCCL WARN Error: network AWS Libfabric not found.
+    ```
+
+    If you only set `NCCL_NET_PLUGIN="ofi"`, NCCL may silently fail to load the plugin but fall back to the default implementation.
````
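The environment variables recommended in this section can be collected in a single launch script. A minimal sketch of a Slurm batch script, assuming a uenv that provides the `aws-ofi-nccl` plugin; the node counts and the application binary `./my_app` are illustrative placeholders only:

```shell
#!/bin/bash
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=4      # e.g. one rank per GPU (illustrative)

# Force NCCL onto the libfabric plugin and apply the tuning from above.
export NCCL_NET="AWS Libfabric"
export NCCL_NET_GDR_LEVEL=PHB
export FI_CXI_DEFAULT_CQ_SIZE=131072
export FI_CXI_DEFAULT_TX_SIZE=32768
export FI_CXI_DISABLE_HOST_REGISTER=1
export FI_CXI_RX_MATCH_MODE=software
export FI_MR_CACHE_MONITOR=userfaultfd
export MPICH_GPU_SUPPORT_ENABLED=0

# Surface plugin-detection problems early; remove for production runs.
export NCCL_DEBUG=INFO

srun ./my_app   # hypothetical application binary
```

With `NCCL_DEBUG=INFO` set, the log line `NCCL INFO Using network AWS Libfabric` confirms the plugin was picked up.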
