Commit 0927f3d

more work on clariden
1 parent c10fc94 commit 0927f3d

3 files changed: +22 -30 lines changed

docs/tools/container-engine.md

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-[](){#container-engine}
+[](){#ref-container-engine}
 # Container Engine
 
 The Container Engine (CE) toolset is designed to enable computing jobs to seamlessly run inside Linux application containers, thus providing support for containerized user environments.

docs/tools/slurm.md

Lines changed: 6 additions & 0 deletions
@@ -9,6 +9,12 @@ SLURM is an open-source, highly scalable job scheduler that allocates computing
 !!! todo
 document `--account`, `--constrant` and other generic flags.
 
+[](){#ref-slurm-running-jobs}
+## Running jobs
+
+!!! todo
+document `srun --pty`, `sbatch`, `squeue`, `scontrol`, `salloc` with a few common usage examples
+
 [](){#ref-slurm-partitions}
 ## Partitions

docs/vclusters/clariden.md

Lines changed: 15 additions & 29 deletions
@@ -18,11 +18,13 @@ As usual the login nodes have direct internet connections, while the compute nod
 ### File systems and storage
 The scratch filesystem is hosted on [IOPStore][ref-storage-iopstor], but also the capacity storage [Capstor][ref-storage-capstor] is mounted at `/capstor/scratch/cscs`.
 The variables `STORE` and `PROJECT` are not set on Clariden.
-!!! todo
-verify this claim after maintenance window
-The home directory is hosted on [VAST][ref-storage-vast]. As usual, an overview of your quota on the different filesystems can be obtained by the `quota` command.
+The home directory is hosted on [VAST][ref-storage-vast].
+
+As usual, an overview of your quota on the different filesystems, can be obtained by the `quota` command.
 
 ## Getting started
+The project and resources are managed by [this tool][ref-account-waldur].
+
 ### Connect to Clariden
 You can connect to Clariden via [ssh][ref-ssh-config], ensuring that the file `~/.ssh/config` has these settings (replace `cscsusername` with your username).

@@ -41,44 +43,28 @@ Host clariden
 ```
 You can then use `ssh clariden` to login to Clariden.
 
-!!! todo
-how to log in, i.e. `ssh clariden.cscs.ch` via `ela.cscs.ch`
-
-provide the snippet to add to your `~/.ssh/config`, and link to where we document this (docs not currently available)
+### Available programming environments
 
-## Software and services
+#### Container engine
+The recommended way for working on Clariden are containerized workflows leveraging the [container engine][ref-container-engine].
 
-!!! todo
-information about CSCS services/tools available
+#### UENV
+Besides running containerized workflows, it is possible to run your jobs with a [UENV][ref-tool-uenv].
 
-* container engine
-* uenv
-* CPE
-* ... etc
+#### CPE
+Unlike on other platforms the Cray programming environment is not supported on Clariden.
 
-## Running Jobs on Clariden
+### Running Jobs on Clariden
 
 Clariden uses [SLURM][slurm] as the workload manager, which is used to launch and monitor distributed workloads, such as training runs.
 
 See detailed instructions on how to run jobs on the [Grace-Hopper nodes][ref-slurm-gh200].
 
-## Storage
-
-!!! todo
-describe the file systems that are attached, and where.
-
-This is where `$SCRATCH`, `$PROJECT` etc are defined for this cluster.
-
-Refer to the specific file systems that these map onto (capstor, iopstor, waldur), and link to the storage docs for these.
-
-Also discuss any specific storage policies. You might want to discuss storage policies for MLp one level up, in the [MLp docs][ref-platform-mlp].
-
-* attached storage and policies
+The flag `--account=<account>` / `-A <account>` is mandatory for submitting jobs to SLURM, and nodehour accounting will be on the account that is specified with this flag.
 
 ## Calendar and key events
 
-The system is updated every Tuesday, between 9 am and 12 pm.
-...
+The system is updated every Wednesday, between 8 am and 12 pm Zurich local time.
 
 !!!todo
 notifications
