Commit 5a8cb8b

refactor layout around alps and platforms
1 parent 6a58959 commit 5a8cb8b

13 files changed

+177
-87
lines changed

docs/access/ump.md

Lines changed: 48 additions & 4 deletions
Original file line number | Diff line number | Diff line change
@@ -1,6 +1,50 @@
1-
# User Management Portal
1+
[](){#ump}
2+
# Account and Resources Management Tool
23

3-
!!! todo
4-
copy over docs from [confluence](https://confluence.cscs.ch/display/KB/Account+and+Resources+Management+Tool)
4+
The Swiss National Supercomputing Centre (CSCS) offers a web-based tool for users to manage their accounts and projects at [account.cscs.ch](https://account.cscs.ch).
55

6-
15 minute job
6+
With this tool, users can:
7+
8+
- Access their profile, manage institutional details, or reset their password.
9+
- List the projects they belong to, including closed ones.
10+
- Check details on each project, quotas, and current utilization.
11+
- Get an overview of where their files are stored at CSCS (including home directories, scratch, etc.).
12+
13+
For group leaders (or PIs), the tool allows:
14+
15+
- Managing user membership and access control.
16+
- Inviting users to their projects via email. Existing users can accept immediately, while new users will receive instructions to create an account and join the project.
17+
- Removing users from their projects.
18+
- Selecting which users can access a system (and submit jobs) and which ones can only access project data.
19+
- Defining one or more deputies to perform such tasks.
20+
**Note:** The responsibility of what happens within the project still belongs to the group leader or PI.
21+
22+
A short guideline on how to perform these tasks is provided below.
23+
24+
## Usage
25+
26+
The tool is designed to be intuitive and comprises the following main areas:
27+
28+
- **A) Account selector**: For users with multiple accounts (e.g., service accounts).
29+
- **B) Profile management**: To view and edit the account's institutional details and change the password.
30+
- **C) Project membership**: To show the selected project in detail.
31+
- **D) Storage**: Where users can see where they have stored their files (home, scratch, and project areas).
32+
- **E) Main view**
33+
34+
![Screenshot](../images/access/ump.png)
35+
36+
### Membership Management (for Group Leaders and Deputies Only)
37+
38+
To invite users to a selected project, group leaders or their deputies need to:
39+
40+
1. Select the project on the left menu.
41+
2. Click the "Members" tab.
42+
3. Scroll down to the "Users" (or "Deputies" to manage deputies) section.
43+
4. Use the "+" (plus) button on the right of the section and enter the given and family names and email address of the invitee.
44+
The invitee will receive instructions on how to join the project. The group leader will get a confirmation on whether the invitee has accepted or rejected the invitation.
45+
If the invitee does not have an account, they will also receive instructions on how to create one, which needs to be verified by CSCS administration staff.
46+
47+
To remove users from a selected project, group leaders or their deputies need to:
48+
49+
1. Repeat steps 1 to 3 above.
50+
2. Use the icon with the three horizontal lines (see screenshot below) that is on the right of the user and select "Remove user."

docs/alps/hardware.md

Lines changed: 20 additions & 2 deletions
@@ -1,3 +1,4 @@
1+
[](){#alps-hardware}
12
# Alps Hardware
23

34
Alps is an HPE Cray EX3000 system, a liquid-cooled, blade-based, high-density system.
@@ -20,7 +21,19 @@ This approach to cooling provides greater efficiency for the rack-level cooling,
2021
* Maximum of 64 quad-blade compute blades
2122
* Maximum of 64 Slingshot switch blades
2223

23-
## Alps Blades
24+
## Alps High Speed Network
25+
26+
!!! todo
27+
information about the network.
28+
29+
* Details about Slingshot 11.
30+
* how many NICs per node
31+
* raw feeds and speeds
32+
* Some OSU benchmark results.
33+
* GPU-aware communication
34+
* **Slingshot is not InfiniBand - there is no NVSwitch**
35+
36+
## Alps Nodes
2437

2538
Alps was installed in phases, starting with the installation of 1,024 dual-socket AMD Rome CPU nodes in 2020, through to the main installation of 2,688 Grace-Hopper nodes in 2024.
2639

@@ -34,26 +47,31 @@ There are currently four node types in Alps, with another becoming available in
3447
| AMD MI250x | 12 | 24 | 24 | 96 |
3548
| AMD MI300A | 64 | 128 | 512 | 512 |
3649

50+
[](){#gh200-node}
3751
### NVIDIA GH200 GPU Nodes
38-
[](){#gh200-hardware-description}
3952

4053
Perry Peak
4154

55+
[](){#zen2-node}
4256
### AMD Rome CPU Nodes
4357

4458
EX425
4559

60+
[](){#a100-node}
4661
### NVIDIA A100 GPU Nodes
4762

4863
Grizzly Peak
4964

65+
[](){#mi200-node}
5066
### AMD MI250x GPU Nodes
5167

5268
Bard Peak
5369

70+
[](){#mi300-node}
5471
### AMD MI300A GPU Nodes
5572

5673
Parry Peak
5774

5875
!!! info "coming soon"
5976
H1 2025
77+

docs/alps/index.md

Lines changed: 18 additions & 20 deletions
@@ -1,39 +1,37 @@
11
# Alps Infrastructure
22

3-
Alps is a general-purpose compute and data Research Infrastructure (RI) open to the broad community of researchers in Switzerland and the rest of the world. Alps will provide a high impact, challenging and innovative RI that will allow Switzerland to advance science and impact society.
3+
Alps is a general-purpose compute and data Research Infrastructure (RI) open to the broad community of researchers in Switzerland and the rest of the world.
4+
Alps provides a high-impact, challenging and innovative RI that allows Switzerland to advance science and impact society.
45

5-
Alps enables the creation of versatile clusters (vClusters) that can be tailored to the specific needs of users while maintaining confidentiality. For example, a vCluster will be dedicated to MeteoSwiss’ numerical weather forecasts, another one to the User Lab and another one to Machine Learning and Artificial Intelligence.
6+
Alps enables the creation of versatile clusters (vClusters) that can be tailored to the specific needs of users while maintaining confidentiality.
7+
For example, a vCluster will be dedicated to MeteoSwiss’ numerical weather forecasts, another one to the User Lab and another one to Machine Learning and Artificial Intelligence.
8+
9+
A key feature of Alps is multi-tenancy, where a tenant is an organization, typically a research institution, that deploys, operates, or manages its platform on the Alps infrastructure.
10+
Tenants have privileged access to resource nodes, enabling them to deploy their own services and resource configurations.
11+
Additionally, network segregation ensures secure and isolated communication, with the option to connect to the tenant's private network.
612

713
<div class="grid cards" markdown>
814

9-
- :fontawesome-solid-signs-post: __Hardware__
15+
- :fontawesome-solid-signs-post: __Platforms__
1016

11-
Learn about the node types and networking infrastructure in Alps.
17+
[:octicons-arrow-right-24: Alps Platforms][platforms]
1218

13-
[:octicons-arrow-right-24: Alps Hardware](hardware.md)
19+
- :fontawesome-solid-signs-post: __Clusters__
1420

15-
- :fontawesome-solid-signs-post: __Network__
21+
The resources on Alps are partitioned and configured into versatile software-defined clusters (vClusters).
1622

17-
Learn about the Slingshot 11 network on Alps.
23+
[:octicons-arrow-right-24: Alps vClusters][clusters]
1824

19-
[:octicons-arrow-right-24: Alps Network](network.md)
25+
- :fontawesome-solid-signs-post: __Hardware__
26+
27+
Learn about the node types and networking infrastructure in Alps.
28+
29+
[:octicons-arrow-right-24: Alps Hardware](hardware.md)
2030

2131
- :fontawesome-solid-signs-post: __Storage__
2232

2333
Learn about the file systems attached to Alps.
2434

2535
[:octicons-arrow-right-24: Alps Storage](storage.md)
2636

27-
- :fontawesome-solid-signs-post: __vClusters__
28-
29-
The resources on Alps are partitioned and configured into versatile software defined clusters (vClusters).
30-
31-
[:octicons-arrow-right-24: Alps vClusters](vclusters.md)
32-
33-
- :fontawesome-solid-signs-post: __Tenants__
34-
35-
Alps is a multi-tenant system.
36-
37-
[:octicons-arrow-right-24: Alps Tenants](tenants.md)
38-
3937
</div>

docs/alps/platforms.md

Lines changed: 28 additions & 0 deletions
@@ -0,0 +1,28 @@
1+
[](){#platforms}
2+
# Platforms on Alps
3+
4+
A platform represents a set of scientific services along with compute and data resources hosted on the Alps research infrastructure, provided to a specific scientific community.
5+
Each platform addresses particular research needs and domains, such as climate and weather modeling, machine learning, or high-performance computing applications.
6+
A platform can consist of one or multiple [clusters][clusters], and its services can be managed either by CSCS or by the scientific community itself, including access control, usage policies, and support.
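As a rough illustration of the relationship described above, the following sketch models a platform as a named collection of clusters. The `Platform` class, its fields, and the `platform_of` helper are hypothetical names made up for this example and are not part of any CSCS tooling; the cluster names are the two MLp clusters mentioned elsewhere in this commit.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Platform:
    """Hypothetical model: a platform groups one or more clusters."""
    name: str
    clusters: List[str] = field(default_factory=list)


def platform_of(cluster: str, platforms: List[Platform]) -> Optional[str]:
    """Return the name of the platform that provides `cluster`, or None."""
    for platform in platforms:
        if cluster in platform.clusters:
            return platform.name
    return None


# The Machine Learning Platform with the clusters named in this commit.
mlp = Platform(name="MLp", clusters=["clariden", "bristen"])
```

With this model, looking up `platform_of("clariden", [mlp])` resolves a cluster back to the platform that documents it, which mirrors how the pages link clusters to platforms.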
7+
8+
<div class="grid cards" markdown>
9+
10+
- :fontawesome-solid-mountain: __Machine Learning Platform__
11+
12+
The Machine Learning Platform (MLp) hosts ML and AI researchers.
13+
14+
[:octicons-arrow-right-24: MLp][mlp]
15+
16+
- :fontawesome-solid-mountain: __HPC Platform__
17+
18+
!!! todo
19+
20+
[:octicons-arrow-right-24: HPCp][hpcp]
21+
22+
- :fontawesome-solid-mountain: __Climate and Weather Platform__
23+
24+
!!! todo
25+
26+
[:octicons-arrow-right-24: CWp][cwp]
27+
28+
</div>

docs/alps/tenants.md

Lines changed: 0 additions & 7 deletions
This file was deleted.

docs/alps/vclusters.md

Lines changed: 30 additions & 6 deletions
@@ -1,10 +1,34 @@
1-
# Alps vClusters
1+
[](){#clusters}
2+
# Alps Clusters
23

3-
!!! todo
4-
this page answers the question "what is a vCluster"?
4+
A vCluster (versatile software-defined cluster) is a logical partition of the supercomputing resources where platform services are deployed. It serves as a dedicated environment supporting a specific platform. The composition of resources and services for each vCluster is defined in a configuration file used by an automated pipeline for deployment. Once deployed by CSCS, the vCluster becomes immutable.
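The immutability described above can be pictured with a small sketch. The fields below (`name`, `platform`, `node_type`, `services`) and the example values are invented for illustration; they do not reflect the real configuration file format or deployment pipeline, which this page does not describe.

```python
from dataclasses import dataclass, FrozenInstanceError
from typing import Tuple


@dataclass(frozen=True)
class VClusterConfig:
    """Hypothetical vCluster definition; frozen to mimic post-deployment immutability."""
    name: str
    platform: str
    node_type: str
    services: Tuple[str, ...] = ()


def try_modify(config: VClusterConfig) -> bool:
    """Attempt to change a deployed configuration; return True only if it succeeded."""
    try:
        config.node_type = "a100"  # rejected because the dataclass is frozen
    except FrozenInstanceError:
        return False
    return True


# Illustrative values only: the services tuple is an assumption.
clariden = VClusterConfig(
    name="clariden",
    platform="MLp",
    node_type="gh200",
    services=("scheduler",),
)
```

In this sketch, changing a deployed vCluster means creating a new `VClusterConfig` rather than editing the existing one, which is one way to read "immutable once deployed".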
55

6-
* What is a vCluster?
7-
* Examples of vClusters
6+
## Clusters on Alps
7+
8+
Clusters on Alps are provided as part of different [platforms][platforms].
9+
10+
<div class="grid cards" markdown>
11+
- :fontawesome-solid-mountain: __Machine Learning Platform__
12+
13+
Clariden is the main Grace-Hopper cluster.
14+
15+
[:octicons-arrow-right-24: Clariden][clariden]
16+
17+
Bristen is a small system with A100 nodes, used for **todo**
18+
19+
[:octicons-arrow-right-24: Bristen][bristen]
20+
</div>
21+
22+
<div class="grid cards" markdown>
23+
- :fontawesome-solid-mountain: __HPC Platform__ { .col-span-12 }
24+
25+
!!! todo
26+
</div>
27+
28+
<div class="grid cards" markdown>
29+
- :fontawesome-solid-mountain: __Climate and Weather Platform__
30+
31+
!!! todo
32+
</div>
833

9-
We don't document individual vClusters here - these are documented under each platform.
1034

docs/images/access/ump.png

60.4 KB

docs/index.md

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ Welcome to the technical documentation for Alps.
88

99
Once you have a project at CSCS, start here to find your platform:
1010

11-
[:octicons-arrow-right-24: Platforms overview](platforms/index.md)
11+
[:octicons-arrow-right-24: Platforms overview][platforms]
1212

1313
Go straight to the documentation for the platform that hosts your project:
1414

docs/platforms/index.md

Lines changed: 0 additions & 26 deletions
This file was deleted.

docs/platforms/mlp/index.md

Lines changed: 11 additions & 2 deletions
@@ -21,8 +21,17 @@ Once invited to a project, you will receive an email, which you can need to crea
2121

2222
The main cluster provided by the MLp is Clariden, a large Grace-Hopper GPU system on Alps.
2323

24-
!!! todo
25-
introduction paragraph and cards that link to Clariden and Bristen
24+
<div class="grid cards" markdown>
25+
- :fontawesome-solid-mountain: [__Clariden__][clariden]
26+
27+
Clariden is the main [Grace-Hopper][gh200-node] cluster used for **todo**
28+
</div>
29+
30+
<div class="grid cards" markdown>
31+
- :fontawesome-solid-mountain: [__Bristen__][bristen]
32+
33+
Bristen is a smaller system with [A100 GPU nodes][a100-node] for **todo**
34+
</div>
2635

2736
## Guides and Tutorials
2837
