Skip to content

Commit d00de38

Browse files
committed
node-images
1 parent c9e374a commit d00de38

File tree

2 files changed

+2702
-9
lines changed

2 files changed

+2702
-9
lines changed

docs/alps/hardware.md

Lines changed: 26 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -40,20 +40,37 @@ Alps was installed in phases, starting with the installation of 1024 AMD Rome du
4040

4141
There are currently four node types in Alps, with another becoming available in 2025:
4242

43-
| type | blades | nodes | CPU sockets | GPU devices |
44-
| ---- | ------:| -----:| -----------:| -----------:|
45-
| NVIDIA GH200 | 1344 | 2688 | 10,752 | 10,752 |
46-
| AMD Rome | 256 | 1024 | 2,048 | -- |
47-
| NVIDIA A100 | 72 | 144 | 144 | 576 |
48-
| AMD MI250x | 12 | 24 | 24 | 96 |
49-
| AMD MI300A | 64 | 128 | 512 | 512 |
43+
| type | abbreviation | blades | nodes | CPU sockets | GPU devices |
44+
| ---- | ------- | ------:| -----:| -----------:| -----------:|
45+
| NVIDIA GH200 | gh200 | 1344 | 2688 | 10,752 | 10,752 |
46+
| AMD Rome | zen2 | 256 | 1024 | 2,048 | -- |
47+
| NVIDIA A100 | a100 | 72 | 144 | 144 | 576 |
48+
| AMD MI250x | mi200 | 12 | 24 | 24 | 96 |
49+
| AMD MI300A | mi300 | 64 | 128 | 512 | 512 |
5050

5151
[](){#ref-alps-gh200-node}
5252
### NVIDIA GH200 GPU Nodes
5353

54-
!!! todo
54+
There are 24 cabinets, in 4 rows with 6 cabinets per row:
55+
56+
* 8 chassis per cabinet
57+
* 7 blades per chassis
58+
* a chassis can contain up to 8 blades, however Alps' gh200 chassis are underpopulated so that we can increase the amount of power delivered to each node.
59+
* 2 nodes per blade
60+
61+
Each node contains four Grace-Hopper modules and four corresponding network interface cards (NICS) per blade, as illustrated below:
62+
63+
![](../images/alps/gh200-schematic.svg)
64+
65+
??? info "node xnames"
66+
There are two boards per blade with one node per board.
67+
This is different to the `zen2` CPU-only nodes (used for example in Eiger) that had two nodes per board for a total of four nodes per blade.
68+
As such, there are no `n1` nodes in the xname list, e.g.:
69+
```
70+
x1100c0s6b0n0
71+
x1100c0s6b1n0
72+
```
5573

56-
Blanca Peak
5774

5875
[](){#ref-alps-zen2-node}
5976
### AMD Rome CPU Nodes

0 commit comments

Comments
 (0)