Commit c2d2f9c

h200 nodes (#754)
1 parent a92f6b6 commit c2d2f9c

3 files changed

+6
-2
lines changed

triton/ref/gpu.rst

Lines changed: 3 additions & 1 deletion
@@ -9,6 +9,8 @@
 Tesla V100 | ``gpu-v100-32g`` | ``volta`` | ``v100`` | 40 | gpu[28-37] | Volta | 5120 | 32GB | 7.0
 Tesla V100 | ``gpu-v100-16g`` | ``volta`` | ``v100`` | 16 | dgx[1-2] | Volta | 5120 | 16GB | 7.0
 Tesla V100 | ``gpu-v100-32g`` | ``volta`` | ``v100`` | 16 | dgx[3-7] | Volta | 5120 | 32GB | 7.0
-Tesla A100 | ``gpu-a100-80g`` | ``ampere`` | ``a100`` | 56 | gpu[11-17,38-44] | Ampere | 7936 | 80GB | 8.0
+Tesla A100 | ``gpu-a100-80g`` | ``ampere`` | ``a100`` | 56 | gpu[11-17,38-44] | Ampere | 7936 | 80GB | 8.0
 Tesla H100 | ``gpu-h100-80g`` | ``hopper`` | ``h100`` | 16 | gpu[45-48] | Hopper | 16896 | 80GB | 9.0
+Tesla H200 | ``gpu-h200-18g-ia`` | ``hopper`` | ``h200-18g`` | 56 | gpu[49] | Hopper | | 18GB | 9.0
+Tesla H200 | ``gpu-h200-141g`` | ``hopper`` | ``h200`` | 16 | gpu[50-51] | Hopper | | 141GB | 9.0
 AMD MI100 (testing) | *Not yet installed* | ``mi100`` | Use ``-p gpu-amd`` only, no ``--gres`` | | gpuamd[1] |

triton/ref/hardware.rst

Lines changed: 2 additions & 0 deletions
@@ -19,4 +19,6 @@
 dgx[3-7] | 5 | Nvidia DGX-1 | 2018 | bdw avx2 volta | 2x20 core `Xeon E5-2698 v4 @ 2.2GHz <https://ark.intel.com/products/91753/Intel-Xeon-Processor-E5-2698-v4-50M-Cache-2_20-GHz>`__ | 512GB DDR4-2133 | EDR | 8x `V100 <https://www.nvidia.com/en-us/data-center/v100/>`__ 32GB| 7 TB SSD
 gpuamd1 | 1 | Dell PowerEdge R7525 | 2021 | rome avx2 mi100 | 2x8 core AMD EPYC 7262 @3.2GHz | 250GB DDR4-3200 | EDR | 3x `MI100 <https://www.amd.com/en/products/server-accelerators/instinct-mi100>`__ | 32GB SSD
 gpu[45-48] | 4 | Dell PowerEdge XE8640 | 2024 | saphr avx2 h100 hopper | 2x48 core `Xeon Platinum 8468 <https://www.intel.com/content/www/us/en/products/sku/231735/intel-xeon-platinum-8468-processor-105m-cache-2-10-ghz/specifications.html>`__ 2.1GHz | 1024GB DDR5-4800 | HDR | 4x `H100 SXM <https://www.nvidia.com/en-us/data-center/h100/>`__ 80GB | 21 TB SSD
+gpu[49] | 1 | Dell PowerEdge XE9680 | 2024 | emerald avx2 h200 hopper | 2x32 core `Xeon® Platinum 8562Y+ <https://www.intel.com/content/www/us/en/products/sku/237558/intel-xeon-platinum-8562y-processor-60m-cache-2-80-ghz/specifications.html>`__ 2.8GHz | 2048GB DDR5-5600 | HDR | 8x `H200 SXM <https://www.nvidia.com/en-us/data-center/h200/>`__ each split to 7x18GB | 20 TB SSD
+gpu[50-51] | 2 | Dell PowerEdge XE9680 | 2024 | emerald avx2 h200 hopper | 2x32 core `Xeon® Platinum 8562Y+ <https://www.intel.com/content/www/us/en/products/sku/237558/intel-xeon-platinum-8562y-processor-60m-cache-2-80-ghz/specifications.html>`__ 2.8GHz | 2048GB DDR5-5600 | HDR | 8x `H200 SXM <https://www.nvidia.com/en-us/data-center/h200/>`__ 141GB | 20 TB SSD
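
The first two columns of these rows pair a compact node expression (for example ``gpu[50-51]``) with a node count. As a rough illustration of how that notation reads, the sketch below expands such an expression into individual node names. The helper ``expand_nodes`` is hypothetical and handles only the simple forms that appear in this diff (one prefix, comma-separated numeric ranges, no zero padding); on a real Slurm cluster, ``scontrol show hostnames`` is the authoritative tool.

```python
import re


def expand_nodes(expr):
    """Expand a compact node expression such as 'gpu[11-17,38-44]' into node names.

    Hypothetical helper: covers only 'prefix[a-b,c,...]' and plain names.
    """
    match = re.fullmatch(r"([A-Za-z]+)\[([0-9,\-]+)\]", expr)
    if match is None:
        return [expr]  # plain name without brackets, e.g. 'gpuamd1'
    prefix, ranges = match.groups()
    names = []
    for part in ranges.split(","):
        lo, _, hi = part.partition("-")
        hi = hi or lo  # a single number such as '49' is a range of one
        names += [f"{prefix}{n}" for n in range(int(lo), int(hi) + 1)]
    return names


# Node counts derived from the expressions used in the rows above.
for expr in ("dgx[3-7]", "gpu[45-48]", "gpu[49]", "gpu[50-51]"):
    print(expr, len(expand_nodes(expr)))
```

The printed counts can be compared against the second column of each row as a quick consistency check when adding new nodes.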

triton/tut/gpu.rst

Lines changed: 1 addition & 1 deletion
@@ -267,7 +267,7 @@ For GPUs in Triton these flags are:
 -arch=sm_60 -gencode=arch=compute_60,code=sm_60 -gencode=arch=compute_70,code=sm_70 -gencode=arch=compute_80,code=sm_80 -gencode=arch=compute_90,code=sm_90

 Here architectures (``compute_XX``/``sm_XX``) number 60, 70, 80 and 90
-correspond to GPU cards P100, V100, A100 and H100 respectively.
+correspond to GPU cards P100, V100, A100 and H100/H200 respectively.

 For more information, you can check this
 `excellent article <https://arnon.dk/matching-sm-architectures-arch-and-gencode-for-various-nvidia-cards/>`__
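
Because the H200 shares compute capability 9.0 with the H100, the flag line in this hunk needs no new ``-gencode`` entry. A minimal sketch of how that flag string follows from the architecture numbers is shown below; the function ``nvcc_arch_flags`` and the ``TRITON_CARDS`` mapping are illustrative names, not part of the Triton documentation or of ``nvcc`` itself.

```python
def nvcc_arch_flags(capabilities):
    """Build an nvcc flag string from compute capabilities such as 60, 70, 80.

    Illustrative helper: the lowest capability becomes the -arch default,
    and each capability gets its own -gencode entry.
    """
    caps = sorted(set(capabilities))
    if not caps:
        raise ValueError("need at least one compute capability")
    flags = [f"-arch=sm_{caps[0]}"]
    flags += [f"-gencode=arch=compute_{c},code=sm_{c}" for c in caps]
    return " ".join(flags)


# Card names as listed in the changed sentence; H100 and H200 both map to 90.
TRITON_CARDS = {60: "P100", 70: "V100", 80: "A100", 90: "H100/H200"}

flags = nvcc_arch_flags(TRITON_CARDS)
print(flags)
```

Under these assumptions the printed string matches the flag line shown in the hunk, which is why the commit only touches the explanatory sentence.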
