Skip to content

Commit 021c90d

Browse files
more a100 (#67) (#70)
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com> Co-authored-by: Jason Zhou <jasonzho@nvidia.com>
1 parent 0d3c947 commit 021c90d

File tree

3 files changed

+4
-4
lines changed

3 files changed

+4
-4
lines changed

tests/cli/e2e_validation/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ python3 tests/cli/e2e_validation/e2e_runner.py --mode <smoke|selective|full> [op
4040
**Usage**: `python3 e2e_runner.py --mode full --parallel 4 --continue-on-error`
4141

4242
- ALL model combinations (~15 models)
43-
- ALL system configurations (h100_sxm, h200_sxm)
43+
- ALL system configurations (h100_sxm, h200_sxm, b200_sxm, gb200_sxm, a100_sxm)
4444
- ALL GPU configurations (8, 512 GPUs)
4545
- ALL workload patterns (3 ISL/OSL combinations)
4646
- ALL performance targets (2 TPOT values)
@@ -117,7 +117,7 @@ pipeline {
117117
The testing framework systematically validates:
118118

119119
- **15+ Models**: QWEN, LLAMA, DEEPSEEK, MOE variants
120-
- **2 Systems**: h100_sxm, h200_sxm, b200_sxm, gb200_sxm
120+
- **2 Systems**: h100_sxm, h200_sxm, b200_sxm, gb200_sxm, a100_sxm
121121
- **2 GPU Configs**: 8 GPUs, 512 GPUs
122122
- **3 Workload Patterns**: (4000,1000), (1000,2), (32,1000) ISL/OSL
123123
- **2 Performance Targets**: 10ms, 100ms TPOT

tools/simple_sdk_demo/cli/main.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ def parse(args):
4343
parser.add_argument("--system",
4444
type=str,
4545
default='h200_sxm',
46-
choices=['h100_sxm', 'h200_sxm'],
46+
choices=['h100_sxm', 'h200_sxm', 'b200_sxm', 'gb200_sxm', 'a100_sxm'],
4747
help='GPU+system Type')
4848
parser.add_argument("--model",
4949
type=str,

tools/simple_sdk_demo/sla_service/sla_service.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ def get_supported_models():
3636

3737
@app.post("/sla")
3838
def post_sla(
39-
system: str = Body("h200_sxm", description="hardware name, h200_sxm, h100_sxm"),
39+
system: str = Body("h200_sxm", description="hardware name, h200_sxm, h100_sxm, b200_sxm, gb200_sxm, a100_sxm"),
4040
backend: str = Body("trtllm", description="backend name, trtllm, sglang, vllm"),
4141
version: str = Body("0.20.0", description="trtllm version, 0.20.0"),
4242
model_name: str = Body("QWEN3_32B", description="model name"),

0 commit comments

Comments
 (0)