Skip to content

Commit b71a0f7

Browse files
authored
test: Add llama 4 to ci (NVIDIA#3520)
* Add llama 4 to ci Signed-off-by: Dongfeng Yu <[email protected]> * Only test trtllm Signed-off-by: Dongfeng Yu <[email protected]> * Disable marverick Signed-off-by: Dongfeng Yu <[email protected]> --------- Signed-off-by: Dongfeng Yu <[email protected]>
1 parent fc88d67 commit b71a0f7

File tree

2 files changed

+3
-1
lines changed

2 files changed

+3
-1
lines changed

tests/integration/test_lists/test-db/l0_dgx_h200.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,3 +15,5 @@ l0_dgx_h200:
1515
tests:
1616
# ------------- PyTorch tests ---------------
1717
- accuracy/test_llm_api_pytorch.py::TestDeepSeekV3Lite::test_bfloat16_4gpus[ep4-attention_dp-cuda_graph-overlap_scheduler]
18+
- unittest/_torch/multi_gpu_modeling/test_llama4.py::test_llama4[tp8-trtllm-scout] # 1h
19+
# - unittest/_torch/multi_gpu_modeling/test_llama4.py::test_llama4[tp8-trtllm-maverick] # 3h will timeout

tests/unittest/_torch/multi_gpu_modeling/test_llama4.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ def test_llama4(model_name, backend, tp_size):
2626
]
2727

2828
pytorch_config = PyTorchConfig(attn_backend=backend, )
29-
model_dir = str(llm_models_root() / model_name)
29+
model_dir = str(llm_models_root() / "llama4-models" / model_name)
3030

3131
llm = LLM(
3232
model=model_dir,

0 commit comments

Comments
 (0)