Skip to content

Commit 18c3f3b

Browse files
committed
[https://nvbugs/5769712][fix] fix timeout in AutoDeploy llama accuracy test
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
1 parent 069ad68 commit 18c3f3b

File tree

2 files changed

+1
-2
lines changed

2 files changed

+1
-2
lines changed

tests/integration/defs/accuracy/test_llm_api_autodeploy.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ def get_default_kwargs(self, enable_chunked_prefill=False):
4444
},
4545
"compile_model": {
4646
"backend":
47-
"torch-opt",
47+
"torch-cudagraph",
4848
"cuda_graph_batch_sizes":
4949
[1, 2, 4, 8, 16, 32, 64, 128, 256, 512],
5050
},

tests/integration/test_lists/waives.txt

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -227,7 +227,6 @@ full:H100_PCIe/unittest/llmapi/test_llm_pytorch.py::test_llama_7b_multi_lora_evi
227227
unittest/_torch/speculative/test_draft_len_schedule.py::test_correctness_across_batch_sizes[model_drafter-schedule1] SKIP (https://nvbugs/5680911)
228228
accuracy/test_llm_api_pytorch.py::TestSeedOss_36B::test_auto_dtype SKIP (https://nvbugs/5612438)
229229
accuracy/test_llm_api_autodeploy.py::TestNemotronH::test_auto_dtype[True] SKIP (https://nvbugs/5688721)
230-
accuracy/test_llm_api_autodeploy.py::TestLlama3_1_8B::test_auto_dtype[False-4] SKIP (https://nvbugs/5769712)
231230
test_e2e.py::test_openai_completions_example[trt] SKIP (https://nvbugs/5701450)
232231
accuracy/test_llm_api_pytorch.py::TestLlama3_1_8BInstruct::test_fp8_4gpus[tp4-fp8kv=False-attn_backend=TRTLLM-torch_compile=False] SKIP (https://nvbugs/5701457)
233232
triton_server/test_triton_llm.py::test_llmapi_backend[4-0-disableDecoupleMode-tensorrt_llm] SKIP (https://nvbugs/5701480)

0 commit comments

Comments
 (0)