Skip to content

Commit 6610e33

Browse files
committed
[https://nvbugs/5769712][fix] fix timeout in AutoDeploy llama accuracy test
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
1 parent 9cae727 commit 6610e33

File tree

2 files changed

+1
-3
lines changed

2 files changed

+1
-3
lines changed

tests/integration/defs/accuracy/test_llm_api_autodeploy.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ def get_default_kwargs(self, enable_chunked_prefill=False):
5959
},
6060
"compile_model": {
6161
"backend":
62-
"torch-opt",
62+
"torch-cudagraph",
6363
"cuda_graph_batch_sizes":
6464
[1, 2, 4, 8, 16, 32, 64, 128, 256, 512],
6565
},

tests/integration/test_lists/waives.txt

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -350,8 +350,6 @@ unittest/_torch/speculative/test_draft_len_schedule.py::test_correctness_across_
350350
unittest/_torch/speculative/test_eagle3.py::test_qwen3_eagle3[True-True-True-True] SKIP (https://nvbugspro.nvidia.com/bug/5749988)
351351
accuracy/test_llm_api_pytorch.py::TestSeedOss_36B::test_auto_dtype SKIP (https://nvbugs/5612438)
352352
accuracy/test_llm_api_autodeploy.py::TestNemotronH::test_auto_dtype[True] SKIP (https://nvbugs/5688721)
353-
accuracy/test_llm_api_autodeploy.py::TestLlama3_1_8B::test_auto_dtype[False-2] SKIP (https://nvbugs/5769712)
354-
accuracy/test_llm_api_autodeploy.py::TestLlama3_1_8B::test_auto_dtype[False-4] SKIP (https://nvbugs/5769712)
355353
test_e2e.py::test_openai_completions_example[trt] SKIP (https://nvbugs/5701450)
356354
accuracy/test_llm_api_pytorch.py::TestLlama3_1_8BInstruct::test_fp8_4gpus[tp4-fp8kv=False-attn_backend=TRTLLM-torch_compile=False] SKIP (https://nvbugs/5701457)
357355
triton_server/test_triton_llm.py::test_llmapi_backend[4-0-disableDecoupleMode-tensorrt_llm] SKIP (https://nvbugs/5701480)

0 commit comments

Comments
 (0)