Skip to content

Commit 404fbe9

Browse files
[https://nvbugs/5277113][fix]genai-perf API change stress test (#4300)
* fix bug 5277113. Signed-off-by: Wangshanshan <[email protected]> * fix bug 5277113 and 5278517. Signed-off-by: Wangshanshan <[email protected]> --------- Signed-off-by: Wangshanshan <[email protected]>
1 parent d008d64 commit 404fbe9

File tree

4 files changed

+4
-11
lines changed

4 files changed

+4
-11
lines changed

examples/serve/genai_perf_client.sh

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,6 @@
33
genai-perf profile \
44
-m TinyLlama-1.1B-Chat-v1.0 \
55
--tokenizer TinyLlama/TinyLlama-1.1B-Chat-v1.0 \
6-
--service-kind openai \
76
--endpoint-type chat \
87
--random-seed 123 \
98
--synthetic-input-tokens-mean 128 \

requirements-dev.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,4 +30,4 @@ pytest-rerunfailures
3030
ruff==0.9.4
3131
lm_eval[api]==0.4.8
3232
docstring_parser
33-
genai-perf
33+
genai-perf==0.0.13

tests/integration/defs/stress_test/stress_test.py

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -705,8 +705,6 @@ def create_genai_perf_command(model_name,
705705
model_name,
706706
"--tokenizer",
707707
model_path,
708-
"--service-kind",
709-
"openai",
710708
"--endpoint-type",
711709
"completions",
712710
"--random-seed",
@@ -1054,8 +1052,9 @@ def extract_stress_test_metrics(artifacts_dir="./artifacts",
10541052
{}).get("avg", 0)
10551053
tokThroughput = results.get("output_token_throughput",
10561054
{}).get("avg", 0)
1057-
conCurrency = results.get("input_config",
1058-
{}).get("concurrency", 0)
1055+
conCurrency = results.get("input_config", {}).get(
1056+
"perf_analyzer", {}).get("stimulus",
1057+
{}).get("concurrency", 0)
10591058

10601059
# Try to determine model name from directory structure first
10611060
if first_dir in model_name_map:

tests/integration/test_lists/waives.txt

Lines changed: 0 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -480,11 +480,6 @@ test_e2e.py::test_ptp_quickstart_advanced_8gpus[Nemotron-Ultra-253B-nemotron-nas
480480
examples/test_gpt.py::test_starcoder_fp8_quantization_2gpu[starcoder] SKIP (https://nvbugs/5144931)
481481
examples/test_gpt.py::test_starcoder_fp8_quantization_2gpu[starcoderplus] SKIP (https://nvbugs/5144931)
482482
unittest/_torch -k "not (modeling or multi_gpu or auto_deploy)" SKIP (https://nvbugs/5271015)
483-
stress_test/stress_test.py::test_run_stress_test[llama-v3-8b-instruct-hf_tp1-stress_time_300s_timeout_450s-MAX_UTILIZATION-pytorch-stress-test] SKIP (https://nvbugs/5277113)
484-
stress_test/stress_test.py::test_run_stress_test[llama-v3-8b-instruct-hf_tp1-stress_time_300s_timeout_450s-GUARANTEED_NO_EVICT-pytorch-stress-test] SKIP (https://nvbugs/5277113)
485-
stress_test/stress_test.py::test_run_stress_test[llama-v3-8b-instruct-hf_tp1-stress_time_300s_timeout_450s-MAX_UTILIZATION-trt-stress-test] SKIP (https://nvbugs/5277113)
486-
stress_test/stress_test.py::test_run_stress_test[llama-v3-8b-instruct-hf_tp1-stress_time_300s_timeout_450s-GUARANTEED_NO_EVICT-trt-stress-test] SKIP (https://nvbugs/5277113)
487-
test_e2e.py::test_trtllm_serve_example SKIP (https://nvbugs/5278517)
488483
examples/test_whisper.py::test_llm_whisper_general[large-v3-disable_gemm_plugin-disable_attention_plugin-disable_weight_only-float16-nb:1-use_python_runtime] SKIP (https://nvbugs/5244570)
489484
unittest/_torch/speculative/test_eagle3.py SKIP (https://nvbugs/5280806)
490485
test_e2e.py::test_ptp_quickstart_multimodal[qwen2-vl-7b-instruct-Qwen2-VL-7B-Instruct-image] SKIP (https://nvbugs/5226211)

0 commit comments

Comments
 (0)