From 8ac0c29bca5b9a05411278c5edc3a7562b7b6d8a Mon Sep 17 00:00:00 2001 From: Eran Geva <19514940+MrGeva@users.noreply.github.com> Date: Thu, 1 Jan 2026 02:59:56 -0800 Subject: [PATCH 1/3] added llama 70b Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com> --- tests/integration/test_lists/test-db/l0_perf.yml | 1 + 1 file changed, 1 insertion(+) diff --git a/tests/integration/test_lists/test-db/l0_perf.yml b/tests/integration/test_lists/test-db/l0_perf.yml index b9b4b42db66..68cdf29dcf6 100644 --- a/tests/integration/test_lists/test-db/l0_perf.yml +++ b/tests/integration/test_lists/test-db/l0_perf.yml @@ -43,3 +43,4 @@ l0_perf: backend: pytorch tests: - perf/test_perf.py::test_perf[llama_v3.1_8b_instruct-bench-_autodeploy-float16-input_output_len:128,128-reqs:8192] + - perf/test_perf.py::test_perf[llama_v3.3_70b_instruct_fp8-bench-_autodeploy-float8-input_output_len:1024,1024-reqs:8192] From 787c4f7570349e0fbf6c36ab16bb2442a2bdb52a Mon Sep 17 00:00:00 2001 From: Eran Geva <19514940+MrGeva@users.noreply.github.com> Date: Sun, 4 Jan 2026 00:58:21 -0800 Subject: [PATCH 2/3] deepseek r1 Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com> --- tests/integration/test_lists/test-db/l0_perf.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/tests/integration/test_lists/test-db/l0_perf.yml b/tests/integration/test_lists/test-db/l0_perf.yml index 68cdf29dcf6..3841adceac3 100644 --- a/tests/integration/test_lists/test-db/l0_perf.yml +++ b/tests/integration/test_lists/test-db/l0_perf.yml @@ -43,4 +43,4 @@ l0_perf: backend: pytorch tests: - perf/test_perf.py::test_perf[llama_v3.1_8b_instruct-bench-_autodeploy-float16-input_output_len:128,128-reqs:8192] - - perf/test_perf.py::test_perf[llama_v3.3_70b_instruct_fp8-bench-_autodeploy-float8-input_output_len:1024,1024-reqs:8192] + - perf/test_perf.py::test_perf[deepseek_r1_distill_qwen_32b-bench-_autodeploy-float16-input_output_len:1024,1024-reqs:8192] From 87afa20ea94dce866864a07d895b48fd984c38ad Mon Sep 17 00:00:00 2001 From: Eran Geva <19514940+MrGeva@users.noreply.github.com> Date: Sun, 4 Jan 2026 06:47:39 -0800 Subject: [PATCH 3/3] reduces reqs Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com> --- tests/integration/test_lists/test-db/l0_perf.yml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/tests/integration/test_lists/test-db/l0_perf.yml b/tests/integration/test_lists/test-db/l0_perf.yml index 3841adceac3..4b5d50fb4fa 100644 --- a/tests/integration/test_lists/test-db/l0_perf.yml +++ b/tests/integration/test_lists/test-db/l0_perf.yml @@ -43,4 +43,4 @@ l0_perf: backend: pytorch tests: - perf/test_perf.py::test_perf[llama_v3.1_8b_instruct-bench-_autodeploy-float16-input_output_len:128,128-reqs:8192] - - perf/test_perf.py::test_perf[deepseek_r1_distill_qwen_32b-bench-_autodeploy-float16-input_output_len:1024,1024-reqs:8192] + - perf/test_perf.py::test_perf[deepseek_r1_distill_qwen_32b-bench-_autodeploy-float16-input_output_len:1024,1024-reqs:512]