add random concurrent workload template for inference-perf by huaxig · Pull Request #635 · llm-d/llm-d-benchmark

huaxig · 2026-01-30T23:53:29Z

Introduces a new inference-perf workload profile for sanity testing with concurrent users.

The sanity_random_concurrent profile is designed to test the stability and performance of the llm-d stack under increasing load. It consists of 4 stages, starting with 1 concurrent user and scaling up to 8.
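The staged ramp described above can be sketched in a few lines of Python. This is only an illustration, not the contents of the template: the PR text gives the number of stages (4) and the endpoints (1 and 8 concurrent users), while the intermediate levels are an assumption here (a geometric ramp, which yields doubling). The names `Stage` and `concurrency_ramp` are hypothetical and are not part of inference-perf or llm-d-benchmark.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One load stage: a fixed number of concurrent users (hypothetical type)."""
    index: int
    concurrent_users: int


def concurrency_ramp(start: int, end: int, num_stages: int) -> list[Stage]:
    """Build a geometric ramp from `start` to `end` concurrent users.

    Assumption: stages grow by a constant factor. The PR only states the
    endpoints and the stage count, not the intermediate values.
    """
    if start < 1 or end < start:
        raise ValueError("need 1 <= start <= end")
    if num_stages < 2:
        raise ValueError("need at least two stages to form a ramp")
    # Constant growth factor that reaches `end` on the last stage.
    ratio = (end / start) ** (1 / (num_stages - 1))
    return [Stage(i, round(start * ratio**i)) for i in range(num_stages)]


if __name__ == "__main__":
    for stage in concurrency_ramp(start=1, end=8, num_stages=4):
        print(f"stage {stage.index}: {stage.concurrent_users} concurrent user(s)")
```

Under these assumptions the four stages would run at 1, 2, 4, and 8 concurrent users; the merged template may use different intermediate levels.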

namasl

Profiles prefixed with sanity_ tend to be small benchmarks that run quickly (sanity checks, used for tests and CI). Drop the prefix, then it looks good to me.

maugustosilva · 2026-02-02T20:55:37Z

+1 on @namasl's comment

Signed-off-by: Xia Hua <huaxi@google.com>

huaxig · 2026-02-02T23:38:56Z

Sounds good, renamed it to random_concurrent.yaml.in

add random concurrent workload template for inference-perf

f262743

namasl requested changes Feb 1, 2026


Rename sanity_random_concurrent.yaml.in to random_concurrent.yaml.in

6949880

Signed-off-by: Xia Hua <huaxi@google.com>

namasl approved these changes Feb 2, 2026


namasl merged commit 2efad69 into llm-d:main Feb 2, 2026
8 checks passed

namasl merged 2 commits into llm-d:main from huaxig:template

3 participants
