add online runs for disagg by sixiang-google · Pull Request #123 · QiliangCui/bm-infra

sixiang-google · 2025-11-17T22:42:22Z

No description provided.

scripts/scheduler/schedule_run.sh

QiliangCui · 2025-11-17T23:04:34Z

cases/hourly_disagg.csv

@@ -1,2 +1,6 @@
-Device,Model,MaxNumSeqs,MaxNumBatchedTokens,TensorParallelSize,MaxModelLen,Dataset,InputLen,OutputLen,ExpectedETEL,NumPrompts
-v6e-8,meta-llama/Llama-3.1-8B-Instruct,128,1024,2,2048,sonnet,1800,128,,1000
+Device,Model,MaxNumSeqs,MaxNumBatchedTokens,TensorParallelSize,MaxModelLen,Dataset,InputLen,OutputLen,ExpectedETEL,NumPrompts,RequestRate


the code now can't be smart enough to find "RequestRate" in this case... the first line is just CSV header, in your case, this needs to be

Device,Model,MaxNumSeqs,MaxNumBatchedTokens,TensorParallelSize,MaxModelLen,Dataset,InputLen,OutputLen,ExpectedETEL,NumPrompts,MODELTAG,PREFIX_LEN,RequestRate

here https://github.com/QiliangCui/bm-infra/blob/main/scripts/scheduler/schedule_run.sh#L48 is how it is read

database/vllm_bm_20251117.ddl

scripts/scheduler/hourly_run.sh

patemotter

Overall LGTM, the only real issue is whether or not the ExtraEnvs logic does things in a way that we need to move your new entry before it.

patemotter · 2025-11-25T19:12:12Z

cases/hourly_disagg.csv

@@ -1,2 +1,6 @@
-Device,Model,MaxNumSeqs,MaxNumBatchedTokens,TensorParallelSize,MaxModelLen,Dataset,InputLen,OutputLen,ExpectedETEL,NumPrompts
-v6e-8,meta-llama/Llama-3.1-8B-Instruct,128,1024,2,2048,sonnet,1800,128,,1000


Nit: Was this entry meant to be deleted?

v6e-8,meta-llama/Llama-3.1-8B-Instruct,128,1024,2,2048,sonnet,1800,128,,1000

Yes, we want to test more using mlperf. This is too prefill-heavy.

scripts/scheduler/hourly_run.sh

scripts/scheduler/schedule_run.sh

QiliangCui reviewed Nov 17, 2025

View reviewed changes

sixiang-google force-pushed the main branch from 2ab9b3f to 91d34c0 Compare November 17, 2025 23:13

sixiang-google requested a review from QiliangCui November 18, 2025 01:15

QiliangCui requested a review from patemotter November 21, 2025 22:34

sixiang-google force-pushed the main branch from 91d34c0 to 95ee1df Compare November 21, 2025 22:42

QiliangCui approved these changes Nov 21, 2025

View reviewed changes

database/vllm_bm_20251117.ddl Show resolved Hide resolved

scripts/scheduler/hourly_run.sh Outdated Show resolved Hide resolved

patemotter reviewed Nov 25, 2025

View reviewed changes

patemotter approved these changes Nov 25, 2025

View reviewed changes

add online runs for disagg

d5059d5

sixiang-google force-pushed the main branch from 95ee1df to d5059d5 Compare December 3, 2025 23:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add online runs for disagg#123

add online runs for disagg#123
sixiang-google wants to merge 1 commit intoQiliangCui:mainfrom
sixiang-google:main

sixiang-google commented Nov 17, 2025

Uh oh!

Uh oh!

QiliangCui Nov 17, 2025

Uh oh!

Uh oh!

Uh oh!

patemotter left a comment

Uh oh!

patemotter Nov 25, 2025

Uh oh!

sixiang-google Dec 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -1,2 +1,6 @@
		Device,Model,MaxNumSeqs,MaxNumBatchedTokens,TensorParallelSize,MaxModelLen,Dataset,InputLen,OutputLen,ExpectedETEL,NumPrompts
		v6e-8,meta-llama/Llama-3.1-8B-Instruct,128,1024,2,2048,sonnet,1800,128,,1000

Conversation

sixiang-google commented Nov 17, 2025

Uh oh!

Uh oh!

QiliangCui Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

patemotter left a comment

Choose a reason for hiding this comment

Uh oh!

patemotter Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

sixiang-google Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants