Commit 66c6973

XuZhang99 authored and liutongxuan committed
bugfix: correct the calculation of num_prompt_tokens_per_request and num_generated_tokens_per_request metrics.
1 parent c116cc3 commit 66c6973

1 file changed, +4 -2 lines changed

xllm/core/framework/batch/batch_factory.cpp

Lines changed: 4 additions & 2 deletions
@@ -69,8 +69,10 @@ std::vector<Batch> BatchFactory::create_batches(
   COUNTER_ADD(num_processing_tokens_total_prompt, num_prompt_tokens);
   COUNTER_ADD(num_processing_tokens_total_generated, num_generated_tokens);

-  HISTOGRAM_OBSERVE(num_prompt_tokens_per_request, num_prompt_tokens);
-  HISTOGRAM_OBSERVE(num_generated_tokens_per_request, num_generated_tokens);
+  HISTOGRAM_OBSERVE(num_prompt_tokens_per_request,
+                    num_prompt_tokens / running_sequences.size());
+  HISTOGRAM_OBSERVE(num_generated_tokens_per_request,
+                    num_generated_tokens / running_sequences.size());

   return batches;
 }
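
The change turns the batch-wide token totals into a per-request mean by dividing by `running_sequences.size()`. Two edge cases may be worth checking against the surrounding code, which is not visible in this hunk: an empty `running_sequences`, and truncation if the totals are integer types. The sketch below is only an illustration of one way to guard both. `mean_tokens_per_request` and `mean_tokens_self_check` are hypothetical names that do not exist in xllm, and the integer types of the totals are an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>

// Hypothetical helper (not part of xllm): mean token count per running
// sequence, or std::nullopt when there are no running sequences.
// Assumes the batch totals are unsigned integer counts.
inline std::optional<double> mean_tokens_per_request(
    std::uint64_t total_tokens, std::size_t num_sequences) {
  if (num_sequences == 0) {
    // Nothing to observe, and dividing by zero must be avoided.
    return std::nullopt;
  }
  // Divide in floating point so that 7 tokens over 2 sequences yields 3.5
  // rather than the truncated 3 produced by integer division.
  return static_cast<double>(total_tokens) /
         static_cast<double>(num_sequences);
}

// Small self-check of the two edge cases described above.
inline void mean_tokens_self_check() {
  assert(!mean_tokens_per_request(10, 0).has_value());
  assert(*mean_tokens_per_request(7, 2) == 3.5);
}
```

At the call site, a caller would record the histogram sample only when the helper returns a value, for example `if (auto m = mean_tokens_per_request(num_prompt_tokens, running_sequences.size())) { HISTOGRAM_OBSERVE(num_prompt_tokens_per_request, *m); }`. Whether `HISTOGRAM_OBSERVE` accepts a floating-point sample is an assumption here. If `create_batches` already guarantees a non-empty `running_sequences`, the guard is redundant.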