Skip to content

Commit 09cc4e2

Browse files
[fix] fix completion stream api output_tokens not in usage (#3247)
1 parent d9e3f88 commit 09cc4e2

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

fastdeploy/entrypoints/openai/serving_completion.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -358,6 +358,7 @@ async def completion_stream_generator(
358358
usage=UsageInfo(
359359
prompt_tokens=len(prompt_batched_token_ids[idx]),
360360
completion_tokens=output_tokens[idx],
361+
total_tokens=len(prompt_batched_token_ids[idx]) + output_tokens[idx],
361362
),
362363
)
363364
yield f"data: {usage_chunk.model_dump_json(exclude_unset=True)}\n\n"

0 commit comments

Comments
 (0)