@git-jxj git-jxj commented Sep 11, 2025

Summary

This PR fixes a KeyError: 'content' that occurs when processing streaming chat completions.

Details

When using the chat_completions endpoint with stream=True, the final delta chunk sent by the server may not contain a content key; this is standard API behavior that signals the end of the stream.

The existing code in _extract_completions_delta_content did not account for this possibility and tried to access delta['content'] directly, leading to a KeyError and causing the benchmark process to crash when the stream ended.

Test Plan

This was discovered while running guidellm benchmark against an OpenAI-compatible API endpoint (via litellm) that correctly implements the streaming protocol.

guidellm benchmark \
  --target "http://10.64.1.62:4000/v1" \
  --model "qwen3-06b-2" \
  --processor "Qwen/Qwen3-0.6B" \
  --rate-type "synchronous" \
  --max-requests 1 \
  --data "prompt_tokens=32,output_tokens=32,samples=1"

Related Issues

#315

  • Resolves #

  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes AI-assisted code completion
  • Includes code generated by an AI application
  • Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

sjmonson previously approved these changes Sep 11, 2025

@sjmonson sjmonson left a comment
Looks good to me but needs signoff.

@git-jxj git-jxj force-pushed the fix/streaming-keyerror-content branch from 55d32b0 to 697ea5e Compare September 12, 2025 02:41
@sjmonson sjmonson merged commit 0ce21da into vllm-project:main Sep 12, 2025
17 checks passed
tukwila pushed a commit to tukwila/guidellm that referenced this pull request Sep 17, 2025
…#316)

Signed-off-by: xinjun.jiang <[email protected]>
Co-authored-by: Samuel Monson <[email protected]>