[responsesAPI] get reasoning token metrics for simpleContext #31839
base: main
Conversation
Signed-off-by: Andrew Xia <[email protected]>
Code Review
This pull request adds the ability to compute reasoning token metrics within SimpleContext. The implementation introduces a new _compute_reasoning_tokens method that is triggered when the final output is accessed, and the changes come with a good set of tests covering various scenarios. My review focuses on making the new logic more robust, specifically around exception handling and the correctness of the calculated metric.
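For orientation, the new logic appears to follow a compute-once pattern roughly like the sketch below, reconstructed from the hunks quoted in the inline comments that follow. The `parser` attribute, the early-return guard, and the logging in the `except` branch are assumptions for illustration, not code from the PR:

```python
import logging

logger = logging.getLogger(__name__)


class SimpleContext:
    # Only the fields relevant to the sketch are shown.
    def __init__(self) -> None:
        self._accumulated_token_ids: list[int] = []
        self.num_reasoning_tokens = 0
        self._reasoning_tokens_computed = False
        self.parser = None  # assumed handle to a reasoning parser

    def _compute_reasoning_tokens(self) -> None:
        # Triggered when the final output is accessed; computes the metric once.
        if self._reasoning_tokens_computed:
            return
        try:
            # Assumed call: the parser returns the subset of accumulated ids
            # that belong to the final content (everything else is reasoning).
            content_token_ids = self.parser.extract_content_ids(
                self._accumulated_token_ids
            )
            self.num_reasoning_tokens = len(self._accumulated_token_ids) - len(
                content_token_ids
            )
            self._reasoning_tokens_computed = True
        except Exception:
            # The PR appears to swallow parsing failures here; see the
            # second inline comment below.
            logger.debug("Failed to compute reasoning tokens", exc_info=True)
```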
```python
self.num_reasoning_tokens = len(self._accumulated_token_ids) - len(
    content_token_ids
)
```
The calculation of num_reasoning_tokens can go negative if len(content_token_ids) exceeds len(self._accumulated_token_ids). While extract_content_ids should ideally return a subset of the input tokens, a bug in a parser implementation could violate that assumption. To make this more robust, I suggest clamping the value at zero and logging the unexpected result, which also aids debugging.
Suggested change:

```diff
-self.num_reasoning_tokens = len(self._accumulated_token_ids) - len(
-    content_token_ids
-)
+num_reasoning_tokens = len(self._accumulated_token_ids) - len(content_token_ids)
+if num_reasoning_tokens < 0:
+    logger.warning("Calculated negative reasoning tokens (%d). Clamping to 0.", num_reasoning_tokens)
+    num_reasoning_tokens = 0
+self.num_reasoning_tokens = num_reasoning_tokens
```
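Clamping keeps the reported metric valid for anything consuming it downstream, while the warning still surfaces the misbehaving parser in the logs.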
```python
    self._reasoning_tokens_computed = True
except Exception:
```
Using a broad except Exception: can mask underlying issues such as TypeError or AttributeError, which usually indicate programming errors rather than parsing failures, and that makes debugging harder. It is better to catch the specific exceptions expected during parsing. If the parser can raise many different exceptions, consider defining a custom ReasoningParsingError and wrapping the originals in it; this makes the intent explicit and avoids catching unrelated errors. A sketch of that pattern follows.
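A minimal sketch of the suggested wrapping pattern; the helper name and the specific exception types expected from the parser are assumptions for illustration:

```python
import logging

logger = logging.getLogger(__name__)


class ReasoningParsingError(Exception):
    """Raised when content-token extraction fails; wraps the parser's error."""


def extract_content_ids_safely(parser, token_ids: list[int]) -> list[int]:
    try:
        return parser.extract_content_ids(token_ids)
    except (ValueError, IndexError, KeyError) as e:
        # Only failure modes expected from parsing are wrapped; genuine
        # programming errors (TypeError, AttributeError, ...) propagate.
        raise ReasoningParsingError("failed to extract content token ids") from e
```

The call site in _compute_reasoning_tokens can then catch only ReasoningParsingError, so unrelated bugs surface immediately instead of being silently swallowed.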
Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.