Clamp GPT-5 max output tokens to 20% of context window #8495

daniel-lxs · 2025-10-03T16:27:58Z

This PR standardizes GPT-5 output token limits with all other models:

- Remove GPT-5 exception in getModelMaxOutputTokens() so max output = min(model.maxTokens, ceil(0.2 * contextWindow)).
- Update tests accordingly: src/shared/__tests__/api.spec.ts.
- Verified locally: all tests passing (298 files, 3906 tests).

Example: 400k context → 80k max output tokens.

Rationale: Aligns GPT-5 behavior (including OpenRouter) with other models and avoids oversized completions when providers report very high max_completion_tokens.
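As a rough illustration of the clamp described above, here is a minimal sketch. It is not the repository's getModelMaxOutputTokens(); the `ModelLimits` type, the `clampMaxOutputTokens` name, and the fallback when `maxTokens` is missing are assumptions made for this example only.

```typescript
// Hypothetical shape for illustration; the real model info type may differ.
interface ModelLimits {
	contextWindow: number
	maxTokens?: number
}

// Fraction of the context window allowed for output, per the PR description.
const OUTPUT_FRACTION = 0.2

// Sketch of: max output = min(model.maxTokens, ceil(0.2 * contextWindow)).
// Assumption: when maxTokens is not reported, only the 20% cap applies.
function clampMaxOutputTokens(model: ModelLimits): number {
	const cap = Math.ceil(model.contextWindow * OUTPUT_FRACTION)
	return model.maxTokens === undefined ? cap : Math.min(model.maxTokens, cap)
}

// A 400k context window with a provider-reported 128k limit is capped at 80k.
const cappedByContext = clampMaxOutputTokens({ contextWindow: 400000, maxTokens: 128000 })

// A model whose own limit is already below the cap keeps that limit.
const cappedByModel = clampMaxOutputTokens({ contextWindow: 200000, maxTokens: 8192 })
```

With these inputs, `cappedByContext` is 80000 and `cappedByModel` is 8192, which matches the "400k context → 80k" example in the description.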

Important

Standardizes GPT-5 output token limits by applying a 20% cap, aligning with other models, and updates tests accordingly.

Behavior:
- Remove GPT-5 exception in getModelMaxOutputTokens() in api.ts to apply 20% cap on output tokens, aligning with other models.
- Example: For a 400k context window, max output tokens are capped at 80k.
Tests:
- Update tests in api.spec.ts to reflect the new 20% cap behavior for GPT-5 models.
- Tests verify that GPT-5 models are capped to 20% of context window or model.maxTokens, whichever is smaller.
Rationale:
- Aligns GPT-5 behavior with other models and prevents oversized completions when providers report high max tokens.

This description was created by ellipsis-dev bot for 8acee59.

roomote

I found some issues that need attention. See inline comments for details.

roomote · 2025-10-03T17:12:27Z

src/shared/__tests__/api.spec.ts

 		}

-		// Test various GPT-5 model IDs
 		const gpt5ModelIds = ["gpt-5", "gpt-5-turbo", "GPT-5", "openai/gpt-5-preview", "gpt-5-32k", "GPT-5-TURBO"]


[P3] Missing OpenRouter coverage for GPT-5 clamping. The PR mentions alignment "including OpenRouter", but this test only exercises format: 'openai'. Add a parallel assertion using format: 'openrouter' and settings.apiProvider: 'openrouter' with a GPT-5 modelId (e.g., 'openai/gpt-5-preview') to ensure the 20% cap applies via the OpenRouter path as well.
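One way to structure the coverage suggested in this comment is a table of cases that differ only in format and model ID. The sketch below is self-contained and uses a stand-in function, `maxOutputFor`, that ignores the format entirely; it is not the project's getModelMaxOutputTokens(), whose signature is not shown in this thread. The `ClampCase` type and the numeric limits are likewise hypothetical. A real test would call the actual function through both provider paths.

```typescript
// Hypothetical case shape; field names are assumptions for this sketch.
type ApiFormat = "openai" | "openrouter"

interface ClampCase {
	format: ApiFormat
	modelId: string
	contextWindow: number
	maxTokens: number
}

// Stand-in for the function under test. It applies the 20% cap regardless
// of format, which is the behavior the PR description says is intended.
function maxOutputFor(c: ClampCase): number {
	return Math.min(c.maxTokens, Math.ceil(c.contextWindow * 0.2))
}

// Same limits exercised through both formats mentioned in the review comment.
const clampCases: ClampCase[] = [
	{ format: "openai", modelId: "gpt-5", contextWindow: 400000, maxTokens: 128000 },
	{ format: "openrouter", modelId: "openai/gpt-5-preview", contextWindow: 400000, maxTokens: 128000 },
]

// Collect the result per case so a failure can be traced to a format.
const clampResults = clampCases.map((c) => ({ format: c.format, maxOutput: maxOutputFor(c) }))

// True only if every path yields the same 80k cap.
const allClamped = clampResults.every((r) => r.maxOutput === 80000)
```

Because the stand-in never reads `format`, this sketch only shows the test layout; the value of the real assertion comes from routing each case through the provider-specific code path.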

feat(shared): clamp GPT-5 max output tokens to 20% of context window;…

8acee59

… remove GPT-5 exception and update tests

daniel-lxs requested review from cte, jr and mrubens as code owners October 3, 2025 16:27

github-project-automation bot added this to Roo Code Roadmap Oct 3, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Oct 3, 2025

github-project-automation bot added this to Roo Code Roadmap Oct 3, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Oct 3, 2025

dosubot bot added the size:M (This PR changes 30-99 lines, ignoring generated files) and enhancement (New feature or request) labels Oct 3, 2025

hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels) label Oct 3, 2025

daniel-lxs moved this from Triage to PR [Needs Review] in Roo Code Roadmap Oct 3, 2025

mrubens approved these changes Oct 3, 2025

dosubot bot added the lgtm (This PR has been approved by a maintainer) label Oct 3, 2025

cte approved these changes Oct 3, 2025

mrubens merged commit 97f9686 into main Oct 3, 2025
23 checks passed

mrubens deleted the feat/limit-gpt5-max-output-20pct branch October 3, 2025 16:41

github-project-automation bot moved this from PR [Needs Review] to Done in Roo Code Roadmap Oct 3, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Oct 3, 2025

roomote bot reviewed Oct 3, 2025

mrubens mentioned this pull request Oct 9, 2025

Revert "Clamp GPT-5 max output tokens to 20% of context window" #8582

Merged
