feat: exclude GPT-5 models from 20% context window output token cap #6963
Conversation
- GPT-5 models now bypass the 20% context window cap and use their full configured max output tokens (e.g., 128k)
- Added logic to detect GPT-5 models by checking if modelId contains "gpt-5" (case-insensitive)
- Added comprehensive test coverage for the GPT-5 exclusion logic
- Non-GPT-5 models continue to be capped at 20% of the context window, as before
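The bullets above can be sketched as a small function. This is a hypothetical reconstruction of the described behavior, not the project's actual code; the `ModelInfo` shape and the `getModelMaxOutputTokens` signature are assumptions.

```typescript
// Hypothetical sketch of the capping behavior described in the PR summary.
interface ModelInfo {
	maxTokens?: number
	contextWindow: number
}

function getModelMaxOutputTokens(modelId: string, model: ModelInfo): number | undefined {
	if (model.maxTokens) {
		// GPT-5 models bypass the 20% cap and use their configured max tokens
		const isGpt5Model = modelId.toLowerCase().includes("gpt-5")
		if (isGpt5Model) {
			return model.maxTokens
		}
		// All other models: clamp to 20% of the context window
		return Math.min(model.maxTokens, Math.floor(model.contextWindow * 0.2))
	}
	return undefined
}

console.log(getModelMaxOutputTokens("gpt-5", { maxTokens: 128000, contextWindow: 400000 })) // 128000
console.log(getModelMaxOutputTokens("gpt-4o", { maxTokens: 64000, contextWindow: 128000 })) // 25600
```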
Reviewing my own code because apparently I trust no one, not even myself.
```typescript
// Exception: GPT-5 models should use their exact configured max output tokens
if (model.maxTokens) {
	// Check if this is a GPT-5 model (case-insensitive)
	const isGpt5Model = modelId.toLowerCase().includes("gpt-5")
```
The pattern matching here could be more precise. Currently, modelId.toLowerCase().includes("gpt-5") would match unintended model IDs like "not-gpt-5-compatible" or "legacy-gpt-500".
Could we consider a more specific pattern? Perhaps:
```typescript
// Check if this is a GPT-5 model (case-insensitive, more precise matching)
const isGpt5Model = /^(openai\/)?gpt-5($|-|\.)/.test(modelId.toLowerCase())
```
This would match "gpt-5", "gpt-5-turbo", "openai/gpt-5-preview" but not "not-gpt-5" or "gpt-500".
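To make the difference concrete, here is a small comparison of the naive `includes()` check against the stricter regex proposed above. The helper names `naive` and `strict` are just labels for this sketch:

```typescript
// Compare the two matching strategies on IDs from the review discussion.
const naive = (id: string): boolean => id.toLowerCase().includes("gpt-5")
const strict = (id: string): boolean => /^(openai\/)?gpt-5($|-|\.)/.test(id.toLowerCase())

const ids = ["gpt-5", "gpt-5-turbo", "openai/gpt-5-preview", "not-gpt-5-compatible", "legacy-gpt-500", "gpt-500"]
for (const id of ids) {
	console.log(`${id}: naive=${naive(id)} strict=${strict(id)}`)
}
// The naive check matches "not-gpt-5-compatible" and "legacy-gpt-500";
// the strict regex rejects both while still matching the real GPT-5 IDs.
```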
```typescript
// If model has explicit maxTokens, clamp it to 20% of the context window
// Exception: GPT-5 models should use their exact configured max output tokens
```
This GPT-5 exception is a significant behavior change that should be documented in the function's JSDoc. Future maintainers might wonder why GPT-5 models get special treatment. Could we add a note to the function documentation explaining this exception?
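One possible shape for that JSDoc note, attached here to a stub signature for illustration. The wording, parameter names, and simplified body are assumptions, not the project's actual code:

```typescript
/**
 * Resolves the max output tokens for a model.
 *
 * By default, an explicit maxTokens value is clamped to 20% of the
 * model's context window to leave room for input.
 *
 * Exception: GPT-5 models bypass this cap and use their exact
 * configured maxTokens (e.g., 128k).
 */
function getModelMaxOutputTokens(modelId: string, maxTokens: number, contextWindow: number): number {
	// Simplified body for illustration only
	return modelId.toLowerCase().includes("gpt-5") ? maxTokens : Math.min(maxTokens, Math.floor(contextWindow * 0.2))
}

console.log(getModelMaxOutputTokens("gpt-5", 128000, 400000)) // 128000
```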
```typescript
// Test various GPT-5 model IDs
const gpt5ModelIds = ["gpt-5", "gpt-5-turbo", "GPT-5", "openai/gpt-5-preview", "gpt-5-32k", "GPT-5-TURBO"]
```
Great test coverage! Though it might be worth adding edge case tests to ensure the pattern matching doesn't have false positives. For example, testing that "not-gpt-5", "gpt-500", or "legacy-gpt-5-incompatible" don't incorrectly bypass the cap.
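A sketch of those false-positive tests, written as plain assertions against the stricter regex from the earlier suggestion (the `isGpt5Model` helper name is assumed):

```typescript
// Edge-case checks: real GPT-5 IDs should match, lookalikes should not.
const isGpt5Model = (id: string): boolean => /^(openai\/)?gpt-5($|-|\.)/.test(id.toLowerCase())

const shouldMatch = ["gpt-5", "gpt-5-turbo", "GPT-5", "openai/gpt-5-preview", "gpt-5-32k", "GPT-5-TURBO"]
const shouldNotMatch = ["not-gpt-5", "gpt-500", "legacy-gpt-5-incompatible"]

for (const id of shouldMatch) {
	console.assert(isGpt5Model(id), `expected match: ${id}`)
}
for (const id of shouldNotMatch) {
	console.assert(!isGpt5Model(id), `expected no match: ${id}`)
}
```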
daniel-lxs left a comment
LGTM
This PR addresses a Slack request to exclude models containing "gpt-5" from the logic that caps the max output tokens to 20% of the context window.
Changes
- Modified getModelMaxOutputTokens() in src/shared/api.ts to detect GPT-5 models (case-insensitive check for "gpt-5" in modelId)

Testing
- Added tests in api.spec.ts verifying that GPT-5 models bypass the cap and use their full configured max tokens
Impact
This change allows GPT-5 models to utilize their full 128k max output token capacity as configured, improving their ability to generate longer responses without artificial limitations.
Important
- getModelMaxOutputTokens now allows GPT-5 models to bypass the 20% context window cap, using their full configured max tokens, with tests verifying this behavior.
- getModelMaxOutputTokens in api.ts now excludes GPT-5 models from the 20% context window cap, allowing them to use their full configured max tokens.
- Added tests in api.spec.ts to verify GPT-5 models bypass the cap and use full max tokens.