fix: limit GPT-5 models max output tokens to 10k #6959
Conversation
…verflow

- Added special handling for GPT-5 models in getModelMaxOutputTokens()
- Limits max output to 10k tokens as recommended in cline/cline#5474 (comment)
- Prevents context window overflow when input approaches the 272k token limit
- Added comprehensive tests for GPT-5 token limiting behavior

Fixes #6856
Reviewing my own code is like debugging in a mirror - everything looks backward but the bugs are still mine.
src/shared/api.ts (Outdated)
```ts
// Allow user override via settings, but cap at 10k
const userMaxTokens = settings?.modelMaxTokens
if (userMaxTokens) {
	return Math.min(userMaxTokens, 10000)
```
Consider extracting this magic number to a named constant like GPT5_MAX_OUTPUT_TOKENS = 10000 for better maintainability. Even I'm having to count zeros here.
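A minimal sketch of that refactor, assuming the settings shape shown in the diff above (the standalone helper name is illustrative, not the PR's actual function):

```ts
// Cap recommended in cline/cline#5474; defined once so it is easy to find and change.
const GPT5_MAX_OUTPUT_TOKENS = 10_000

// Hypothetical standalone helper mirroring the capping branch from the diff above.
function capGpt5OutputTokens(settings?: { modelMaxTokens?: number }): number {
	// Allow a user override via settings, but never exceed the GPT-5 cap.
	const userMaxTokens = settings?.modelMaxTokens
	if (userMaxTokens) {
		return Math.min(userMaxTokens, GPT5_MAX_OUTPUT_TOKENS)
	}
	return GPT5_MAX_OUTPUT_TOKENS
}
```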
src/shared/api.ts (Outdated)
```ts
}

// Special handling for GPT-5 models to prevent context window overflow
// Limit max output to 10k tokens as per https://github.com/cline/cline/issues/5474#issuecomment-3172109387
```
The comment could be more descriptive about WHY we're limiting to 10k. Consider: `// Limit max output to 10k tokens to prevent total context window overflow (400k) when input approaches the 272k limit`
src/shared/__tests__/api.spec.ts (Outdated)
```ts
	format: "openai",
})

expect(result).toBe(10_000)
```
Inconsistent number formatting - the implementation uses 10000 but tests mix 10_000 and 10000. Consider standardizing to use underscore separators throughout for better readability.
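For what it's worth, TypeScript numeric separators are purely cosmetic, so standardizing on them is a zero-risk change:

```ts
// The underscore separator does not change the value; both literals are identical.
console.log(10_000 === 10000) // true
```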
src/shared/__tests__/api.spec.ts (Outdated)
```ts
	expect(result).toBe(10_000)
})

test("should respect user override for GPT-5 models but cap at 10k", () => {
```
Consider adding a test case for when settings.modelMaxTokens is undefined or 0 to ensure the default 10k is applied correctly in those edge cases.
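A sketch of what those edge-case tests might look like, following the assertion style visible in the diff above (the exact getModelMaxOutputTokens call signature here is an assumption):

```ts
test("should apply the 10k default when modelMaxTokens is undefined", () => {
	const result = getModelMaxOutputTokens({ modelId: "gpt-5", settings: {} })
	expect(result).toBe(10_000)
})

test("should apply the 10k default when modelMaxTokens is 0", () => {
	// 0 is falsy, so the user-override branch should be skipped entirely.
	const result = getModelMaxOutputTokens({ modelId: "gpt-5", settings: { modelMaxTokens: 0 } })
	expect(result).toBe(10_000)
})
```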
@roomote-agent use switch_tool to switch to pr-fixer and fix this PR

I see your request to switch to pr-fixer mode. Let me analyze and fix this PR for you.
- Extract GPT5_MAX_OUTPUT_TOKENS as a named constant (10,000)
- Improve model detection with more specific regex pattern
- Add comprehensive documentation explaining the context window overflow issue
- Add tests for date-suffixed models and invalid model name patterns
- Update all test assertions to use the named constant
✅ PR has been successfully improved!

Summary of Changes

I have enhanced the GPT-5 token limit implementation with the following improvements:

🔧 Code Quality Improvements

✅ Test Coverage

📊 CI Status

All checks are passing. The PR is now ready for review with improved code quality and maintainability while preserving the original functionality.
- Removed complex regex pattern for detecting GPT-5 models
- Now using a simple `startsWith('gpt-5')` check as requested (see the sketch below)
- Updated tests to reflect the new behavior where any model starting with `gpt-5` is treated as a GPT-5 model
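A minimal sketch of the simplified check (the helper name isGpt5Model is illustrative):

```ts
// Treats any model id beginning with "gpt-5" as a GPT-5 model,
// e.g. "gpt-5", "gpt-5-mini", "gpt-5-nano", or date-suffixed variants.
function isGpt5Model(modelId: string): boolean {
	return modelId.startsWith("gpt-5")
}
```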
This PR fixes the GPT-5 token limit issue by capping the max output tokens to 10k, preventing context window overflow when input approaches the 272k limit.
Problem
GPT-5 models have a max output of 128k tokens. When the input gets close to the 272k input limit, the combined input and output can exceed the total 400k context window, causing API errors.
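To make the arithmetic concrete, here is a small sketch using only the limits quoted above:

```ts
const TOTAL_CONTEXT = 400_000  // total GPT-5 context window
const MAX_INPUT = 272_000      // input token limit
const FULL_OUTPUT = 128_000    // default max output reservation
const CAPPED_OUTPUT = 10_000   // cap introduced by this PR

// Reserving the full output leaves zero headroom once input nears its limit;
// the 10k cap restores a large margin inside the 400k window.
console.log(TOTAL_CONTEXT - (MAX_INPUT + FULL_OUTPUT))   // 0
console.log(TOTAL_CONTEXT - (MAX_INPUT + CAPPED_OUTPUT)) // 118000
```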
Solution
As suggested in cline/cline#5474 (comment), this PR limits the max output tokens to 10k for all GPT-5 models (gpt-5, gpt-5-mini, gpt-5-nano).
Changes
- Updated getModelMaxOutputTokens() to detect GPT-5 models and cap their output at 10k tokens

Testing
Fixes #6856
Important
Caps GPT-5 models' max output tokens to 10k in getModelMaxOutputTokens() to prevent context window overflow, with comprehensive testing added.

- Caps max output tokens to 10k in getModelMaxOutputTokens() to prevent context window overflow.
- Adds tests in api.spec.ts for GPT-5 token limiting behavior.

This description was created by for 054ad59. You can customize this summary. It will automatically update as commits are pushed.