fix: bypass 20% context limit for OpenAI Compatible providers #8834

roomote · 2025-10-26T03:44:16Z

Related GitHub Issue

Closes: #8833

Roo Code Task Context (Optional)

This PR was created with assistance from Roo Code to address the context limit issue.

Description

This PR fixes an issue where OpenAI Compatible API providers were incorrectly limited to 20% of their context window, causing operations to fail when the context exceeded 128k tokens.

Key changes:

Added an exception in getModelMaxOutputTokens() to bypass the 20% context window cap for OpenAI Compatible providers
OpenAI Compatible providers are identified by having apiProvider: 'openai' with a custom openAiBaseUrl (not the default OpenAI API URL)
This allows these providers to use their full configured maxTokens value, similar to the existing GPT-5 models exception

Implementation details:

The fix follows the same pattern as the existing GPT-5 exception
OpenAI Compatible detection checks for a custom baseUrl that differs from the default OpenAI API endpoint
The change maintains backward compatibility for regular OpenAI and other providers

Test Procedure

How I tested:

Added comprehensive unit tests in src/shared/__tests__/api.spec.ts covering:
- OpenAI Compatible providers with various custom URLs bypass the 20% limit
- Regular OpenAI provider still applies the 20% cap
- Edge cases like missing or default baseUrl values
Ran all tests with cd src && npx vitest run shared/__tests__/api.spec.ts - all tests pass
Verified linting passes with the pre-commit hooks

How to verify:

Run the updated tests: cd src && npx vitest run shared/__tests__/api.spec.ts
Configure an OpenAI Compatible provider (e.g., using a local LLM server) with a model that has >128k context
Verify that operations no longer fail when context exceeds 128k tokens

Pre-Submission Checklist

Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
Scope: My changes are focused on the linked issue (one major feature/fix per PR).
Self-Review: I have performed a thorough self-review of my code.
Testing: New and/or updated tests have been added to cover my changes (if applicable).
Documentation Impact: I have considered if my changes require documentation updates (see "Documentation Updates" section below).
Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Screenshots / Videos

N/A - This is a backend fix with no UI changes.

Documentation Updates

No documentation updates are required.
Yes, documentation updates are required. (Consider documenting that OpenAI Compatible providers are not subject to the 20% context window limitation in the provider configuration docs).

Additional Notes

This fix ensures that OpenAI Compatible providers can utilize their full context window capacity, which is essential for users working with large context models through custom API endpoints (like local LLM servers, Groq, Together AI, etc.).

The implementation has been reviewed with 92% confidence and passes all quality checks including:

Code convention adherence
Security assessment (no vulnerabilities)
Comprehensive test coverage

Get in Touch

Available for any questions about this PR.

Important

Bypass 20% context limit for OpenAI Compatible providers in getModelMaxOutputTokens() to allow full use of maxTokens.

Behavior:
- getModelMaxOutputTokens() in api.ts now bypasses the 20% context window cap for OpenAI Compatible providers.
- OpenAI Compatible providers are identified by apiProvider: 'openai' with a custom openAiBaseUrl.
- This change allows full use of maxTokens for these providers, similar to GPT-5 models.
Tests:
- Added tests in api.spec.ts to verify OpenAI Compatible providers bypass the 20% limit.
- Tests cover various custom URLs, regular OpenAI provider, and edge cases like missing openAiBaseUrl.
Misc:
- Maintains backward compatibility for regular OpenAI and other providers.

^{This description was created by}^{for b15c90d. You can customize this summary. It will automatically update as commits are pushed.}

- Added exception for OpenAI Compatible providers to use full maxTokens - OpenAI Compatible providers (identified by custom baseUrl) now bypass the 20% context limit - This fixes the issue where operations fail when context exceeds 128k - Added comprehensive tests for the new behavior Fixes #8833

roomote · 2025-10-26T03:44:38Z

Review Summary

I've reviewed this pull request and identified 1 issue that should be addressed:

URL comparison for OpenAI base URL doesn't handle URL normalization (trailing slashes, case variations)

Follow Along on Roo Code Cloud

roomote · 2025-10-26T03:47:32Z

src/shared/api.ts

+		const isOpenAiCompatible =
+			settings?.apiProvider === "openai" &&
+			settings?.openAiBaseUrl &&
+			settings.openAiBaseUrl !== "https://api.openai.com/v1"


The exact string comparison for the OpenAI base URL doesn't handle URL normalization. If a user configures the official OpenAI API with a trailing slash (https://api.openai.com/v1/) or different casing (https://API.openai.com/v1), the code will incorrectly treat it as an OpenAI Compatible provider and bypass the 20% context limit. This could cause official OpenAI models to attempt using more tokens than allowed. Consider normalizing the URL (trim trailing slashes, lowercase the hostname) before comparison or using URL parsing to compare the normalized forms.

roomote

Review completed. Found 1 issue that should be addressed before merging.

roomote bot requested review from cte, jr and mrubens as code owners October 26, 2025 03:44

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Oct 26, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Oct 26, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Oct 26, 2025

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. bug Something isn't working documentation Improvements or additions to documentation labels Oct 26, 2025

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Oct 26, 2025

roomote bot commented Oct 26, 2025

View reviewed changes

roomote bot mentioned this pull request Oct 26, 2025

[BUG] The context length may be stuck at 128k #8833

Closed

mrubens closed this Oct 26, 2025

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Oct 26, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Oct 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: bypass 20% context limit for OpenAI Compatible providers #8834

fix: bypass 20% context limit for OpenAI Compatible providers #8834

Uh oh!

roomote bot commented Oct 26, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

roomote bot commented Oct 26, 2025 •

edited

Loading

Uh oh!

roomote bot Oct 26, 2025

Uh oh!

roomote bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: bypass 20% context limit for OpenAI Compatible providers #8834

fix: bypass 20% context limit for OpenAI Compatible providers #8834

Uh oh!

Conversation

roomote bot commented Oct 26, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related GitHub Issue

Roo Code Task Context (Optional)

Description

Test Procedure

Pre-Submission Checklist

Screenshots / Videos

Documentation Updates

Additional Notes

Get in Touch

Uh oh!

roomote bot commented Oct 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review Summary

Uh oh!

roomote bot Oct 26, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

roomote bot commented Oct 26, 2025 •

edited by ellipsis-dev bot

Loading

roomote bot commented Oct 26, 2025 •

edited

Loading