fix: add free tier (125k tokens) for Gemini 2.5 Pro models #7754

roomote · 2025-09-07T10:18:20Z

Summary

This PR addresses Issue #7753 by adding a free tier configuration for Gemini 2.5 Pro models to prevent 429 errors when using the free tier.

Problem

Google recently reduced the free tier input token quota for Gemini 2.5 Pro from 250k to 125k tokens. Roo Code was still attempting to send requests up to the old 250k limit, causing repeated 429 "quota exceeded" errors for free tier users.

Solution

Added a new tier configuration with a 125k context window for the free tier across all Gemini 2.5 Pro models:

gemini-2.5-pro
gemini-2.5-pro-preview-03-25
gemini-2.5-pro-preview-05-06
gemini-2.5-pro-preview-06-05

The free tier is configured with:

Context window: 125,000 tokens
Input price: $0
Output price: $0
Cache reads price: $0

This ensures that Intelligent Context Condensing will trigger before reaching the 125k limit, preventing 429 errors.
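For illustration, the free tier entry described above could look roughly like the following. This is a minimal sketch, not the code from this PR: the `ModelTier` type, the `freeTier`, `paidTier` and `tiers` names, the price field names, and the paid-tier prices are assumptions made for the example. Only the 125,000 context window and the $0 prices come from the description.

```typescript
// Hypothetical tier shape. `contextWindow` appears in this PR's diff;
// the price field names are assumed for the example.
interface ModelTier {
	contextWindow: number
	inputPrice: number
	outputPrice: number
	cacheReadsPrice: number
}

// Free tier as described above: 125k context window, zero cost.
const freeTier: ModelTier = {
	contextWindow: 125_000,
	inputPrice: 0,
	outputPrice: 0,
	cacheReadsPrice: 0,
}

// Illustrative paid tier. The prices are placeholders, not real pricing.
const paidTier: ModelTier = {
	contextWindow: Infinity,
	inputPrice: 1,
	outputPrice: 1,
	cacheReadsPrice: 1,
}

// Tiers ordered by ascending context window, free tier first.
const tiers: ModelTier[] = [freeTier, paidTier]
```

Placing the free tier first keeps the list sorted by context window, which is the ordering the tests in this PR are said to verify.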

Testing

Added comprehensive tests in src/api/providers/__tests__/gemini-tier-config.spec.ts
All tests pass (17 new tests)
Existing Gemini provider tests continue to pass
Type checking and linting pass

Impact

This fix will prevent 429 errors for users on the Gemini 2.5 Pro free tier and ensure that context condensing triggers appropriately before hitting the quota limit.

Fixes #7753

Important

Adds a free tier with a 125k token limit to Gemini 2.5 Pro models to prevent 429 errors, with tests verifying the configuration.

Behavior:
- Adds a free tier with a 125k token context window to gemini-2.5-pro, gemini-2.5-pro-preview-03-25, gemini-2.5-pro-preview-05-06, and gemini-2.5-pro-preview-06-05 in gemini.ts.
- Ensures context condensing triggers before reaching the 125k limit to prevent 429 errors.
Testing:
- Adds tests in gemini-tier-config.spec.ts to verify the free tier configuration and tier ordering.
- Confirms no free tier is added to non-2.5-pro models.

This description was created automatically for e2e4f00 and updates as commits are pushed.

- Added a new tier with 125k context window for free tier users
- This prevents 429 errors when using Gemini 2.5 Pro with the free tier
- The free tier has 0 cost for input/output/cache operations
- Added comprehensive tests to verify tier configuration

Fixes #7753

roomote

Reviewing my own code is like debugging in a mirror - everything looks backwards but the bugs are still mine.

roomote · 2025-09-07T10:23:45Z

packages/types/src/providers/gemini.ts

 		tiers: [
+			{
+				// Free tier: 125k input tokens per minute quota
+				contextWindow: 125_000,


Consider extracting this magic number into a constant like GEMINI_25_PRO_FREE_TIER_LIMIT. The value 125_000 appears 8 times across the codebase, and having it as a constant would make future quota adjustments easier to manage.

Suggested change

-				contextWindow: 125_000,
+				// Free tier: 125k input tokens per minute quota
+				contextWindow: GEMINI_25_PRO_FREE_TIER_LIMIT,
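A minimal sketch of the extraction suggested here, assuming the constant is defined once and reused by every affected model entry. The `GEMINI_25_PRO_MODEL_IDS` list and the `freeTierWindows` map are hypothetical and exist only to show the reuse; the real model definitions carry many more fields.

```typescript
// Single source of truth for the free tier quota (name taken from the suggestion above).
const GEMINI_25_PRO_FREE_TIER_LIMIT = 125_000

// The four model IDs listed in the PR description.
const GEMINI_25_PRO_MODEL_IDS = [
	"gemini-2.5-pro",
	"gemini-2.5-pro-preview-03-25",
	"gemini-2.5-pro-preview-05-06",
	"gemini-2.5-pro-preview-06-05",
]

// Hypothetical lookup: each model's free tier window refers to the same constant,
// so a future quota change means editing one line.
const freeTierWindows: Record<string, number> = {}
GEMINI_25_PRO_MODEL_IDS.forEach((id) => {
	freeTierWindows[id] = GEMINI_25_PRO_FREE_TIER_LIMIT
})
```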

roomote · 2025-09-07T10:23:46Z

src/api/providers/__tests__/gemini-tier-config.spec.ts

+				// Verify that the free tier limit is correctly set to prevent 429 errors
+				expect(freeTierLimit).toBe(125_000)
+				expect(expectedTriggerPoint).toBeLessThan(freeTierLimit)
+				expect(expectedTriggerPoint).toBe(87_500) // 70% of 125k


For better test maintainability, could we use the calculation directly instead of hardcoding the result?

Suggested change

-				expect(expectedTriggerPoint).toBe(87_500) // 70% of 125k
+				expect(expectedTriggerPoint).toBe(125_000 * 0.7) // 70% of 125k

This makes it clearer that we're testing the 70% threshold and easier to update if the percentage changes.
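One way to take this a step further is to derive the trigger point from named values rather than repeating either literal. The sketch below is an assumption about how that could look: `FREE_TIER_LIMIT`, `CONDENSE_THRESHOLD_PERCENT` and `condenseTriggerPoint` are hypothetical names, and the 70% figure is taken from the test comment above rather than from the condensing implementation itself.

```typescript
// Values quoted in the test under review.
const FREE_TIER_LIMIT = 125_000
const CONDENSE_THRESHOLD_PERCENT = 70

// Hypothetical helper: token count at which condensing would be expected to start.
// Using an integer percentage keeps the arithmetic in whole numbers.
function condenseTriggerPoint(limit: number, thresholdPercent: number): number {
	return Math.floor((limit * thresholdPercent) / 100)
}

const expectedTriggerPoint = condenseTriggerPoint(FREE_TIER_LIMIT, CONDENSE_THRESHOLD_PERCENT)
```

With these inputs the helper returns 87,500, which leaves 37,500 tokens of headroom below the 125k limit.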

roomote · 2025-09-07T10:23:46Z

src/api/providers/__tests__/gemini-tier-config.spec.ts

+				expect(expectedTriggerPoint).toBeLessThan(freeTierLimit)
+				expect(expectedTriggerPoint).toBe(87_500) // 70% of 125k
+			})
+		})


Would it be valuable to add a test that verifies the tier selection logic in the actual calculateCost method? While we're testing the tier configuration here, we're not directly testing that the correct tier gets selected when processing requests with <125k tokens.

Something like:

it("should select free tier for requests under 125k tokens", () => {
	// Test that calculateCost selects the free tier
	// when input tokens are below 125k
})
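To make the idea concrete, here is a self-contained sketch of the kind of selection logic such a test would exercise. It is not the project's calculateCost method: `Tier`, `selectTier` and `costFor` are hypothetical, the prices are placeholders assumed to be per million tokens, and treating a request of exactly 125,000 tokens as free tier is also an assumption.

```typescript
interface Tier {
	contextWindow: number
	inputPrice: number // assumed: USD per million input tokens
	outputPrice: number // assumed: USD per million output tokens
}

// Free tier first, then a placeholder paid tier with made-up prices.
const exampleTiers: Tier[] = [
	{ contextWindow: 125_000, inputPrice: 0, outputPrice: 0 },
	{ contextWindow: Infinity, inputPrice: 2, outputPrice: 10 },
]

// Pick the first tier whose context window can hold the request.
function selectTier(tiers: Tier[], inputTokens: number): Tier {
	if (tiers.length === 0) {
		throw new Error("no tiers configured")
	}
	for (const tier of tiers) {
		if (inputTokens <= tier.contextWindow) {
			return tier
		}
	}
	// Nothing fits: fall back to the largest (last) tier.
	return tiers[tiers.length - 1]
}

// Cost of one request under the selected tier.
function costFor(tiers: Tier[], inputTokens: number, outputTokens: number): number {
	const tier = selectTier(tiers, inputTokens)
	return (inputTokens / 1_000_000) * tier.inputPrice + (outputTokens / 1_000_000) * tier.outputPrice
}
```

A test along the lines proposed above would then assert that a request below 125k input tokens costs 0 and that a larger request lands in the paid tier.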

daniel-lxs · 2025-09-09T17:04:25Z

Closing, see #7753 (comment)

roomote bot requested review from cte, jr and mrubens as code owners September 7, 2025 10:18

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Sep 7, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Sep 7, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Sep 7, 2025

dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files.) and bug (Something isn't working) labels Sep 7, 2025

roomote bot mentioned this pull request Sep 7, 2025

Gemini 2.5 Pro free tier quota reduced → Roo Code still sending 250k tokens (429 errors, condensing not triggering) #7753

Closed

hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Sep 7, 2025

serge402 mentioned this pull request Sep 8, 2025

Gemini 2.5 Pro free tier quota reduced → Cline still sending 250k tokens (causing 429 errors) cline/cline#6058

Closed

daniel-lxs closed this Sep 9, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Sep 9, 2025

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Sep 9, 2025
