fix: make token counting API requests asynchronous for Gemini/Anthropic #6063

roomote · 2025-07-22T13:50:56Z

This PR resolves #3666 by making the token counting API requests asynchronous, so they no longer block the main inference request.

Changes

Modified countTokens() in GeminiHandler to return a tiktoken estimate immediately while the API call runs in the background
Modified countTokens() in AnthropicHandler to return a tiktoken estimate immediately while the API call runs in the background
Added a private countTokensAsync() method to both providers to handle the actual API call
Added tests to verify the asynchronous behavior and the immediate return
The Vertex provider automatically inherits the fix from GeminiHandler
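
The pattern described in the list above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `SketchHandler`, `RemoteCounter`, `lastRemoteCount`, and `estimateTokensLocally` are hypothetical names, and the local estimate is a crude characters-per-token stand-in for tiktoken rather than a real tiktoken call.

```typescript
// Hypothetical stand-in for a local tiktoken-based estimate
// (assumes roughly 4 characters per token; not a real tiktoken call).
function estimateTokensLocally(text: string): number {
  return Math.ceil(text.length / 4);
}

// Hypothetical shape of a provider's remote token counting call.
type RemoteCounter = (text: string) => Promise<number>;

class SketchHandler {
  // Most recent count reported by the remote API, if one has arrived yet.
  lastRemoteCount: number | undefined = undefined;

  private readonly remoteCount: RemoteCounter;

  constructor(remoteCount: RemoteCounter) {
    this.remoteCount = remoteCount;
  }

  // Returns the local estimate without waiting for the remote API.
  async countTokens(text: string): Promise<number> {
    // Fire and forget: the remote call runs in the background.
    void this.countTokensAsync(text);
    return estimateTokensLocally(text);
  }

  // Performs the remote call and stores the result when it arrives.
  // Failures are logged and otherwise ignored, so the estimate still stands.
  private async countTokensAsync(text: string): Promise<void> {
    try {
      this.lastRemoteCount = await this.remoteCount(text);
    } catch (error) {
      console.error("Remote token count failed:", error);
    }
  }
}
```

In this sketch a caller gets the estimate right away and can read `lastRemoteCount` later if the accurate value is needed. How the real handlers surface the late result is not stated in this description.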

Benefits

Main inference requests start immediately without waiting for token counting
Preserves accurate token counts from the API (they just arrive asynchronously)
Fallback to tiktoken ensures we always have a working estimate
No breaking changes to the API

Testing

All existing tests pass
Added new tests specifically for the asynchronous behavior
Verified that token counting returns immediately (< 100ms) even when the API call takes longer
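
A timing check of this kind could look roughly like the sketch below. It is written without a test framework so it stands alone; `countTokens`, `slowRemote`, and `elapsedMs` are hypothetical helpers for illustration, not the functions or specs added in this PR.

```typescript
// Hypothetical function under test: returns a local estimate immediately
// and lets the remote call finish in the background.
async function countTokens(
  text: string,
  remote: (text: string) => Promise<number>,
): Promise<number> {
  void remote(text).catch((error) => console.error("Remote count failed:", error));
  return Math.ceil(text.length / 4); // crude stand-in for a tiktoken estimate
}

// Simulates a remote API that takes delayMs milliseconds to answer.
function slowRemote(delayMs: number): (text: string) => Promise<number> {
  return () => new Promise<number>((resolve) => setTimeout(() => resolve(42), delayMs));
}

// Measures how long an async operation takes to settle, in milliseconds.
async function elapsedMs(run: () => Promise<unknown>): Promise<number> {
  const start = Date.now();
  await run();
  return Date.now() - start;
}
```

For example, `elapsedMs(() => countTokens("hello world", slowRemote(300)))` should settle well under the 100ms threshold quoted above, because the 300ms remote delay is never awaited.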

Fixes #3666

Important

Make token counting asynchronous in GeminiHandler and AnthropicHandler, improving performance by not blocking main requests.

Behavior:
- countTokens() in GeminiHandler and AnthropicHandler now returns tiktoken estimate immediately, with API call in background.
- Introduces countTokensAsync() in both handlers for asynchronous API calls.
- Vertex provider inherits changes from GeminiHandler.
Testing:
- Adds tests in anthropic.spec.ts and gemini.spec.ts to verify asynchronous behavior and immediate returns.
- Ensures token counting returns immediately (< 100ms) even if API call is delayed.
Misc:
- No breaking changes to the API.
- Logs errors to console if async API call fails.

This description was automatically generated for commit 0f08bce. It will automatically update as commits are pushed.

- Modified countTokens() in GeminiHandler to return tiktoken estimate immediately
- Modified countTokens() in AnthropicHandler to return tiktoken estimate immediately
- API calls now happen asynchronously in the background without blocking inference
- Added tests to verify asynchronous behavior and immediate returns
- Vertex provider automatically inherits the fix from GeminiHandler

Fixes #3666

daniel-lxs · 2025-07-22T15:47:42Z

Incorrect approach; the issue isn't scoped with enough detail.

roomote bot requested review from cte, jr and mrubens as code owners July 22, 2025 13:50

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jul 22, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jul 22, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jul 22, 2025

dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files.) and bug (Something isn't working) labels Jul 22, 2025

roomote bot mentioned this pull request Jul 22, 2025

Gemini/Anthropic: Stop using remote token count APIs #3666

Closed

4 tasks

hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Jul 22, 2025

daniel-lxs closed this Jul 22, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jul 22, 2025

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Jul 22, 2025
