feat: vertex/gemini prompt caching #2996

ashktn · 2025-04-28T04:04:24Z

Moving to @google/genai and reusing the gemini provider when using gemini on vertex ai

Context

Use the gemini provider when using a gemini model is selected on vertex ai.
@google/genai now support both Google Gemini and Vertex AI APIs
Prompt caching is already implemented in the gemini provider

Implementation

Used the gemini provider when a gemini model is selected on vertex ai.

Screenshots

Screen.Recording.2025-04-28.at.00.02.30.mov

How to Test

Select a gemini model that supports prompt caching on vertex ai. enable prompt caching

Get in Touch

ashktn

Important

Integrates Gemini prompt caching on Vertex AI by using GeminiHandler, updating models, and modifying tests and configurations.

Behavior:
- Integrates GeminiHandler for prompt caching with Gemini models on Vertex AI in vertex.ts and gemini.ts.
- Updates vertexModels in api.ts to support prompt caching for Gemini models.
- Removes @google-cloud/vertexai dependency from package.json.
Testing:
- Updates vertex.test.ts to mock GeminiHandler and test prompt caching behavior.
- Removes vertex-gemini-format.test.ts and vertex-gemini-format.ts as they are no longer needed.
Configuration:
- Adds isVertex option to ProviderSettings in roo-code.d.ts, types.ts, and schemas/index.ts to distinguish between Gemini and Vertex AI usage.

^{This description was created by}^{for f867837e867308b3e74f0cd15bf6701537d34b83. You can customize this summary. It will automatically update as commits are pushed.}

changeset-bot · 2025-04-28T04:04:28Z

🦋 Changeset detected

Latest commit: 77e4df6

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
roo-cline	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

ellipsis-dev · 2025-04-28T04:07:12Z

src/api/providers/gemini.ts

Consider wrapping JSON.parse(this.options.vertexJsonCredentials) in a try/catch block to handle potential JSON parsing errors. This improves resiliency if invalid credentials are provided.

^{This comment was generated because it violated a code review rule: mrule_OR1S8PRRHcvbdFib.}

Moving to @google/genai and reusing the gemini provider when using gemini on vertex ai

adamhill · 2025-04-28T13:52:07Z

Thanks for tackling this difficult problem!

Even though you don't think it is perfect. We don't have to eat the elephant all in on bite. The whole troop can work on improving it now. 🦘❤️

Adding your comment from the channel Thread for completeness:

It seems like the right direction to use the @google/genai lib over @google-cloud/vertexai since google is encouraging using genai. The new lib says it works for Gemini 2.0, but i had no issues using Gemini 1.5

From a code structure perspective, it just didn't feel clean importing one provider from the other provider. As a future refactor task, we could move the common gemini code to a new file and then import that file in both providers

mrubens · 2025-04-28T15:22:08Z

@cte are you able to review this one? Thanks!

cte

I moved some stuff around; LMK if you any questions about my changes.

ashktn · 2025-04-28T19:54:54Z

I moved some stuff around; LMK if you any questions about my changes.

looks great!

ashktn · 2025-04-28T20:01:14Z

minor: seems like gemini-2.5-flash-preview-04-17 supports caching as per this doc- could be different between gemini vs vertex
https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-overview#supported_models

ashktn requested review from cte and mrubens as code owners April 28, 2025 04:04

github-project-automation bot moved this to New in Roo Code Roadmap Apr 28, 2025

github-project-automation bot added this to Roo Code Roadmap Apr 28, 2025

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. Enhancement New feature or request labels Apr 28, 2025

ellipsis-dev bot reviewed Apr 28, 2025

View reviewed changes

feat: vertex/gemini prompt caching

96b7921

Moving to @google/genai and reusing the gemini provider when using gemini on vertex ai

ashktn force-pushed the feat/vertex-gemini-caching branch from f867837 to 96b7921 Compare April 28, 2025 11:23

cte approved these changes Apr 28, 2025

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Apr 28, 2025

Merge branch 'main' into cte/vertex-gemini-caching

85ef3ad

hannesrudolph moved this from New to PR [Pre Approval Review] in Roo Code Roadmap Apr 28, 2025

Cleanup

bf92e8b

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:XL This PR changes 500-999 lines, ignoring generated files. labels Apr 28, 2025

cte added 2 commits April 28, 2025 11:56

Cleanup

fffae25

gemini-2.5-flash-preview-04-17 doesn't have caching

8b40a74

cte approved these changes Apr 28, 2025

View reviewed changes

cte added 4 commits April 28, 2025 13:57

Merge branch 'main' into cte/vertex-gemini-caching

0c8e03c

Fix JSON parse error

2c9a790

Fix JSON parse error

fa82dff

Fix tests

77e4df6

cte merged commit a3d8c0e into RooCodeInc:main Apr 28, 2025
12 checks passed

github-project-automation bot moved this from PR [Pre Approval Review] to Done in Roo Code Roadmap Apr 28, 2025

shariqriazz pushed a commit to shariqriazz/Roo-Code that referenced this pull request Apr 29, 2025

feat: vertex/gemini prompt caching (RooCodeInc#2996)

09a9cc6

hannesrudolph mentioned this pull request Apr 29, 2025

Promt Caching for Gemini 2.5 Flash #2989

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: vertex/gemini prompt caching #2996

feat: vertex/gemini prompt caching #2996

Uh oh!

ashktn commented Apr 28, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

changeset-bot bot commented Apr 28, 2025 •

edited

Loading

Uh oh!

ellipsis-dev bot Apr 28, 2025

Uh oh!

adamhill commented Apr 28, 2025

Uh oh!

mrubens commented Apr 28, 2025

Uh oh!

cte left a comment

Uh oh!

ashktn commented Apr 28, 2025

Uh oh!

ashktn commented Apr 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: vertex/gemini prompt caching #2996

feat: vertex/gemini prompt caching #2996

Uh oh!

Conversation

ashktn commented Apr 28, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Implementation

Screenshots

How to Test

Get in Touch

Uh oh!

changeset-bot bot commented Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

ellipsis-dev bot Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

adamhill commented Apr 28, 2025

Uh oh!

mrubens commented Apr 28, 2025

Uh oh!

cte left a comment

Choose a reason for hiding this comment

Uh oh!

ashktn commented Apr 28, 2025

Uh oh!

ashktn commented Apr 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ashktn commented Apr 28, 2025 •

edited by ellipsis-dev bot

Loading

changeset-bot bot commented Apr 28, 2025 •

edited

Loading