Google/Vertex Request Builder by codeshaunted · Pull Request #3266 · BoundaryML/baml

codeshaunted · 2026-03-23T22:37:35Z

Summary by CodeRabbit

New Features
- Added support for Google Vertex AI (Gemini) and Google AI (Gemini) model providers
- Enabled streaming requests for Google-powered LLM services
- Added media content support (images and videos) for these providers
- Implemented support for Anthropic models running on Vertex AI infrastructure

vercel · 2026-03-23T22:37:40Z

The latest updates on your projects.

Project	Deployment	Actions	Updated (UTC)
beps	Ready	Preview, Comment	Mar 23, 2026 11:36pm
promptfiddle	Ready	Preview, Comment	Mar 23, 2026 11:36pm

coderabbitai · 2026-03-23T22:37:43Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: 6cea2dc1-a1dd-41a9-951c-e5bd4407a770

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

Adds support for Google Cloud AI Platform (Vertex AI) and Google AI with Gemini models through a new GoogleBuilder that constructs HTTP requests using the google-cloud-aiplatform-v1 SDK. Request routing now distinguishes Anthropic-on-Vertex from the native Vertex/Google AI implementations: Anthropic-on-Vertex requests are built by the Anthropic builder and then have their URL rewritten to a Vertex endpoint.

Changes

Cohort / File(s)	Summary
Dependencies `baml_language/Cargo.toml`, `baml_language/crates/sys_llm/Cargo.toml`	Added `google-cloud-aiplatform-v1` (v1.8) dependency with `prediction-service` feature to workspace and sys_llm crate.
Google Builder Implementation `baml_language/crates/sys_llm/src/build_request/google.rs`	New file implementing `GoogleBuilder` with HTTP request construction for Gemini models, supporting Vertex AI and Google AI URLs, request body building, prompt-to-GCP type conversion, and media handling (text, image, video). Includes comprehensive unit tests for URL generation, body construction, option forwarding, and media support.
Request Routing & Authorization `baml_language/crates/sys_llm/src/build_request/mod.rs`	Added `google` module and `is_anthropic_on_vertex()` helper. Updated routing to dispatch `VertexAi` + Anthropic to `AnthropicBuilder` with URL rewriting, `VertexAi` + non-Anthropic to `GoogleBuilder`, and `GoogleAi` to `GoogleBuilder`. Authorization updated to handle Anthropic-on-Vertex separately and leave non-Anthropic Vertex/Google requests unauthenticated. Added Vertex AI and Google AI test coverage.
Provider Enum Updates `baml_language/crates/sys_llm/src/provider.rs`	Reorganized `LlmProvider` enum: moved `GoogleAi` variant and updated doc comments to clarify both `VertexAi` and `GoogleAi` use `GenerateContentRequest` serialization.

Sequence Diagram

sequenceDiagram
    participant Client
    participant Router as Request Router
    participant AnthropicBuilder
    participant GoogleBuilder
    participant VertexAPI as Vertex API
    participant GoogleAPI as Google AI API

    Client->>Router: build_request(provider, model, prompt)
    
    alt VertexAi + Anthropic (claude-*)
        Router->>AnthropicBuilder: build_request()
        AnthropicBuilder->>AnthropicBuilder: construct Anthropic request
        AnthropicBuilder-->>Router: return Anthropic HTTP request
        Router->>Router: rewrite URL to<br/>Vertex rawPredict/streamRawPredict
        Router-->>Client: return Vertex-addressed request
    else VertexAi + Gemini
        Router->>GoogleBuilder: build_request()
        GoogleBuilder->>GoogleBuilder: build Vertex URL<br/>convert prompt to GCP types<br/>serialize GenerateContentRequest
        GoogleBuilder-->>Router: return HTTP request
        Router-->>Client: return request (no auth)
    else GoogleAi
        Router->>GoogleBuilder: build_request()
        GoogleBuilder->>GoogleBuilder: build Google AI URL<br/>convert prompt to GCP types<br/>serialize GenerateContentRequest
        GoogleBuilder-->>Router: return HTTP request
        Router-->>Client: return request (no auth)
    end

    Client->>VertexAPI: POST request
    Client->>GoogleAPI: POST request
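
The three branches in the diagram can be sketched as a single `match`. This is a minimal illustration and not the PR's code: `Route`, `is_anthropic_model`, and `route` are hypothetical names, the enum is a reduced stand-in for the real `LlmProvider`, and the `claude-` prefix check is an assumption taken from the `claude-*` label in the diagram (the PR's `is_anthropic_on_vertex` takes a client, not a model name).

```rust
/// Reduced, hypothetical stand-in for the crate's `LlmProvider` enum.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum LlmProvider {
    Anthropic,
    VertexAi,
    GoogleAi,
}

/// Which builder path a request takes (hypothetical type for illustration).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Route {
    /// Native Anthropic request, no URL rewriting.
    Anthropic,
    /// Anthropic builder, then URL rewritten to a Vertex endpoint.
    AnthropicOnVertex,
    /// GoogleBuilder, used for Gemini on Vertex AI and on Google AI.
    Google,
}

/// Assumption: a Vertex model counts as Anthropic when its name starts with `claude-`.
pub fn is_anthropic_model(model: &str) -> bool {
    model.starts_with("claude-")
}

/// Pick a route from the provider and model name.
pub fn route(provider: LlmProvider, model: &str) -> Route {
    match provider {
        // The guarded arm must come first so Claude models never reach GoogleBuilder.
        LlmProvider::VertexAi if is_anthropic_model(model) => Route::AnthropicOnVertex,
        LlmProvider::VertexAi | LlmProvider::GoogleAi => Route::Google,
        LlmProvider::Anthropic => Route::Anthropic,
    }
}
```

Ordering matters here: if the unguarded `VertexAi | GoogleAi` arm came first, the Anthropic-on-Vertex arm would be unreachable.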

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

anthropic provider #3235: Adds and refactors Anthropic build_request implementation, which is directly integrated by this PR's Anthropic-on-Vertex routing that forwards to the Anthropic builder and rewrites URLs to Vertex endpoints.

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Google/Vertex Request Builder' directly and specifically describes the main change - adding a new request builder for Google Cloud and Vertex AI platforms.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.


codeshaunted · 2026-03-23T22:37:45Z

@coderabbitai review

coderabbitai · 2026-03-23T22:37:51Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

codspeed-hq · 2026-03-23T22:44:18Z

Merging this PR will not alter performance

⚠️

Unknown Walltime execution environment detected

Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.

For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.

✅ 15 untouched benchmarks
⏩ 91 skipped benchmarks¹

Comparing avery/google (dd853c1) with canary (dddd673)

¹ 91 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them to remove them from the performance reports.

coderabbitai

Actionable comments posted: 3

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: ASSERTIVE

Plan: Pro

Run ID: e62a614d-9242-4350-b7d0-37b6cd05eb59

📥 Commits

Reviewing files that changed from the base of the PR and between dddd673 and 8218aa8.

⛔ Files ignored due to path filters (1)

baml_language/Cargo.lock is excluded by !**/*.lock

📒 Files selected for processing (5)

baml_language/Cargo.toml
baml_language/crates/sys_llm/Cargo.toml
baml_language/crates/sys_llm/src/build_request/google.rs
baml_language/crates/sys_llm/src/build_request/mod.rs
baml_language/crates/sys_llm/src/provider.rs

coderabbitai · 2026-03-23T22:47:58Z

baml_language/crates/sys_llm/src/build_request/google.rs

+    Ok(format!(
+        "https://{domain}/v1/projects/{project_id}/locations/{location}/publishers/google/models/{model}:{rpc}"
+    ))


⚠️ Potential issue | 🔴 Critical

Incorrect publisher path for Anthropic models.

The URL template uses publishers/google for Anthropic/Claude models:

"https://{domain}/v1/projects/{project_id}/locations/{location}/publishers/google/models/{model}:{rpc}"

Anthropic models on Vertex AI require publishers/anthropic in the path, not publishers/google. This will cause 404 errors when making requests to Claude models.

🐛 Proposed fix

 Ok(format!(
-    "https://{domain}/v1/projects/{project_id}/locations/{location}/publishers/google/models/{model}:{rpc}"
+    "https://{domain}/v1/projects/{project_id}/locations/{location}/publishers/anthropic/models/{model}:{rpc}"
 ))
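
One way to keep the publisher segment from being hardcoded is to pass it in explicitly. The following is a sketch under assumptions, not the PR's `vertex_anthropic_url()`: the `Publisher` enum and the `vertex_url` function are hypothetical, and the host is assumed to follow the `{location}-aiplatform.googleapis.com` pattern quoted in this review.

```rust
/// Hypothetical enum naming the publisher segment of a Vertex AI model path.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Publisher {
    Google,
    Anthropic,
}

impl Publisher {
    /// The literal path segment used after `publishers/`.
    pub fn as_str(self) -> &'static str {
        match self {
            Publisher::Google => "google",
            Publisher::Anthropic => "anthropic",
        }
    }
}

/// Build a Vertex AI model URL (hypothetical helper).
/// `rpc` is the method suffix, e.g. `rawPredict` or `streamRawPredict`.
pub fn vertex_url(
    project_id: &str,
    location: &str,
    publisher: Publisher,
    model: &str,
    rpc: &str,
) -> String {
    // Assumption: regional host of the form `{location}-aiplatform.googleapis.com`.
    let domain = format!("{location}-aiplatform.googleapis.com");
    let publisher = publisher.as_str();
    format!(
        "https://{domain}/v1/projects/{project_id}/locations/{location}/publishers/{publisher}/models/{model}:{rpc}"
    )
}
```

With the publisher as a parameter, the Claude path and the Gemini path can share one template, and a test can assert on `publishers/anthropic` directly.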

coderabbitai · 2026-03-23T22:47:59Z

baml_language/crates/sys_llm/src/build_request/mod.rs

+        LlmProvider::VertexAi if is_anthropic_on_vertex(client) => {
+            // Anthropic-on-Vertex needs the anthropic-version header.
+            // Vertex OAuth2 auth is not yet implemented here.
+            crate::auth_request::AnthropicAuth
+                .authorize(raw, client, &callbacks)
+                .await?
+        }
+        LlmProvider::VertexAi | LlmProvider::GoogleAi => {
+            // Auth not yet implemented -- return the raw request as-is.
+            raw
+        }


⚠️ Potential issue | 🟠 Major

Missing authentication implementation for Vertex AI and Google AI.

The authorization section returns the raw request unchanged for non-Anthropic Vertex AI and Google AI providers. Per the context snippet from auth_request/mod.rs, only AnthropicAuth and OpenAiAuth implementations exist — there's no GoogleAuth or VertexAuth.

This means:

- Vertex AI requests will lack the required OAuth2 bearer token
- Google AI requests will lack the API key (typically passed as a ?key=... query param)

Both providers will fail at runtime without proper authentication.

Would you like me to help implement the GoogleAuth handler that:

- For Google AI: appends the api_key as a query parameter (?key=...)
- For Vertex AI: adds an OAuth2 bearer token from credentials/ADC
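
The two behaviours proposed above could look roughly like this. It is a sketch only: `RawRequest`, `GoogleCredential`, and `authorize` are hypothetical names that do not mirror the crate's `auth_request` trait, and obtaining the OAuth2 token (for example from ADC) is assumed to happen elsewhere.

```rust
/// Hypothetical minimal request shape; the crate's real request type differs.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct RawRequest {
    pub url: String,
    pub headers: Vec<(String, String)>,
}

/// Hypothetical credential kinds for the two Google-backed providers.
pub enum GoogleCredential {
    /// Google AI: API key sent as a `key` query parameter.
    ApiKey(String),
    /// Vertex AI: OAuth2 access token, already obtained by the caller.
    BearerToken(String),
}

/// Attach the credential to the request and return it.
pub fn authorize(mut req: RawRequest, cred: &GoogleCredential) -> RawRequest {
    match cred {
        GoogleCredential::ApiKey(key) => {
            // Use `&` when the URL already has a query string (e.g. `?alt=sse`).
            let sep = if req.url.contains('?') { '&' } else { '?' };
            req.url.push(sep);
            req.url.push_str("key=");
            // Assumption: the key is URL-safe, so no percent-encoding is applied here.
            req.url.push_str(key);
        }
        GoogleCredential::BearerToken(token) => {
            req.headers
                .push(("Authorization".to_string(), format!("Bearer {token}")));
        }
    }
    req
}
```

Keeping token acquisition outside `authorize` leaves this step synchronous and easy to unit test. A real handler would also need to refresh expired tokens.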

coderabbitai · 2026-03-23T22:47:59Z

baml_language/crates/sys_llm/src/build_request/mod.rs

+        assert!(result.url.contains("claude-3-5-sonnet@20241022:rawPredict"),);
+        assert_eq!(
+            result.url,
+            "https://us-east5-aiplatform.googleapis.com/v1/projects/my-project/locations/us-east5/publishers/google/models/claude-3-5-sonnet@20241022:rawPredict"
+        );


⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🌐 Web query:

Vertex AI Anthropic Claude rawPredict endpoint URL format

💡 Result:

The rawPredict endpoint URL format for Vertex AI Anthropic Claude models is:

https://{LOCATION}-aiplatform.googleapis.com/v1/projects/{PROJECT_ID}/locations/{LOCATION}/publishers/anthropic/models/{MODEL}:rawPredict

Where:
- {LOCATION}: The region (e.g., 'us-central1', 'global' for global endpoint).
- {PROJECT_ID}: Your Google Cloud project ID.
- {MODEL}: The Claude model ID (e.g., 'claude-3-5-sonnet@20240620', 'claude-opus-4-6').

Use the POST method with an Authorization Bearer token and a JSON body including "anthropic_version": "vertex-2023-10-16", messages, max_tokens, etc. For streaming, use :streamRawPredict instead.

Citations:

1: https://docs.anthropic.com/en/api/claude-on-vertex-ai

2: https://docs.cloud.google.com/vertex-ai/docs/reference/rest/v1/projects.locations.endpoints/rawPredict

3: https://docs.cloud.google.com/vertex-ai/generative-ai/docs/partner-models/claude/use-claude

4: https://minimusarchitecture.com/?_=%2Fvertex-ai%2Fgenerative-ai%2Fdocs%2Fpartner-models%2Fclaude%2Fuse-claude%23NpZ%2FcdvQaILjKBVh9YuxMiuGJWbHhSDB4mkVfbY%3D

5: https://platform.claude.com/docs/en/api/claude-on-vertex-ai

🏁 Script executed:

# Read the test file to verify the exact code at lines 1131-1135
sed -n '1100,1160p' baml_language/crates/sys_llm/src/build_request/mod.rs

Repository: BoundaryML/baml

Length of output: 2226

🏁 Script executed:

# Also check the broader context of this test function to understand what it's testing
sed -n '1097,1155p' baml_language/crates/sys_llm/src/build_request/mod.rs

Repository: BoundaryML/baml

Length of output: 2186

🏁 Script executed:

# Search for the implementation that constructs these URLs to understand the logic
rg "publishers" baml_language/crates/sys_llm/src/build_request/ -A 2 -B 2

Repository: BoundaryML/baml

Length of output: 2996

🏁 Script executed:

# Check if there's conditional logic for Anthropic vs Google models
rg "anthropic" baml_language/crates/sys_llm/src/build_request/ -B 3 -A 3

Repository: BoundaryML/baml

Length of output: 43215

🏁 Script executed:

# Look at the full google.rs implementation to see the build_url function context
cat -n baml_language/crates/sys_llm/src/build_request/google.rs | head -150

Repository: BoundaryML/baml

Length of output: 6661

🏁 Script executed:

# Check the complete vertex_anthropic_url function
sed -n '89,123p' baml_language/crates/sys_llm/src/build_request/google.rs

Repository: BoundaryML/baml

Length of output: 1329

Fix Vertex AI URL publisher for Anthropic models: use publishers/anthropic instead of publishers/google.

The Vertex AI URL for Anthropic models must use publishers/anthropic, not publishers/google. This issue exists in both the implementation and the test:

- Implementation (google.rs line 121): The vertex_anthropic_url() function hardcodes publishers/google in the URL template.
- Test (lines 1131-1135): The assertion expects the incorrect URL with publishers/google.

According to official Google Cloud and Anthropic documentation, the correct format is:

https://{location}-aiplatform.googleapis.com/v1/projects/{project_id}/locations/{location}/publishers/anthropic/models/{model}:rawPredict

Using publishers/google will cause API requests to fail in production.

github-actions · 2026-03-23T22:51:31Z

Binary size checks passed

✅ 7 passed

	Artifact	Platform	Gzip	Baseline	Delta	Status
✅	`bridge_cffi`	Linux	5.7 MB	5.7 MB	+9.5 KB (+0.2%)	OK
✅	`bridge_cffi-stripped`	Linux	4.3 MB	4.3 MB	+7.6 KB (+0.2%)	OK
✅	`bridge_cffi`	macOS	4.6 MB	4.6 MB	+7.4 KB (+0.2%)	OK
✅	`bridge_cffi-stripped`	macOS	3.5 MB	3.5 MB	+4.6 KB (+0.1%)	OK
✅	`bridge_cffi`	Windows	4.6 MB	4.6 MB	+4.8 KB (+0.1%)	OK
✅	`bridge_cffi-stripped`	Windows	3.6 MB	3.5 MB	+5.2 KB (+0.1%)	OK
✅	`bridge_wasm`	WASM	3.0 MB	3.0 MB	+3.6 KB (+0.1%)	OK

Generated by cargo size-gate · workflow run

codeshaunted · 2026-03-23T23:14:16Z

@coderabbitai review

coderabbitai · 2026-03-23T23:14:25Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

vercel bot deployed to Preview – beps March 23, 2026 22:38 View deployment

coderabbitai bot requested changes Mar 23, 2026

vercel bot deployed to Preview – beps March 23, 2026 22:52 View deployment

vercel bot deployed to Preview – promptfiddle March 23, 2026 23:12 View deployment

codeshaunted added 2 commits March 23, 2026 19:16

draft of google/vertex request building

285f292

migrate away from using actual SDK for requests

dd853c1

codeshaunted force-pushed the avery/google branch from bdeb626 to dd853c1 Compare March 23, 2026 23:16

vercel bot deployed to Preview – beps March 23, 2026 23:17 View deployment

vercel bot deployed to Preview – promptfiddle March 23, 2026 23:36 View deployment
