You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TwelveLabs Marengo models support multimodal embeddings (text, image, video, audio) and require the `input_type` parameter to specify the input format.
Copy file name to clipboardExpand all lines: docs/my-website/docs/pass_through/vertex_ai.md
+47-2Lines changed: 47 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,10 +15,11 @@ Pass-through endpoints for Vertex AI - call provider-specific endpoint, in nativ
15
15
16
16
## Supported Endpoints
17
17
18
-
LiteLLM supports 2 vertex ai passthrough routes:
18
+
LiteLLM supports 3 vertex ai passthrough routes:
19
19
20
20
1.`/vertex_ai` → routes to `https://{vertex_location}-aiplatform.googleapis.com/`
21
21
2.`/vertex_ai/discovery` → routes to [`https://discoveryengine.googleapis.com`](https://discoveryengine.googleapis.com/)
22
+
3.`/vertex_ai/live` → upgrades to the Vertex AI Live API WebSocket (`google.cloud.aiplatform.v1.LlmBidiService/BidiGenerateContent`)
22
23
23
24
## How to use
24
25
@@ -170,6 +171,50 @@ generateContent();
170
171
</Tabs>
171
172
172
173
174
+
## Vertex AI Live API WebSocket
175
+
176
+
LiteLLM can now proxy the Vertex AI Live API to help you experiment with streaming audio/text from Gemini Live models without exposing Google credentials to clients.
177
+
178
+
- Configure default Vertex credentials via `default_vertex_config` or environment variables (see examples above).
179
+
- Connect to `wss://<PROXY_URL>/vertex_ai/live`. LiteLLM will exchange your saved credentials for a short-lived access token and forward messages bidirectionally.
180
+
- Optional query params `vertex_project`, `vertex_location`, and `model` let you override defaults for multi-project setups or global-only models.
0 commit comments