You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- **cubesql:** Split `__user` WHERE predicate into separate filter node ([#8812](https://github.com/cube-js/cube/issues/8812)) ([83baf7b](https://github.com/cube-js/cube/commit/83baf7bf5f83108fd6c3dd134a8739968e781f92))
- **cubesql:** Match CubeScan timestamp literal types to member types ([#9275](https://github.com/cube-js/cube/issues/9275)) ([4a4e82b](https://github.com/cube-js/cube/commit/4a4e82ba602fc024a262a22ac65e3fcb7a4bba5c))
@@ -230,36 +230,125 @@ Search can only be used on string dimensions.
230
230
231
231
If desired, you may "bring your own" LLM model by providing a model and API credentials
232
232
for a supported model provider. Do this by setting environment variables in your Cube
233
-
deployment. See below for required variables by provider (required unless noted):
233
+
deployment.
234
+
235
+
- `CUBE_CLOUD_AI_COMPLETION_MODEL`- The AI model name to use (varies based on provider). For example `gpt-4o`.
236
+
- `CUBE_CLOUD_AI_COMPLETION_PROVIDER` - The provider. Must be one of the following:
237
+
- `amazon-bedrock`
238
+
- `anthropic`
239
+
- `azure`
240
+
- `cohere`
241
+
- `deepseek`
242
+
- `fireworks`
243
+
- `google-generative-ai`
244
+
- `google-vertex-ai`
245
+
- `google-vertex-ai-anthropic`
246
+
- `groq`
247
+
- `mistral`
248
+
- `openai`
249
+
- `openai-compatible`(any provider with an OpenAI-compatible API; support may vary)
250
+
- `together-ai`
251
+
- `x-ai`
252
+
253
+
See below for required variables by provider (required unless noted):
234
254
235
255
#### AWS Bedrock
236
256
237
257
<WarningBox>
238
-
The AI API currently supports only Anthropic Claude models on AWS Bedrock. Other
239
-
models may work but are not fully supported.
258
+
The AI API currently supports only Anthropic Claude models on AWS Bedrock.
259
+
Other models may work but are not fully supported.
240
260
</WarningBox>
241
261
242
-
- `CUBE_BEDROCK_MODEL_ID`- A supported [AWS Bedrock chat model](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html), for example `anthropic.claude-3-5-sonnet-20241022-v2:0`
243
-
- `CUBE_BEDROCK_ACCESS_KEY`- An access key for an IAM user with `InvokeModelWithResponseStream` permissions on the desired region/model.
244
-
- `CUBE_BEDROCK_ACCESS_SECRET`- The corresponding access secret
245
-
- `CUBE_BEDROCK_REGION_ID`- A supported AWS Bedrock region, for example `us-west-2`
262
+
- `CUBE_CLOUD_AI_AWS_ACCESS_KEY_ID`- An access key for an IAM user with `InvokeModelWithResponseStream` permissions on the desired region/model.
263
+
- `CUBE_CLOUD_AI_AWS_SECRET_ACCESS_KEY`- The corresponding access secret
264
+
- `CUBE_CLOUD_AI_AWS_REGION`- A supported AWS Bedrock region, for example `us-west-2`
265
+
- `CUBE_CLOUD_AI_AWS_SESSION_TOKEN`- The session token (optional)
246
266
247
-
#### GCP Vertex
267
+
#### Anthropic
268
+
269
+
- `CUBE_CLOUD_AI_ANTHROPIC_API_KEY`
270
+
- `CUBE_CLOUD_AI_ANTHROPIC_BASE_URL`- uses a different URL prefix for API calls, such as if you are using behind a proxy (optional)
271
+
272
+
#### Microsoft Azure OpenAI
273
+
274
+
- `CUBE_CLOUD_AI_AZURE_RESOURCE_NAME`
275
+
- `CUBE_CLOUD_AI_AZURE_API_KEY`
276
+
- `CUBE_CLOUD_AI_AZURE_API_VERSION`(optional)
277
+
- `CUBE_CLOUD_AI_AZURE_BASE_URL`(optional)
278
+
279
+
#### Cohere
280
+
281
+
- `CUBE_CLOUD_AI_COHERE_API_KEY`
282
+
- `CUBE_CLOUD_AI_COHERE_BASE_URL`- uses a different URL prefix for API calls, such as if you are using behind a proxy (optional)
283
+
284
+
#### DeepSeek
285
+
286
+
- `CUBE_CLOUD_AI_DEEPSEEK_API_KEY`
287
+
- `CUBE_CLOUD_AI_DEEPSEEK_BASE_URL`- uses a different URL prefix for API calls, such as if you are using behind a proxy (optional)
288
+
289
+
#### Fireworks
290
+
291
+
- `CUBE_CLOUD_AI_FIREWORKS_API_KEY`
292
+
- `CUBE_CLOUD_AI_FIREWORKS_BASE_URL`- uses a different URL prefix for API calls, such as if you are using behind a proxy (optional)
293
+
294
+
#### Google Generative AI
295
+
296
+
- `CUBE_CLOUD_AI_GOOGLE_GENERATIVE_AI_API_KEY`
297
+
- `CUBE_CLOUD_AI_GOOGLE_GENERATIVE_AI_BASE_URL`- uses a different URL prefix for API calls, such as if you are using behind a proxy (optional)
298
+
299
+
#### GCP Vertex AI
248
300
249
301
<WarningBox>
250
-
The AI API currently supports only Anthropic Claude models on GCP Vertex. Other
251
-
models may work but are not fully supported.
302
+
See <Btn>Google Vertex AI (Anthropic)</Btn> below if using Anthropic models
252
303
</WarningBox>
253
304
254
-
- `CUBE_VERTEX_MODEL_ID`- A supported GCP Vertex chat model, for example `claude-3-5-sonnet@20240620`
255
-
- `CUBE_VERTEX_PROJECT_ID`- The GCP project the model is deployed in
256
-
- `CUBE_VERTEX_REGION`- The GCP region the model is deployed in, for example `us-east5`
257
-
- `CUBE_VERTEX_CREDENTIALS`- The private key for a service account with permissions to run the chosen model
305
+
- `CUBE_CLOUD_AI_GOOGLE_VERTEX_PROJECT`
306
+
- `CUBE_CLOUD_AI_GOOGLE_VERTEX_LOCATION`
307
+
- `CUBE_CLOUD_AI_GOOGLE_VERTEX_CREDENTIALS`
308
+
- `CUBE_CLOUD_AI_GOOGLE_VERTEX_PUBLISHER`- defaults to `google`; change if using another publisher (optional)
0 commit comments