Skip to content

Commit 73edb95

Browse files
authored
Merge branch 'main' into chore/no-overload-inherited-properties
2 parents 68af4bb + 1d91d74 commit 73edb95

25 files changed

+625
-70
lines changed

output/openapi/elasticsearch-openapi.json

Lines changed: 103 additions & 9 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

output/openapi/elasticsearch-serverless-openapi.json

Lines changed: 103 additions & 9 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

output/schema/schema.json

Lines changed: 162 additions & 42 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

output/typescript/types.ts

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

specification/inference/_types/CommonTypes.ts

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1445,6 +1445,8 @@ export class GoogleVertexAIServiceSettings {
14451445
* If `streaming_url` is not provided, `url` is also used for streaming `completion` and `chat_completion`.
14461446
* If `provider` is not provided or set to `google` (Google Vertex AI), do not set `url` (or `streaming_url`).
14471447
* At least one of `url` or `streaming_url` must be provided for Google Model Garden endpoint usage.
1448+
* Certain providers require separate URLs for streaming and non-streaming operations (e.g., Anthropic, Mistral, AI21). Others support both operation types through a single URL (e.g., Meta, Hugging Face).
1449+
* Information on constructing the URL for various providers can be found in the Google Model Garden documentation for the model, or on the endpoint’s `Sample request` page. The request examples also illustrate the proper formatting for the `url`.
14481450
*/
14491451
url?: string
14501452
/**
@@ -1453,6 +1455,8 @@ export class GoogleVertexAIServiceSettings {
14531455
* If `url` is not provided, `streaming_url` is also used for non-streaming `completion` requests.
14541456
* If `provider` is not provided or set to `google` (Google Vertex AI), do not set `streaming_url` (or `url`).
14551457
* At least one of `streaming_url` or `url` must be provided for Google Model Garden endpoint usage.
1458+
* Certain providers require separate URLs for streaming and non-streaming operations (e.g., Anthropic, Mistral, AI21). Others support both operation types through a single URL (e.g., Meta, Hugging Face).
1459+
* Information on constructing the URL for various providers can be found in the Google Model Garden documentation for the model, or on the endpoint’s `Sample request` page. The request examples also illustrate the proper formatting for the `streaming_url`.
14561460
*/
14571461
streaming_url?: string
14581462
/**
@@ -1497,7 +1501,11 @@ export class GoogleVertexAIServiceSettings {
14971501

14981502
export enum GoogleModelGardenProvider {
14991503
google,
1500-
anthropic
1504+
anthropic,
1505+
meta,
1506+
hugging_face,
1507+
mistral,
1508+
ai21
15011509
}
15021510

15031511
export class GoogleVertexAITaskSettings {
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
summary: A chat_completion task for Google Model Garden Meta shared endpoint with single streaming URL provided
2+
description: Run `PUT _inference/chat_completion/google_model_garden_meta_chat_completion` to create an inference endpoint to perform a `chat_completion` task using Meta's model hosted on Google Model Garden shared endpoint with single streaming URL provided. See the endpoint's `Sample request` page for the variable values used in the URL.
3+
method_request: 'PUT _inference/chat_completion/google_model_garden_meta_chat_completion'
4+
# type: "request"
5+
value: |-
6+
{
7+
"service": "googlevertexai",
8+
"service_settings": {
9+
"provider": "meta",
10+
"service_account_json": "service-account-json",
11+
"streaming_url": "https://%LOCATION_ID%-aiplatform.googleapis.com/v1/projects/%PROJECT_ID%/locations/%LOCATION_ID%/endpoints/%ENDPOINT_ID%/chat/completions"
12+
}
13+
}
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
summary: A completion task for Google Model Garden Hugging Face dedicated endpoint with single URL provided for both streaming and non-streaming tasks
2+
description: Run `PUT _inference/completion/google_model_garden_hugging_face_completion` to create an inference endpoint to perform a `completion` task using Hugging Face's model hosted on Google Model Garden dedicated endpoint with single URL provided for both streaming and non-streaming tasks. See the endpoint's `Sample request` page for the variable values used in the URL.
3+
method_request: 'PUT _inference/completion/google_model_garden_hugging_face_completion'
4+
# type: "request"
5+
value: |-
6+
{
7+
"service": "googlevertexai",
8+
"service_settings": {
9+
"provider": "hugging_face",
10+
"service_account_json": "service-account-json",
11+
"url": "https://%ENDPOINT_ID%.%LOCATION_ID%-%PROJECT_ID%.prediction.vertexai.goog/v1/projects/%PROJECT_ID%/locations/%LOCATION_ID%/endpoints/%ENDPOINT_ID%/chat/completions"
12+
}
13+
}
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
summary: A chat_completion task for Google Model Garden Hugging Face dedicated endpoint with single streaming URL provided
2+
description: Run `PUT _inference/chat_completion/google_model_garden_hugging_face_chat_completion` to create an inference endpoint to perform a `chat_completion` task using Hugging Face's model hosted on Google Model Garden dedicated endpoint with single streaming URL provided. See the endpoint's `Sample request` page for the variable values used in the URL.
3+
method_request: 'PUT _inference/chat_completion/google_model_garden_hugging_face_chat_completion'
4+
# type: "request"
5+
value: |-
6+
{
7+
"service": "googlevertexai",
8+
"service_settings": {
9+
"provider": "hugging_face",
10+
"service_account_json": "service-account-json",
11+
"streaming_url": "https://%ENDPOINT_ID%.%LOCATION_ID%-%PROJECT_ID%.prediction.vertexai.goog/v1/projects/%PROJECT_ID%/locations/%LOCATION_ID%/endpoints/%ENDPOINT_ID%/chat/completions"
12+
}
13+
}
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
summary: A completion task for Google Model Garden Hugging Face shared endpoint with single URL provided for both streaming and non-streaming tasks
2+
description: Run `PUT _inference/completion/google_model_garden_hugging_face_completion` to create an inference endpoint to perform a `completion` task using Hugging Face's model hosted on Google Model Garden shared endpoint with single URL provided for both streaming and non-streaming tasks. See the endpoint's `Sample request` page for the variable values used in the URL.
3+
method_request: 'PUT _inference/completion/google_model_garden_hugging_face_completion'
4+
# type: "request"
5+
value: |-
6+
{
7+
"service": "googlevertexai",
8+
"service_settings": {
9+
"provider": "hugging_face",
10+
"service_account_json": "service-account-json",
11+
"url": "https://%LOCATION_ID%-aiplatform.googleapis.com/v1/projects/%PROJECT_ID%/locations/%LOCATION_ID%/endpoints/%ENDPOINT_ID%/chat/completions"
12+
}
13+
}
Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
summary: A chat_completion task for Google Model Garden Hugging Face shared endpoint with single streaming URL provided
2+
description: Run `PUT _inference/chat_completion/google_model_garden_hugging_face_chat_completion` to create an inference endpoint to perform a `chat_completion` task using Hugging Face's model hosted on Google Model Garden shared endpoint with single streaming URL provided. See the endpoint's `Sample request` page for the variable values used in the URL.
3+
method_request: 'PUT _inference/chat_completion/google_model_garden_hugging_face_chat_completion'
4+
# type: "request"
5+
value: |-
6+
{
7+
"service": "googlevertexai",
8+
"service_settings": {
9+
"provider": "hugging_face",
10+
"service_account_json": "service-account-json",
11+
"streaming_url": "https://%LOCATION_ID%-aiplatform.googleapis.com/v1/projects/%PROJECT_ID%/locations/%LOCATION_ID%/endpoints/%ENDPOINT_ID%/chat/completions"
12+
}
13+
}

0 commit comments

Comments
 (0)