Skip to content

Commit 8ab68b2

Browse files
committed
Adds service_tier param to chat/completions
1 parent 5fe14f6 commit 8ab68b2

File tree

1 file changed

+18
-1
lines changed

1 file changed

+18
-1
lines changed

openapi.yaml

Lines changed: 18 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ openapi: 3.0.0
22
info:
33
title: OpenAI API
44
description: The OpenAI REST API. Please see https://platform.openai.com/docs/api-reference for more details.
5-
version: "2.0.0"
5+
version: "2.1.0"
66
termsOfService: https://openai.com/policies/terms-of-use
77
contact:
88
name: OpenAI Support
@@ -7206,6 +7206,17 @@ components:
72067206
If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result.
72077207

72087208
Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend.
7209+
service_level:
7210+
description: |
7211+
Specifies the latency tier to use for processing the request. This parameter is relevant for customers subscribed to the scale tier service:
7212+
- If set to 'auto', the system will utilize scale tier credits until they are exhausted.
7213+
- If set to 'default', the request will be processed in the shared cluster.
7214+
7215+
When this parameter is set, the response body will include the `service_tier` utilized.
7216+
type: string
7217+
enum: ["auto", "default"]
7218+
nullable: true
7219+
default: null
72097220
stop:
72107221
description: &completions_stop_description >
72117222
Up to 4 sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence.
@@ -8066,6 +8077,12 @@ components:
80668077
model:
80678078
type: string
80688079
description: The model used for the chat completion.
8080+
scale_tier:
8081+
description: The service tier used for processing the request. This field is only included if the `service_tier` parameter is specified in the request.
8082+
type: string
8083+
enum: ["scale", "default"]
8084+
example: "scale"
8085+
nullable: true
80698086
system_fingerprint:
80708087
type: string
80718088
description: |

0 commit comments

Comments
 (0)