Skip to content

Commit 0f5de60

Browse files
authored
Merge pull request #282 from openai/dev/zhang/fix
Fixes documentation error
2 parents 2d3212f + c5cc333 commit 0f5de60

File tree

1 file changed

+18
-12
lines changed

1 file changed

+18
-12
lines changed

openapi.yaml

Lines changed: 18 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -7206,17 +7206,6 @@ components:
72067206
If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result.
72077207

72087208
Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend.
7209-
service_level:
7210-
description: |
7211-
Specifies the latency tier to use for processing the request. This parameter is relevant for customers subscribed to the scale tier service:
7212-
- If set to 'auto', the system will utilize scale tier credits until they are exhausted.
7213-
- If set to 'default', the request will be processed in the shared cluster.
7214-
7215-
When this parameter is set, the response body will include the `service_tier` utilized.
7216-
type: string
7217-
enum: ["auto", "default"]
7218-
nullable: true
7219-
default: null
72207209
stop:
72217210
description: &completions_stop_description >
72227211
Up to 4 sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence.
@@ -7936,6 +7925,17 @@ components:
79367925
Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend.
79377926
x-oaiMeta:
79387927
beta: true
7928+
service_tier:
7929+
description: |
7930+
Specifies the latency tier to use for processing the request. This parameter is relevant for customers subscribed to the scale tier service:
7931+
- If set to 'auto', the system will utilize scale tier credits until they are exhausted.
7932+
- If set to 'default', the request will be processed in the shared cluster.
7933+
7934+
When this parameter is set, the response body will include the `service_tier` utilized.
7935+
type: string
7936+
enum: ["auto", "default"]
7937+
nullable: true
7938+
default: null
79397939
stop:
79407940
description: |
79417941
Up to 4 sequences where the API will stop generating further tokens.
@@ -8077,7 +8077,7 @@ components:
80778077
model:
80788078
type: string
80798079
description: The model used for the chat completion.
8080-
scale_tier:
8080+
service_tier:
80818081
description: The service tier used for processing the request. This field is only included if the `service_tier` parameter is specified in the request.
80828082
type: string
80838083
enum: ["scale", "default"]
@@ -8259,6 +8259,12 @@ components:
82598259
model:
82608260
type: string
82618261
description: The model to generate the completion.
8262+
service_tier:
8263+
description: The service tier used for processing the request. This field is only included if the `service_tier` parameter is specified in the request.
8264+
type: string
8265+
enum: ["scale", "default"]
8266+
example: "scale"
8267+
nullable: true
82628268
system_fingerprint:
82638269
type: string
82648270
description: |

0 commit comments

Comments
 (0)