Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
21 changes: 21 additions & 0 deletions openapi.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -16557,6 +16557,27 @@ components:
default: false
stream_options:
$ref: "#/components/schemas/ChatCompletionStreamOptions"
thinking:
type: object
nullable: true
description: |
View the thinking/reasoning tokens as part of your response. Thinking models produce a long internal chain of thought before generating a response. Supported only for specific Claude models on Anthropic, Google Vertex AI, and AWS Bedrock. Requires setting `strict_openai_compliance = false` in your API call.
properties:
type:
type: string
enum: ["enabled", "disabled"]
description: Enables or disables the thinking mode capability.
default: "disabled"
budget_tokens:
type: integer
description: |
The maximum number of tokens to allocate for the thinking process.
A higher token budget allows for more thorough reasoning but may increase overall response time.
minimum: 1
example: 2030
required:
- type
example: { "type": "enabled", "budget_tokens": 2030 }
temperature:
type: number
minimum: 0
Expand Down
Loading