> The latest version of model router is currently limited access only. You can request access using the `gpt-5 access` form: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required.
+|Model router version|Underlying models| Model version
|[`codex-mini`](#o-series-models)| Fine-tuned version of o4-mini. |
|[`GPT-4.1 series`](#gpt-41-series)| The latest model release from Azure OpenAI. |
@@ -35,6 +36,30 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
|[`Image generation`](#image-generation-models)| A series of models that can generate original images from natural language. |
|[`Audio`](#audio-models)| A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |

+## GPT-5
+
+### Region availability
+
+| Model | Region |
+|---|---|
+|`gpt-5` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+|`gpt-5-mini` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+|`gpt-5-nano` (2025-08-07) | East US 2 (Global Standard & Data Zones), Sweden Central (Global Standard & Data Zones)|
+|`gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+
+- **[Registration is required for access to the gpt-5 model](https://aka.ms/oai/gpt5access).**
+
+- `gpt-5-mini`, `gpt-5-nano`, and `gpt-5-chat` do not require registration.
+
+Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied for and received access to `o3` don't need to reapply, because their approved subscriptions will automatically be granted access upon model release.
+
+| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
+| --- | :--- |:--- |:---|:---: |
+|`gpt-5` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | October 24, 2024 |
+|`gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | June 24, 2024 |
+|`gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md). | 272,000 | 128,000 | May 31, 2024 |
+|`gpt-5-chat` (2025-08-07)<br>**Preview**| - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Text only. | 128,000 | 16,384 | October 24, 2024 |
+
## gpt-oss

### Region availability
@@ -108,13 +133,15 @@ A model that intelligently selects from a set of underlying chat models to respo
| Model | Region |
|---|---|
-|`model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
+|`model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
+|`model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |

### Capabilities

| Model ID | Description | Context window | Max output tokens | Training data (up to) |
| --- | :--- |:--- |:---|:---: |
-|`model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
+|`model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) </br> 16,384 (`gpt-5-chat`) | - |
+|`model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |

Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
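
One way to plan around this routing behavior is to size requests for the most restrictive underlying model. The sketch below applies that idea to the output limits listed in the capabilities table for `model-router` (2025-08-07). The dictionary keys and the `conservative_output_cap` and `fits_everywhere` helpers are hypothetical names, and the sketch assumes (this is not confirmed by the table) that a request capped at the smallest limit is accepted whichever model handles it.

```python
# Max output tokens per underlying model family, copied from the
# capabilities table for model-router (2025-08-07). The keys are
# illustrative labels, not deployment names.
MAX_OUTPUT_TOKENS = {
    "gpt-4.1-series": 32_768,
    "o4-mini": 100_000,
    "gpt-5-reasoning": 128_000,
    "gpt-5-chat": 16_384,
}


def conservative_output_cap(limits: dict[str, int]) -> int:
    """Return the smallest output limit across all candidate models."""
    if not limits:
        raise ValueError("limits must not be empty")
    return min(limits.values())


def fits_everywhere(requested: int, limits: dict[str, int]) -> bool:
    """True if the requested output size is within every model's limit."""
    return requested <= conservative_output_cap(limits)


if __name__ == "__main__":
    print(conservative_output_cap(MAX_OUTPUT_TOKENS))
```

Capping at the minimum trades some output headroom for predictability. A caller that needs longer outputs would have to accept that some routed requests may be rejected.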
articles/ai-foundry/openai/how-to/function-calling.md: 3 additions & 0 deletions
@@ -41,6 +41,9 @@ At a high level you can break down working with functions into three steps:
* `gpt-4o-mini` (`2024-07-18`)
* `gpt-4.1` (`2025-04-14`)
* `gpt-4.1-mini` (`2025-04-14`)
+* `gpt-5` (`2025-08-07`)
+* `gpt-5-mini` (`2025-08-07`)
+* `gpt-5-nano` (`2025-08-07`)

Support for parallel function calling was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json).
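
With parallel function calling, a single assistant message can carry several tool calls, and each one needs its own reply. A minimal sketch of that dispatch step follows, using plain dictionaries shaped like the `tool_calls` entries of a Chat Completions response. The `get_weather` and `get_time` functions, the `run_tool_calls` helper, and the sample payload are made up for illustration, and no request is sent to the service.

```python
import json


# Hypothetical local tools; a real app would call actual services here.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"


def get_time(city: str) -> str:
    return f"12:00 in {city}"


TOOLS = {"get_weather": get_weather, "get_time": get_time}


def run_tool_calls(tool_calls: list[dict]) -> list[dict]:
    """Run every requested tool and build one `tool` message per call."""
    messages = []
    for call in tool_calls:
        fn = TOOLS[call["function"]["name"]]
        # Arguments arrive as a JSON-encoded string.
        args = json.loads(call["function"]["arguments"])
        messages.append(
            {
                "role": "tool",
                "tool_call_id": call["id"],  # ties the result to its call
                "content": fn(**args),
            }
        )
    return messages


# Invented sample payload: two calls returned in one model turn.
sample_calls = [
    {"id": "call_1", "type": "function",
     "function": {"name": "get_weather", "arguments": '{"city": "Paris"}'}},
    {"id": "call_2", "type": "function",
     "function": {"name": "get_time", "arguments": '{"city": "Tokyo"}'}},
]

if __name__ == "__main__":
    for message in run_tool_calls(sample_calls):
        print(message)
```

The resulting messages would be appended to the conversation before the next request, so the model can see every result.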