You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|[`codex-mini`](#o-series-models)| Fine-tuned version of o4-mini. |
27
28
|[`GPT-4.1 series`](#gpt-41-series)| The latest model release from Azure OpenAI. |
@@ -35,6 +36,28 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
35
36
|[`Image generation`](#image-generation-models)| A series of models that can generate original images from natural language. |
36
37
|[`Audio`](#audio-models)| A series of models for speech to text, translation, and text to speech. GPT-4o audio models support either low latency *speech in, speech out* conversational interactions or audio generation. |
37
38
39
+
## GPT-5
40
+
41
+
### Region availability
42
+
43
+
| Model | Region |
44
+
|---|---|
45
+
|`gpt-5` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
46
+
|`gpt-5-mini` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
47
+
|`gpt-5-nano` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
48
+
|`gpt-5-chat` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard)|
49
+
50
+
**[Registration is required for access to GPT-5 series models](https://aka.ms/oai/gpt5access).**
51
+
52
+
Access will be granted based on Microsoft's eligibility criteria. Customers who previously applied and received access to `o3`, don't need to reapply as their approved subscriptions will automatically be granted access upon model release.
53
+
54
+
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
@@ -108,13 +131,15 @@ A model that intelligently selects from a set of underlying chat models to respo
108
131
109
132
| Model | Region |
110
133
|---|---|
111
-
|`model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
134
+
|`model-router` (2025-08-07) | East US 2 (Global Standard), Sweden Central (Global Standard) |
135
+
|`model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard) |
112
136
113
137
### Capabilities
114
138
115
139
| Model ID | Description | Context window | Max output tokens | Training data (up to) |
116
140
| --- | :--- |:--- |:---|:---: |
117
-
|`model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (GPT-4.1 series)</br> 100,000 (o4-mini) | May 31, 2024 |
141
+
|`model-router` (2025-08-07) | A model that intelligently selects from a set of underlying models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`)</br> 128,000 (`gpt-5 reasoning models`) <br> 16,000 (`gpt-5-chat`) | - |
142
+
|`model-router` (2025-05-19) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. | 200,000 | 32,768 (`GPT-4.1 series`)</br> 100,000 (`o4-mini`) | May 31, 2024 |
118
143
119
144
Larger context windows are compatible with *some* of the underlying models. That means an API call with a larger context succeeds only if the prompt happens to be routed to the right model. Otherwise, the call fails.
0 commit comments