articles/ai-foundry/openai/concepts/models.md
Lines changed: 10 additions & 22 deletions
@@ -1,7 +1,7 @@
 ---
 title: Azure OpenAI in Azure AI Foundry Models
 titleSuffix: Azure OpenAI
-description: Learn about the different model capabilities that are available with Azure OpenAI.
+description: Learn about the model capabilities that are available with Azure OpenAI.
 author: mrbullwinkle #ChrisHMSFT
 ms.author: mbullwin #chrhoder#
 manager: nitinme
@@ -38,8 +38,6 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`gpt-4.1` (2025-04-14) | See the [models table](#model-summary-table-and-region-availability). |
@@ -49,17 +47,17 @@ For region availability, see the following table:
 ### Capabilities
 
 > [!IMPORTANT]
-> There's a known issue that is affecting all GPT 4.1 series models. Large tool or function call definitions that exceed 300,000 tokens will result in failures, even though the 1 million token context limit of the models hasn't been reached.
+> A known issue is affecting all GPT 4.1 series models. Large tool or function call definitions that exceed 300,000 tokens will result in failures, even though the 1 million token context limit of the models wasn't reached.
 >
 > The errors can vary based on API call and underlying payload characteristics.
 >
-> See the following error messages for the Chat Completions API:
+> Here are the error messages for the Chat Completions API:
 >
 > -`Error code: 400 - {'error': {'message': "This model's maximum context length is 300000 tokens. However, your messages resulted in 350564 tokens (100 in the messages, 350464 in the functions). Please reduce the length of the messages or functions.", 'type': 'invalid_request_error', 'param': 'messages', 'code': 'context_length_exceeded'}}`
 >
 > -`Error code: 400 - {'error': {'message': "Invalid 'tools[0].function.description': string too long. Expected a string with maximum length 1048576, but got a string with length 2778531 instead.", 'type': 'invalid_request_error', 'param': 'tools[0].function.description', 'code': 'string_above_max_length'}}`
 >
-> See the following error message for the Responses API:
+> Here's the error message for the Responses API:
 >
 > -`Error code: 500 - {'error': {'message': 'The server had an error processing your request. Sorry about that! You can retry your request, or contact us through an Azure support request at: https://go.microsoft.com/fwlink/?linkid=2213926 if you keep seeing this error. (Please include the request ID d2008353-291d-428f-adc1-defb5d9fb109 in your email.)', 'type': 'server_error', 'param': None, 'code': None}}`
 
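The known issue in this hunk suggests checking tool definitions before sending a request. The following is a minimal sketch of such a preflight check, not an official API: the two limits come from the error messages quoted in the hunk, while `estimate_tokens`, `check_tools`, and the four-characters-per-token ratio are illustrative assumptions. A real tokenizer would give more accurate counts.

```python
import json

# Limits taken from the error messages quoted in the hunk above.
MAX_TOOL_TOKENS = 300_000          # token budget mentioned in the known issue
MAX_DESCRIPTION_CHARS = 1_048_576  # maximum length reported for a function description

# Assumption: roughly 4 characters per token. This is a crude heuristic,
# not the model's tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Return a rough token estimate for text (hypothetical helper)."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def check_tools(tools: list[dict]) -> list[str]:
    """Return human-readable problems found in the tool definitions."""
    problems = []
    for i, tool in enumerate(tools):
        description = tool.get("function", {}).get("description", "")
        if len(description) > MAX_DESCRIPTION_CHARS:
            problems.append(
                f"tools[{i}].function.description has {len(description)} "
                f"characters (maximum {MAX_DESCRIPTION_CHARS})"
            )
    # Estimate the size of all definitions as they would be serialized.
    total = estimate_tokens(json.dumps(tools))
    if total > MAX_TOOL_TOKENS:
        problems.append(
            f"tool definitions are about {total} tokens "
            f"(known issue above {MAX_TOOL_TOKENS})"
        )
    return problems
```

An empty list means neither limit was hit under these assumptions. The check only covers tool definitions; the first quoted error counts message tokens toward the same total.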
@@ -77,8 +75,6 @@ A model that intelligently selects from a set of underlying chat models to respo
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`model-router` (2025-05-19) | East US 2 (Global Standard), Sweden Central (Global Standard)|
@@ -104,12 +100,10 @@ It can be used with third-party libraries to allow the model to control mouse an
 
 Registration is required to access `computer-use-preview`. Access is granted based on Microsoft's eligibility criteria. Customers who have access to other limited access models still need to request access for this model.
 
-To request access, go to [`computer-use-preview` limited access model application](https://aka.ms/oai/cuaaccess). Once access is granted, you need to create a deployment for the model.
+To request access, go to [`computer-use-preview` limited access model application](https://aka.ms/oai/cuaaccess). When access is granted, you need to create a deployment for the model.
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`computer-use-preview`| See the [models table](#model-summary-table-and-region-availability). |
@@ -141,8 +135,6 @@ To learn more about advanced o-series models, see [Getting started with reasonin
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`codex-mini`| East US2 & Sweden Central (Global Standard). |
@@ -212,7 +204,7 @@ To learn more about how to interact with GPT-3.5 Turbo and the Chat Completions
 
 ## Embeddings
 
-`text-embedding-3-large` is the latest and most capable embedding model. You can't upgrade between embeddings models. In order to move from using `text-embedding-ada-002` to `text-embedding-3-large`, you would need to generate new embeddings.
+`text-embedding-3-large` is the latest and most capable embedding model. You can't upgrade between embeddings models. To move from using `text-embedding-ada-002` to `text-embedding-3-large`, you need to generate new embeddings.
 
 -`text-embedding-3-large`
 -`text-embedding-3-small`
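Because the text in this hunk says embeddings must be regenerated when moving between models, a migration amounts to running the original source texts through the new model again. The sketch below illustrates that loop under stated assumptions: `reembed` and its `embed` parameter are hypothetical names, and `embed` stands in for whatever client call returns one vector per input text from the new deployment.

```python
from typing import Callable, Sequence

Vector = list[float]


def reembed(
    texts: Sequence[str],
    embed: Callable[[Sequence[str]], list[Vector]],
    batch_size: int = 16,
) -> list[Vector]:
    """Generate fresh vectors for every source text with the new model.

    `embed` is a hypothetical callable that wraps the embedding request
    and returns one vector per input text, in the same order.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    vectors: list[Vector] = []
    for start in range(0, len(texts), batch_size):
        batch = texts[start:start + batch_size]
        result = embed(batch)
        # Guard against a wrapper that drops or duplicates items.
        if len(result) != len(batch):
            raise ValueError("embed returned a different number of vectors than inputs")
        vectors.extend(result)
    return vectors
```

The vectors stored from the old model are not reused anywhere in this loop; only the source texts are, which is why they need to be kept available for a migration.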
@@ -235,12 +227,10 @@ The image generation models generate images from text prompts that the user prov
 
 Registration is required to access `gpt-image-1`. Access is granted based on Microsoft's eligibility criteria. Customers who have access to other limited access models still need to request access for this model.
 
-To request access, go to [`gpt-image-1` limited access model application](https://aka.ms/oai/gptimage1access). Once access is granted, you need to create a deployment for the model.
+To request access, go to [`gpt-image-1` limited access model application](https://aka.ms/oai/gptimage1access). When access is granted, you need to create a deployment for the model.
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`dall-e-3`| East US<br>Australia East<br>Sweden Central|
@@ -252,8 +242,6 @@ Sora is an AI model from OpenAI that can create realistic and imaginative video
 
 ### Region availability
 
-For region availability, see the following table:
-
 | Model | Region |
 |---|---|
 |`sora`| East US 2|
@@ -407,7 +395,7 @@ This table doesn't include fine-tuning regional availability information. Consul
 
 ### GPT-3.5 models
 
-See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI handles model version upgrades. See [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-3.5 Turbo deployments.
+To learn about how Azure OpenAI handles model version upgrades, see [Model versions](../concepts/model-versions.md). To learn how to view and configure the model version settings of your GPT-3.5 Turbo deployments, see [Working with models](../how-to/working-with-models.md).
 
 # [Embeddings](#tab/standard-embeddings)
 
@@ -418,7 +406,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
 These models can be used only with Embedding API requests.
 
 > [!NOTE]
-> `text-embedding-3-large` is the latest and most capable embedding model. You can't upgrade between embedding models. In order to migrate from using `text-embedding-ada-002` to `text-embedding-3-large`, you need to generate new embeddings.
+> `text-embedding-3-large` is the latest and most capable embedding model. You can't upgrade between embedding models. To migrate from using `text-embedding-ada-002` to `text-embedding-3-large`, you need to generate new embeddings.
 
 | Model ID | Max request (tokens) | Output dimensions |Training data (up to)
 |---|---| :---:|:---:|:---:|
@@ -498,4 +486,4 @@ For the latest information on model retirements, refer to the [model retirement
 -[Model retirement and deprecation](./model-retirements.md)
 -[Learn more about working with Azure OpenAI models](../how-to/working-with-models.md)
 -[Learn more about Azure OpenAI](../overview.md)
--[Learn more about finetuning Azure OpenAI models](../how-to/fine-tuning.md)
+-[Learn more about fine-tuning Azure OpenAI models](../how-to/fine-tuning.md)