Commit 4f99432

Merge pull request #6104 from MicrosoftDocs/main
Auto Publish – main to live - 2025-07-17 17:10 UTC
2 parents 96503a9 + 230afae commit 4f99432

21 files changed

+365
-215
lines changed

articles/ai-foundry/openai/concepts/models.md

Lines changed: 1 addition & 41 deletions
Original file line number | Diff line number | Diff line change
@@ -26,7 +26,6 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
2626
| [GPT-4.1 series](#gpt-41-series) | Latest model release from Azure OpenAI |
2727
| [model-router](#model-router) | A model that intelligently selects from a set of underlying chat models to respond to a given prompt. |
2828
| [computer-use-preview](#computer-use-preview) | An experimental model trained for use with the Responses API computer use tool. |
29-
| [GPT-4.5 Preview](#gpt-45-preview) |The latest GPT model that excels at diverse text and image tasks. |
3029
| [o-series models](#o-series-models) |[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
3130
| [GPT-4o & GPT-4o mini & GPT-4 Turbo](#gpt-4o-and-gpt-4-turbo) | The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
3231
| [GPT-4](#gpt-4) | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
@@ -98,24 +97,6 @@ Once access has been granted, you will need to create a deployment for the model
9897
| --- | :--- |:--- |:---|:---: |
9998
| `computer-use-preview` (2025-03-11) | Specialized model for use with the [Responses API](../how-to/responses.md) computer use tool <br> <br>-Tools <br>-Streaming<br>-Text(input/output)<br>- Image(input) | 8,192 | 1,024 | October 2023 |
10099

101-
102-
## GPT-4.5 Preview
103-
104-
### Region availability
105-
106-
| Model | Region |
107-
|---|---|
108-
| `gpt-4.5-preview` | See the [models table](#model-summary-table-and-region-availability).|
109-
110-
### Capabilities
111-
112-
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
113-
| --- | :--- |:--- |:---|:---: |
114-
| `gpt-4.5-preview` (2025-02-27) <br> **GPT-4.5 Preview** | [GPT 4.1](#gpt-41-series) is the recommended replacement for this model. Excels at diverse text and image tasks. <br>- Structured outputs <br>- Prompt caching <br>- Tools <br>- Streaming<br>- Text(input/output)<br>- Image(input) | 128,000 | 16,384 | October 2023 |
115-
116-
> [!NOTE]
117-
> It is expected behavior that the model cannot answer questions about itself. If you want to know when the knowledge cutoff for the model's training data is, or other details about the model you should refer to the model documentation above.
118-
119100
## o-series models
120101

121102
The Azure OpenAI o<sup>&#42;</sup> series models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
@@ -173,12 +154,6 @@ GPT-4 Turbo is a large multimodal model (accepting text or image inputs and gene
173154

174155
GPT-4 is the predecessor to GPT-4 Turbo. Both the GPT-4 and GPT-4 Turbo models have a base model name of `gpt-4`. You can distinguish between the GPT-4 and Turbo models by examining the model version.
175156

176-
- `gpt-4` **Version** `0314`
177-
- `gpt-4` **Version** `0613`
178-
- `gpt-4-32k` **Version** `0613`
179-
180-
You can see the token context length supported by each model in the [model summary table](#model-summary-table-and-region-availability).
181-
182157
## GPT-4 and GPT-4 Turbo models
183158

184159
- These models can only be used with the Chat Completion API.
@@ -192,10 +167,6 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
192167
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | October 2023 |
193168
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| October 2023 |
194169
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
195-
| `gpt-4-32k` (0613) | **Older GA model** <br> - Basic function calling with tools | 32,768 | Sep 2021 |
196-
| `gpt-4` (0613) | **Older GA model** <br> - Basic function calling with tools | 8,192 | Sep 2021 |
197-
| `gpt-4-32k`(0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 32,768 | Sep 2021 |
198-
| `gpt-4` (0314) | **Older GA model** <br> - [Retirement information](./model-retirements.md#current-models) | 8,192 | Sep 2021 |
199170

200171
> [!CAUTION]
201172
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to either future preview versions or to the latest stable GA version. Models that are designated preview don't follow the standard Azure OpenAI model lifecycle.
@@ -335,9 +306,7 @@ All deployments can perform the exact same inference operations, however the bil
335306
[!INCLUDE [Standard Global](../includes/model-matrix/standard-global.md)]
336307

337308
> [!NOTE]
338-
> `o1-mini` is currently available to all customers for global standard deployment.
339-
>
340-
> Select customers were granted standard (regional) deployment access to `o1-mini` as part of the `o1-preview` limited access release. At this time access to `o1-mini` standard (regional) deployments is not being expanded.
309+
> `o3-deep-research` is currently only available with Azure AI Foundry Agent Service. To learn more, see the [Deep Research tool guidance](/azure/ai-foundry/agents/how-to/tools/deep-research).
341310
342311
# [Global Provisioned Managed](#tab/global-ptum)
343312

@@ -416,15 +385,6 @@ This table doesn't include fine-tuning regional availability information. Consu
416385
417386
### GPT-4 and GPT-4 Turbo model availability
418387

419-
#### Select customer access
420-
421-
In addition to the regions above which are available to all Azure OpenAI customers, some select preexisting customers have been granted access to versions of GPT-4 in additional regions:
422-
423-
| Model | Region |
424-
|---|:---|
425-
| `gpt-4` (0314) <br> `gpt-4-32k` (0314) | East US <br> France Central <br> South Central US <br> UK South |
426-
| `gpt-4` (0613) <br> `gpt-4-32k` (0613) | East US <br> East US 2 <br> Japan East <br> UK South |
427-
428388
### GPT-3.5 models
429389

430390
See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI handles model version upgrades, and [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-3.5 Turbo deployments.

articles/ai-foundry/openai/how-to/fine-tune-test.md

Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change
@@ -52,7 +52,7 @@ deploy_params = {'api-version': "2024-10-21"}
5252
deploy_headers = {'Authorization': 'Bearer {}'.format(token), 'Content-Type': 'application/json'}
5353

5454
deploy_data = {
55-
"sku": {"name": "developer", "capacity": 50},
55+
"sku": {"name": "developertier", "capacity": 50},
5656
"properties": {
5757
"model": {
5858
"format": "OpenAI",
@@ -94,7 +94,7 @@ curl -X POST "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resource
9494
-H "Authorization: Bearer <TOKEN>" \
9595
-H "Content-Type: application/json" \
9696
-d '{
97-
"sku": {"name": "developer", "capacity": 50},
97+
"sku": {"name": "developertier", "capacity": 50},
9898
"properties": {
9999
"model": {
100100
"format": "OpenAI",
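
Both hunks above change the deployment SKU name from `developer` to `developertier`. A minimal sketch of the resulting request body follows. The helper `build_deploy_body`, the placeholder model name, and the `name` and `version` fields under `model` are assumptions for illustration, since the diff shows only the `sku` entry and `"format": "OpenAI"`.

```python
import json


def build_deploy_body(model_name: str, model_version: str, capacity: int = 50) -> dict:
    """Hypothetical helper: assemble a deployment body using the updated SKU name."""
    return {
        # SKU name and capacity as they appear on the added lines of the diff.
        "sku": {"name": "developertier", "capacity": capacity},
        "properties": {
            "model": {
                "format": "OpenAI",
                # Assumed fields: the diff is cut off after "format".
                "name": model_name,
                "version": model_version,
            }
        },
    }


# Placeholder values, not taken from the commit.
body = build_deploy_body("<FINE_TUNED_MODEL>", "1")
print(json.dumps(body, indent=2))
```

Keeping the SKU name in a single helper means a rename like this one touches one line rather than every call site.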

articles/ai-foundry/openai/how-to/function-calling.md

Lines changed: 0 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,6 @@ At a high level you can break down working with functions into three steps:
3939
* `gpt-4o` (`2024-08-06`)
4040
* `gpt-4o` (`2024-11-20`)
4141
* `gpt-4o-mini` (`2024-07-18`)
42-
* `gpt-4.5-preview` (`2025-02-27`)
4342
* `gpt-4.1` (`2025-04-14`)
4443
* `gpt-4.1-mini` (`2025-04-14`)
4544

@@ -55,10 +54,6 @@ Support for parallel function was first added in API version [`2023-12-01-previe
5554
* `gpt-4.1-nano` (`2025-04-14`)
5655
* `o3-mini` (`2025-01-31`)
5756
* `o1` (`2024-12-17`)
58-
* `gpt-4` (`0613`)
59-
* `gpt-4-32k` (`0613`)
60-
* `gpt-35-turbo-16k` (`0613`)
61-
* `gpt-35-turbo` (`0613`)
6257

6358
> [!NOTE]
6459
> The `tool_choice` parameter is now supported with `o3-mini` and `o1`. For more information on what parameters are supported with the o-series models see, the [reasoning models guide](./reasoning.md).

articles/ai-foundry/openai/how-to/structured-outputs.md

Lines changed: 0 additions & 1 deletion
Original file line number | Diff line number | Diff line change
@@ -27,7 +27,6 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
2727

2828
- `codex-mini` version `2025-05-16`
2929
- `o3-pro` version `2025-06-10`
30-
- `gpt-4.5-preview` version `2025-02-27`
3130
- `o3-mini` version `2025-01-31`
3231
- `o1` version: `2024-12-17`
3332
- `gpt-4o-mini` version: `2024-07-18`
