Skip to content

Commit a4891a6

Browse files
authored
Merge pull request #273866 from MicrosoftDocs/release-april-2024-openai
[Azure OpenAI] [Release Branch] Main tracking branch
2 parents a0f9798 + 44aa4f9 commit a4891a6

File tree

15 files changed

+199
-94
lines changed

15 files changed

+199
-94
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -66,15 +66,17 @@ These models are currently available for use in Azure OpenAI Service.
6666
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 |
6767
| `gpt-4`<br>`gpt-4-32k` | 0314 | No earlier than July 13, 2024 |
6868
| `gpt-4`<br>`gpt-4-32k` | 0613 | No earlier than Sep 30, 2024 |
69-
| `gpt-4` | 1106-preview | To be upgraded to a stable version with date to be announced |
70-
| `gpt-4` | 0125-preview | To be upgraded to a stable version with date to be announced |
71-
| `gpt-4` | vision-preview | To be upgraded to a stable version with date to be announced |
69+
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later **<sup>1</sup>** |
70+
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later **<sup>1</sup>** |
71+
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later **<sup>1</sup>** |
7272
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 |
7373
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 |
7474
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 |
7575
| `text-embedding-3-small` | | No earlier than Feb 2, 2025 |
7676
| `text-embedding-3-large` | | No earlier than Feb 2, 2025 |
7777

78+
**<sup>1</sup>** We will notify all customers with these preview deployments at least two weeks before the start of the upgrades. We will publish an upgrade schedule detailing the order of regions and model versions that we will follow during the upgrades, and link to that schedule from here.
79+
7880

7981
## Deprecated models
8082

articles/ai-services/openai/concepts/models.md

Lines changed: 54 additions & 43 deletions
Original file line numberDiff line numberDiff line change
@@ -18,24 +18,64 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
1818

1919
| Models | Description |
2020
|--|--|
21-
| [GPT-4](#gpt-4-and-gpt-4-turbo-preview) | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
21+
| [GPT-4 Turbo 🆕](#gpt-4-turbo) | The latest most capable Azure OpenAI models with multimodal versions which can accept both text and images as input. |
22+
| [GPT-4](#gpt-4) | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
2223
| [GPT-3.5](#gpt-35) | A set of models that improve on GPT-3 and can understand and generate natural language and code. |
2324
| [Embeddings](#embeddings-models) | A set of models that can convert text into numerical vector form to facilitate text similarity. |
2425
| [DALL-E](#dall-e-models) | A series of models that can generate original images from natural language. |
2526
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
2627
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
2728

28-
## GPT-4 and GPT-4 Turbo Preview
29+
## GPT-4 Turbo
2930

30-
GPT-4 is a large multimodal model (accepting text or image inputs and generating text) that can solve difficult problems with greater accuracy than any of OpenAI's previous models. Like GPT-3.5 Turbo, GPT-4 is optimized for chat and works well for traditional completions tasks. Use the Chat Completions API to use GPT-4. To learn more about how to interact with GPT-4 and the Chat Completions API check out our [in-depth how-to](../how-to/chatgpt.md).
31+
GPT-4 Turbo is a large multimodal model (accepting text or image inputs and generating text) that can solve difficult problems with greater accuracy than any of OpenAI's previous models. Like GPT-3.5 Turbo, and older GPT-4 models GPT-4 Turbo is optimized for chat and works well for traditional completions tasks.
3132

32-
GPT-4 Turbo with Vision is the version of GPT-4 that accepts image inputs. It is available as the `vision-preview` model of `gpt-4`.
33+
[!INCLUDE [GPT-4 Turbo](../includes/gpt-4-turbo.md)]
3334

34-
- `gpt-4`
35-
- `gpt-4-32k`
35+
## GPT-4
36+
37+
GPT-4 is the predecessor to GPT-4 Turbo. Both the GPT-4 and GPT-4 Turbo models have a base model name of `gpt-4`. You can distinguish between the GPT-4 and Turbo models by examining the model version.
38+
39+
- `gpt-4` **Version** `0314`
40+
- `gpt-4` **Version** `0613`
41+
- `gpt-4-32k` **Version** `0613`
3642

3743
You can see the token context length supported by each model in the [model summary table](#model-summary-table-and-region-availability).
3844

45+
## GPT-4 and GPT-4 Turbo models
46+
47+
- These models can only be used with the Chat Completion API.
48+
49+
See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI Service handles model version upgrades, and [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-4 deployments.
50+
51+
| Model ID | Max Request (tokens) | Training Data (up to) |
52+
| --- | :--- | :---: |
53+
| `gpt-4` (0314) | 8,192 | Sep 2021 |
54+
| `gpt-4-32k`(0314) | 32,768 | Sep 2021 |
55+
| `gpt-4` (0613) | 8,192 | Sep 2021 |
56+
| `gpt-4-32k` (0613) | 32,768 | Sep 2021 |
57+
| `gpt-4` (1106-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
58+
| `gpt-4` (0125-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
59+
| `gpt-4` (vision-preview)**<sup>2</sup>**<br>**GPT-4 Turbo with Vision Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
60+
| `gpt-4` (turbo-2024-04-09) 🆕 <br>**GPT-4 Turbo with Vision GA** | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
61+
62+
**<sup>1</sup>** GPT-4 Turbo Preview = `gpt-4` (0125-Preview) or `gpt-4` (1106-Preview). To deploy this model, under **Deployments** select model **gpt-4**. Under version select (0125-Preview) or (1106-Preview).
63+
64+
**<sup>2</sup>** GPT-4 Turbo with Vision Preview = `gpt-4` (vision-preview). To deploy this model, under **Deployments** select model **gpt-4**. For **Model version** select **vision-preview**.
65+
66+
> [!CAUTION]
67+
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to future preview versions and a stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
68+
69+
> [!NOTE]
70+
> Version `0314` of `gpt-4` and `gpt-4-32k` will be retired no earlier than July 5, 2024. Version `0613` of `gpt-4` and `gpt-4-32k` will be retired no earlier than September 30, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
71+
72+
- GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview previously released as version 1106-preview.
73+
- GPT-4 version 0125-preview completes tasks such as code generation more completely compared to gpt-4-1106-preview. Because of this, depending on the task, customers may find that GPT-4-0125-preview generates more output compared to the gpt-4-1106-preview. We recommend customers compare the outputs of the new model. GPT-4-0125-preview also addresses bugs in gpt-4-1106-preview with UTF-8 handling for non-English languages. GPT-4 version `turbo-2024-04-09` is the latest GA release and replaces `0125-Preview`, `1106-preview`, and `vision-preview`.
74+
75+
> [!IMPORTANT]
76+
>
77+
> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. The deployment upgrade of `gpt-4` 1106-Preview to `gpt-4` 0125-Preview scheduled for March 8, 2024 is no longer taking place. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released. For each deployment, a model version upgrade takes place with no interruption in service for API calls. Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region.
78+
3979
## GPT-3.5
4080

4181
GPT-3.5 models can understand and generate natural language or code. The most capable and cost effective model in the GPT-3.5 family is GPT-3.5 Turbo, which has been optimized for chat and works well for traditional completions tasks as well. GPT-3.5 Turbo is available for use with the Chat Completions API. GPT-3.5 Turbo Instruct has similar capabilities to `text-davinci-003` using the Completions API instead of the Chat Completions API. We recommend using GPT-3.5 Turbo and GPT-3.5 Turbo Instruct over [legacy GPT-3.5 and GPT-3 models](./legacy-models.md).
@@ -86,58 +126,29 @@ You can also use the OpenAI text to speech voices via Azure AI Speech. To learn
86126
## Model summary table and region availability
87127

88128
> [!NOTE]
89-
> This article only covers model/region availability that applies to all Azure OpenAI customers with deployment types of **Standard**. Some select customers have access to model/region combinations that are not listed in the unified table below. These tables also do not apply to customers using only **Provisioned** deployment types which have their own unique model/region availability matrix. For more information on **Provisioned** deployments refer to our [Provisioned guidance](./provisioned-throughput.md).
129+
> This article primarily covers model/region availability that applies to all Azure OpenAI customers with deployment types of **Standard**. Some select customers have access to model/region combinations that are not listed in the unified table below. For more information on Provisioned deployments, see our [Provisioned guidance](./provisioned-throughput.md).
90130
91131
### Standard deployment model availability
92132

93133
[!INCLUDE [Standard Models](../includes/model-matrix/standard-models.md)]
94134

95-
This table does not include fine-tuning regional availability, consult the dedicated [fine-tuning section](#fine-tuning-models) for this information.
135+
This table doesn't include fine-tuning regional availability, consult the dedicated [fine-tuning section](#fine-tuning-models) for this information.
96136

97137
### Standard deployment model quota
98138

99139
[!INCLUDE [Quota](../includes/model-matrix/quota.md)]
100140

101-
### GPT-4 and GPT-4 Turbo Preview models
102-
103-
GPT-4, GPT-4-32k, and GPT-4 Turbo with Vision are now available to all Azure OpenAI Service customers. Availability varies by region. If you don't see GPT-4 in your region, please check back later.
104-
105-
These models can only be used with the Chat Completion API.
106-
107-
GPT-4 version 0314 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
141+
### Provisioned deployment model availability
108142

109-
See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI Service handles model version upgrades, and [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-4 deployments.
143+
[!INCLUDE [Provisioned](../includes/model-matrix/provisioned-models.md)]
110144

111-
> [!NOTE]
112-
> Version `0314` of `gpt-4` and `gpt-4-32k` will be retired no earlier than July 5, 2024. Version `0613` of `gpt-4` and `gpt-4-32k` will be retired no earlier than September 30, 2024. See [model updates](../how-to/working-with-models.md#model-updates) for model upgrade behavior.
145+
### How do I get access to Provisioned?
113146

114-
GPT-4 version 0125-preview is an updated version of the GPT-4 Turbo preview previously released as version 1106-preview. GPT-4 version 0125-preview completes tasks such as code generation more completely compared to gpt-4-1106-preview. Because of this, depending on the task, customers may find that GPT-4-0125-preview generates more output compared to the gpt-4-1106-preview. We recommend customers compare the outputs of the new model. GPT-4-0125-preview also addresses bugs in gpt-4-1106-preview with UTF-8 handling for non-English languages.
147+
You need to speak with your Microsoft sales/account team to acquire provisioned throughput. If you don't have a sales/account team, unfortunately at this time, you cannot purchase provisioned throughput.
115148

116-
> [!IMPORTANT]
117-
>
118-
> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. The deployment upgrade of `gpt-4` 1106-Preview to `gpt-4` 0125-Preview scheduled for March 8, 2024 is no longer taking place. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released. For each deployment, a model version upgrade takes place with no interruption in service for API calls. Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region.
119-
120-
| Model ID | Max Request (tokens) | Training Data (up to) |
121-
| --- | :--- | :---: |
122-
| `gpt-4` (0314) | 8,192 | Sep 2021 |
123-
| `gpt-4-32k`(0314) | 32,768 | Sep 2021 |
124-
| `gpt-4` (0613) | 8,192 | Sep 2021 |
125-
| `gpt-4-32k` (0613) | 32,768 | Sep 2021 |
126-
| `gpt-4` (1106-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
127-
| `gpt-4` (0125-Preview)**<sup>1</sup>**<br>**GPT-4 Turbo Preview** | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
128-
| `gpt-4` (vision-preview)**<sup>2</sup>**<br>**GPT-4 Turbo with Vision Preview** | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
129-
130-
**<sup>1</sup>** GPT-4 Turbo Preview = `gpt-4` (0125-Preview) or `gpt-4` (1106-Preview). To deploy this model, under **Deployments** select model **gpt-4**. Under version select (0125-Preview) or (1106-Preview).
131-
132-
**<sup>2</sup>** GPT-4 Turbo with Vision Preview = `gpt-4` (vision-preview). To deploy this model, under **Deployments** select model **gpt-4**. For **Model version** select **vision-preview**.
133-
134-
> [!CAUTION]
135-
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to future preview versions and a stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
136-
137-
> [!NOTE]
138-
> Regions where GPT-4 (0314) & (0613) are listed as available have access to both the 8K and 32K versions of the model
149+
For more information on Provisioned deployments, see our [Provisioned guidance](./provisioned-throughput.md).
139150

140-
### GPT-4 and GPT-4 Turbo Preview model availability
151+
### GPT-4 and GPT-4 Turbo model availability
141152

142153
#### Public cloud regions
143154

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Azure OpenAI Service provisioned throughput
33
description: Learn about provisioned throughput and Azure OpenAI.
44
ms.service: azure-ai-openai
55
ms.topic: conceptual
6-
ms.date: 1/16/2024
6+
ms.date: 04/29/2024
77
manager: nitinme
88
author: mrbullwinkle #ChrisHMSFT
99
ms.author: mbullwin #chrhoder
@@ -40,6 +40,10 @@ An Azure OpenAI Deployment is a unit of management for a specific OpenAI Model.
4040

4141
You need to speak with your Microsoft sales/account team to acquire provisioned throughput. If you don't have a sales/account team, unfortunately at this time, you cannot purchase provisioned throughput.
4242

43+
## What models and regions are available for provisioned throughput?
44+
45+
[!INCLUDE [Provisioned](../includes/model-matrix/provisioned-models.md)]
46+
4347
## Key concepts
4448

4549
### Provisioned throughput units

articles/ai-services/openai/gpt-v-quickstart.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,12 @@ zone_pivot_groups: openai-quickstart-gpt-v
1515

1616
# Quickstart: Use images in your AI chats
1717

18+
Get started using GPT-4 Turbo with images with the Azure OpenAI Service.
19+
20+
## GPT-4 Turbo model upgrade
21+
22+
[!INCLUDE [GPT-4 Turbo](./includes/gpt-4-turbo.md)]
23+
1824
::: zone pivot="programming-language-studio"
1925

2026
[!INCLUDE [Studio quickstart](includes/gpt-v-studio.md)]

articles/ai-services/openai/how-to/gpt-with-vision.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,10 @@ The GPT-4 Turbo with Vision model answers general questions about what's present
2020
> [!TIP]
2121
> To use GPT-4 Turbo with Vision, you call the Chat Completion API on a GPT-4 Turbo with Vision model that you have deployed. If you're not familiar with the Chat Completion API, see the [GPT-4 Turbo & GPT-4 how-to guide](/azure/ai-services/openai/how-to/chatgpt?tabs=python&pivots=programming-language-chat-completions).
2222
23+
## GPT-4 Turbo model upgrade
24+
25+
[!INCLUDE [GPT-4 Turbo](../includes/gpt-4-turbo.md)]
26+
2327
## Call the Chat Completion APIs
2428

2529
The following command shows the most basic way to use the GPT-4 Turbo with Vision model with code. If this is your first time using these models programmatically, we recommend starting with our [GPT-4 Turbo with Vision quickstart](../gpt-v-quickstart.md).

articles/ai-services/openai/how-to/reproducible-output.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -25,8 +25,8 @@ Reproducible output is only currently supported with the following:
2525

2626
* `gpt-35-turbo` (1106) - [region availability](../concepts/models.md#gpt-35-turbo-model-availability)
2727
* `gpt-35-turbo` (0125) - [region availability](../concepts/models.md#gpt-35-turbo-model-availability)
28-
* `gpt-4` (1106-Preview) - [region availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-preview-model-availability)
29-
* `gpt-4` (0125-Preview) - [region availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-preview-model-availability)
28+
* `gpt-4` (1106-Preview) - [region availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-model-availability)
29+
* `gpt-4` (0125-Preview) - [region availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-model-availability)
3030

3131
### API Version
3232

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
---
2+
title: GPT-4 Turbo general availability
3+
titleSuffix: Azure OpenAI Service
4+
description: Information on GPT-4 Turbo model behavior and limitations
5+
manager: nitinme
6+
ms.service: azure-ai-openai
7+
ms.topic: include
8+
ms.date: 04/29/2024
9+
---
10+
11+
The latest GA release of GPT-4 Turbo is:
12+
13+
- `gpt-4` **Version:** `turbo-2024-04-09`
14+
15+
This is the replacement for the following preview models:
16+
17+
- `gpt-4` **Version:** `1106-Preview`
18+
- `gpt-4` **Version:** `0125-Preview`
19+
- `gpt-4` **Version:** `vision-preview`
20+
21+
### Differences between OpenAI and Azure OpenAI GPT-4 Turbo with Vision GA model
22+
23+
- OpenAI's version of the latest `0409` turbo model supports JSON mode and function calling for all inference requests.
24+
- Azure OpenAI's version of the latest `turbo-2024-04-09` currently doesn't support the use of JSON mode and function calling when making inference requests with image (vision) input. Text based input requests do support JSON mode and function calling.
25+
26+
### Differences from gpt-4 vision-preview
27+
28+
- Azure AI specific Vision enhancements integration with GPT-4 Turbo with Vision aren't supported for `gpt-4` **Version:** `turbo-2024-04-09`. This includes Optical Character Recognition (OCR), object grounding, video prompts, and improved handling of your data with images.
29+
30+
### Region availability
31+
32+
For information on model regional availability consult the [model matrix](../concepts/models.md#gpt-4-and-gpt-4-turbo-model-availability).
33+
34+
### Deploying GPT-4 Turbo with Vision GA
35+
36+
To deploy the GA model from the Studio UI, select `GPT-4` and then choose the `turbo-2024-04-09` version from the dropdown menu. The default quota for the `gpt-4-turbo-2024-04-09` model will be the same as current quota for GPT-4-Turbo. See the [regional quota limits.](../concepts/models.md#standard-deployment-model-quota)

0 commit comments

Comments
 (0)