Skip to content

Commit 3e432ee

Browse files
Merge pull request #26 from mrbullwinkle/mrb_08_28_2024_s_outputs_001
[Azure OpenAI] [Release branch] GPT-4o structured outputs
2 parents ae864ff + bdf185e commit 3e432ee

File tree

1 file changed

+15
-20
lines changed
  • articles/ai-services/openai/concepts

1 file changed

+15
-20
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 15 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/14/2024
7+
ms.date: 08/28/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -26,24 +26,6 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
2626
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
2727
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
2828

29-
## Early access playground (preview)
30-
31-
On August 6, 2024, OpenAI [announced](https://openai.com/index/introducing-structured-outputs-in-the-api/) the latest version of their flagship GPT-4o model version `2024-08-06`. GPT-4o `2024-08-06` has all the capabilities of the previous version as well as:
32-
33-
* An enhanced ability to support complex structured outputs.
34-
* Max output tokens have been increased from 4,096 to 16,384.
35-
36-
Azure customers can test out GPT-4o `2024-08-06` today in the new AI Studio early access playground (preview).
37-
38-
Unlike the previous early access playground, the AI Studio early access playground (preview) does not require you to have a resource in a specific region.
39-
40-
> [!NOTE]
41-
> Prompts and completions made through the early access playground (preview) may be processed in any Azure OpenAI region, and are currently subject to a 10 request per minute per Azure subscription limit. This limit may change in the future.
42-
>
43-
> Azure OpenAI Service abuse monitoring is enabled for all early access playground users even if approved for modification; default content filters are enabled and cannot be modified.
44-
45-
To test out GPT-4o `2024-08-06`, sign-in to the Azure AI early access playground (preview) using this [link](https://aka.ms/oai/docs/earlyaccessplayground).
46-
4729
## GPT-4o and GPT-4 Turbo
4830

4931
GPT-4o integrates text and images in a single model, enabling it to handle multiple data types simultaneously. This multimodal approach enhances accuracy and responsiveness in human-computer interactions. GPT-4o matches GPT-4 Turbo in English text and coding tasks while offering superior performance in non-English languages and vision tasks, setting new benchmarks for AI capabilities.
@@ -56,6 +38,7 @@ You need to [create](../how-to/create-resource.md) or use an existing resource i
5638

5739
When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o models. If you are performing a programmatic deployment, the **model** names are:
5840

41+
- `gpt-4o` **Version** `2024-08-06`
5942
- `gpt-4o`, **Version** `2024-05-13`
6043
- `gpt-4o-mini` **Version** `2024-07-18`
6144

@@ -83,8 +66,9 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
8366

8467
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
8568
| --- | :--- |:--- |:---: |
69+
|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
8670
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
87-
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
71+
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
8872
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
8973
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> -Replaces 1106-Preview <br>- Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
9074
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
@@ -188,6 +172,17 @@ For more information on Provisioned deployments, see our [Provisioned guidance](
188172

189173
### Global standard model availability
190174

175+
`gpt-4o` **Version:** `2024-08-06`
176+
177+
**Supported regions:**
178+
- eastus
179+
- eastus2
180+
- northcentralus
181+
- southcentralus
182+
- swedencentral
183+
- westus
184+
- westus3
185+
191186
`gpt-4o` **Version:** `2024-05-13`
192187

193188
**Supported regions:**

0 commit comments

Comments
 (0)