Skip to content

Commit 413694e

Browse files
authored
Merge pull request #35 from MicrosoftDocs/release-2024-openai-aug
[Azure OpenAI] [Release branch to main tracking PR]
2 parents 810cdc8 + 60323ec commit 413694e

File tree

4 files changed

+767
-22
lines changed

4 files changed

+767
-22
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 15 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/14/2024
7+
ms.date: 08/28/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -26,24 +26,6 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
2626
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
2727
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
2828

29-
## Early access playground (preview)
30-
31-
On August 6, 2024, OpenAI [announced](https://openai.com/index/introducing-structured-outputs-in-the-api/) the latest version of their flagship GPT-4o model version `2024-08-06`. GPT-4o `2024-08-06` has all the capabilities of the previous version as well as:
32-
33-
* An enhanced ability to support complex structured outputs.
34-
* Max output tokens have been increased from 4,096 to 16,384.
35-
36-
Azure customers can test out GPT-4o `2024-08-06` today in the new AI Studio early access playground (preview).
37-
38-
Unlike the previous early access playground, the AI Studio early access playground (preview) does not require you to have a resource in a specific region.
39-
40-
> [!NOTE]
41-
> Prompts and completions made through the early access playground (preview) may be processed in any Azure OpenAI region, and are currently subject to a 10 request per minute per Azure subscription limit. This limit may change in the future.
42-
>
43-
> Azure OpenAI Service abuse monitoring is enabled for all early access playground users even if approved for modification; default content filters are enabled and cannot be modified.
44-
45-
To test out GPT-4o `2024-08-06`, sign-in to the Azure AI early access playground (preview) using this [link](https://aka.ms/oai/docs/earlyaccessplayground).
46-
4729
## GPT-4o and GPT-4 Turbo
4830

4931
GPT-4o integrates text and images in a single model, enabling it to handle multiple data types simultaneously. This multimodal approach enhances accuracy and responsiveness in human-computer interactions. GPT-4o matches GPT-4 Turbo in English text and coding tasks while offering superior performance in non-English languages and vision tasks, setting new benchmarks for AI capabilities.
@@ -56,6 +38,7 @@ You need to [create](../how-to/create-resource.md) or use an existing resource i
5638

5739
When your resource is created, you can [deploy](../how-to/create-resource.md#deploy-a-model) the GPT-4o models. If you are performing a programmatic deployment, the **model** names are:
5840

41+
- `gpt-4o` **Version** `2024-08-06`
5942
- `gpt-4o`, **Version** `2024-05-13`
6043
- `gpt-4o-mini` **Version** `2024-07-18`
6144

@@ -83,8 +66,9 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
8366

8467
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
8568
| --- | :--- |:--- |:---: |
69+
|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
8670
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
87-
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
71+
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
8872
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
8973
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> -Replaces 1106-Preview <br>- Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
9074
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
@@ -188,6 +172,17 @@ For more information on Provisioned deployments, see our [Provisioned guidance](
188172

189173
### Global standard model availability
190174

175+
`gpt-4o` **Version:** `2024-08-06`
176+
177+
**Supported regions:**
178+
- eastus
179+
- eastus2
180+
- northcentralus
181+
- southcentralus
182+
- swedencentral
183+
- westus
184+
- westus3
185+
191186
`gpt-4o` **Version:** `2024-05-13`
192187

193188
**Supported regions:**

articles/ai-services/openai/how-to/json-mode.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ services: cognitive-services
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 12/05/2023
9+
ms.date: 08/29/2024
1010
author: mrbullwinkle
1111
ms.author: mbullwin
1212
recommendations: false
@@ -17,6 +17,9 @@ recommendations: false
1717

1818
JSON mode allows you to set the models response format to return a valid JSON object as part of a chat completion. While generating valid JSON was possible previously, there could be issues with response consistency that would lead to invalid JSON objects being generated.
1919

20+
> [!NOTE]
21+
> While JSON mode is still supported, when possible we recommend using [structured outputs](./structured-outputs.md). Like JSON mode structured outputs generates valid JSON, but with the added benefit that you can constrain the model to use a specific JSON schema.
22+
2023
## JSON mode support
2124

2225
JSON mode is only currently supported with the following models:
@@ -27,6 +30,8 @@ JSON mode is only currently supported with the following models:
2730
* `gpt-35-turbo` (0125)
2831
* `gpt-4` (1106-Preview)
2932
* `gpt-4` (0125-Preview)
33+
* `gpt-4o`
34+
* `gpt-4o-mini`
3035

3136
### API support
3237

@@ -47,7 +52,7 @@ client = AzureOpenAI(
4752
)
4853

4954
response = client.chat.completions.create(
50-
model="gpt-4-0125-Preview", # Model = should match the deployment name you chose for your 0125-Preview model deployment
55+
model="YOUR-MODEL_DEPLOYMENT_NAME", # Model = should match the deployment name you chose for your model deployment
5156
response_format={ "type": "json_object" },
5257
messages=[
5358
{"role": "system", "content": "You are a helpful assistant designed to output JSON."},

0 commit comments

Comments
 (0)