Skip to content

Commit c57e2fa

Browse files
Merge pull request #283890 from mrbullwinkle/mrb_08_07_2024_models
[Azure OpenAI] 3 PM publish updates
2 parents bd92835 + 88c957f commit c57e2fa

File tree

4 files changed

+15
-10
lines changed

4 files changed

+15
-10
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the model deprecations and retirements in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/01/2024
7+
ms.date: 08/07/2024
88
ms.custom:
99
manager: nitinme
1010
author: mrbullwinkle
@@ -82,6 +82,8 @@ For more information, see [How to upgrade to a new model or version](./model-ver
8282

8383
> [!NOTE]
8484
> Not all models go through a deprecation period prior to retirement. Some models/versions only have a retirement date.
85+
>
86+
> **Fine-tuned models** are subject to the same deprecation and retirement schedule as their equivalent base model.
8587
8688
These models are currently available for use in Azure OpenAI Service.
8789

articles/ai-services/openai/concepts/models.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/06/2024
7+
ms.date: 08/07/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -83,9 +83,9 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
8383

8484
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
8585
| --- | :--- |:--- |:---: |
86-
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling <br> - **Does not support enhancements** | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
87-
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks <br> - **Does not support enhancements** |Input: 128,000 <br> Output: 4,096| Oct 2023 |
88-
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. <br> - **Does not support enhancements**. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
86+
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
87+
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
88+
| `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
8989
| `gpt-4` (0125-Preview)*<br>**GPT-4 Turbo Preview** | **Preview Model** <br> -Replaces 1106-Preview <br>- Better code generation performance <br> - Reduces cases where the model doesn't complete a task <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Dec 2023 |
9090
| `gpt-4` (vision-preview)<br>**GPT-4 Turbo with Vision Preview** | **Preview model** <br> - Accepts text and image input. <br> - Supports enhancements <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |
9191
| `gpt-4` (1106-Preview)<br>**GPT-4 Turbo Preview** | **Preview Model** <br> - JSON Mode <br> - parallel function calling <br> - reproducible output (preview) | Input: 128,000 <br> Output: 4,096 | Apr 2023 |

articles/ai-services/openai/faq.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,9 +23,9 @@ sections:
2323
- name: General
2424
questions:
2525
- question: |
26-
Does Azure OpenAI support custom API headers? We append additional custom headers to our API requests and are seeing failures occur.
26+
Does Azure OpenAI support custom API headers? We append additional custom headers to our API requests and are seeing HTTP 431 failure errors.
2727
answer: |
28-
Azure OpenAI currently only supports a maximum of 10 custom API headers.
28+
Our current APIs allow up to 10 custom headers, which are passed through the pipeline, and returned. We have noticed some customers now exceed this header count resulting in HTTP 431 errors. There is no solution for this error, other than to reduce header volume. In future API versions we will no longer pass through custom headers. We recommend customers not depend on custom headers in future system architectures.
2929
- question: |
3030
Does Azure OpenAI work with the latest Python library released by OpenAI (version>=1.0)?
3131
answer: |
@@ -41,7 +41,7 @@ sections:
4141
- question: |
4242
How do the capabilities of Azure OpenAI compare to OpenAI?
4343
answer: |
44-
Azure OpenAI Service gives customers advanced language AI with OpenAI GPT-3, Codex, and DALL-E models with the security and enterprise promise of Azure. Azure OpenAI co-develops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.
44+
Azure OpenAI Service gives customers advanced language AI with OpenAI GPT-3, Codex, and DALL-E models with the security and enterprise promise of Azure. Azure OpenAI codevelops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.
4545
4646
With Azure OpenAI, customers get the security capabilities of Microsoft Azure while running the same models as OpenAI.
4747
- question: |
@@ -203,7 +203,7 @@ sections:
203203
- question: |
204204
What type of files can I upload?
205205
answer: |
206-
We currently support PNG (.png), JPEG (.jpeg and .jpg), WEBP (.webp), and non-animated GIF (.gif).
206+
We currently support PNG (.png), JPEG (.jpeg and .jpg), WEBP (.webp), and nonanimated GIF (.gif).
207207
- question: |
208208
Is there a limit to the size of the image I can upload?
209209
answer: |

articles/ai-services/openai/quotas-limits.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,10 @@ The following sections provide you with a quick guide to the default quotas and
4848
| Assistants token limit | 2,000,000 token limit |
4949
| GPT-4o max images per request (# of images in the messages array/conversation history) | 10 |
5050
| GPT-4 `vision-preview` & GPT-4 `turbo-2024-04-09` default max tokens | 16 <br><br> Increase the `max_tokens` parameter value to avoid truncated responses. GPT-4o max tokens defaults to 4096. |
51-
| Max number of custom headers in API requests | 10 |
51+
| Max number of custom headers in API requests<sup>1</sup> | 10 |
52+
53+
<sup>1</sup> Our current APIs allow up to 10 custom headers, which are passed through the pipeline, and returned. We have noticed some customers now exceed this header count resulting in HTTP 431 errors. There is no solution for this error, other than to reduce header volume. **In future API versions we will no longer pass through custom headers**. We recommend customers not depend on custom headers in future system architectures.
54+
5255

5356
## Regional quota limits
5457

0 commit comments

Comments
 (0)