Skip to content

Commit 51e566e

Browse files
authored
Merge pull request #1538 from MicrosoftDocs/main
11/12/2024 AM Publish
2 parents d63c476 + e9794a4 commit 51e566e

File tree

14 files changed

+1163
-160
lines changed

14 files changed

+1163
-160
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 23 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the model deprecations and retirements in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 10/25/2024
7+
ms.date: 11/11/2024
88
ms.custom:
99
manager: nitinme
1010
author: mrbullwinkle
@@ -91,26 +91,26 @@ These models are currently available for use in Azure OpenAI Service.
9191

9292
| Model | Version | Retirement date | Suggested replacements |
9393
| ---- | ---- | ---- | --- |
94-
| `babbage-002` | 1 | Deprecation Date: November 15, 2024 <br>Retirement Date: January 27, 2025 | |
95-
| `davinci-002` | 1 | Deprecation Date: November 15, 2024 <br>Retirement Date: January 27, 2025 | |
94+
| `babbage-002` | 1 | Retirement Date: January 27, 2025 | |
95+
| `davinci-002` | 1 | Retirement Date: January 27, 2025 | |
9696
| `dall-e-2`| 2 | January 27, 2025 | `dalle-3` |
9797
| `dall-e-3` | 3 | No earlier than April 30, 2025 | |
98-
| `gpt-35-turbo` | 0301 | January 27, 2025<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
99-
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | January 27, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
100-
| `gpt-35-turbo` | 1106 | No earlier than January 27, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
101-
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 | `gpt-4o-mini` |
98+
| `gpt-35-turbo` | 0301 | February 13, 2025<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on January 13, 2025. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
99+
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | February 13, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on January 13, 2025. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
100+
| `gpt-35-turbo` | 1106 | No earlier than January 27, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on January 13, 2025. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
101+
| `gpt-35-turbo` | 0125 | No earlier than March 31, 2025 | `gpt-4o-mini` |
102102
| `gpt-4`<br>`gpt-4-32k` | 0314 | June 6, 2025 | `gpt-4o` |
103103
| `gpt-4`<br>`gpt-4-32k` | 0613 | June 6, 2025 | `gpt-4o` |
104104
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o`|
105105
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o` |
106106
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o`|
107-
| `gpt-4o` | 2024-05-13 | No earlier than May 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on December 5, 2024. | |
107+
| `gpt-4o` | 2024-05-13 | No earlier than May 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on February 13, 2025. | |
108108
| `gpt-4o-mini` | 2024-07-18 | No earlier than July 18, 2025 | |
109109
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 | |
110110
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
111111
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
112-
| `text-embedding-3-small` | | No earlier than Feb 2, 2025 | |
113-
| `text-embedding-3-large` | | No earlier than Feb 2, 2025 | |
112+
| `text-embedding-3-small` | | No earlier than April 3, 2025 | |
113+
| `text-embedding-3-large` | | No earlier than April 3, 2025 | |
114114

115115
**<sup>1</sup>** We will notify all customers with these preview deployments at least 30 days before the start of the upgrades. We will publish an upgrade schedule detailing the order of regions and model versions that we will follow during the upgrades, and link to that schedule from here.
116116

@@ -121,8 +121,8 @@ These models are currently available for use in Azure OpenAI Service.
121121

122122
| Model | Current default version | New default version | Default upgrade date |
123123
|---|---|---|---|
124-
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024.|
125-
| `gpt-4o` | 2024-05-13 | 2024-08-06 | Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on December 5, 2024. |
124+
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on January 13, 2025.|
125+
| `gpt-4o` | 2024-05-13 | 2024-08-06 | Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on February 13, 2025. |
126126

127127
## Deprecated models
128128

@@ -162,6 +162,17 @@ If you're an existing customer looking for information about these models, see [
162162

163163
## Retirement and deprecation history
164164

165+
## November 11, 2024
166+
167+
Updates to:
168+
169+
- `babbage-002`, `davinci-002`.
170+
- `gpt-35-turbo` DEFAULT model version update date.
171+
- `gpt-35-turbo` 0301, 0613 retirement date.
172+
- `gpt-35-turbo` 0125 retirement date.
173+
- `gpt-4o` DEFAULT model update date.
174+
- `text-embeddings-3-small` & `text-embedding-3-large` retirement date.
175+
165176
## October 25, 2024
166177

167178
* `babbage-002` & `davinci-002` deprecation date: November 15, 2024 and retirement date: January 27, 2025.

articles/ai-services/openai/quotas-limits.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 10/23/2024
13+
ms.date: 11/11/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -51,8 +51,9 @@ The following sections provide you with a quick guide to the default quotas and
5151
| GPT-4o max images per request (# of images in the messages array/conversation history) | 10 |
5252
| GPT-4 `vision-preview` & GPT-4 `turbo-2024-04-09` default max tokens | 16 <br><br> Increase the `max_tokens` parameter value to avoid truncated responses. GPT-4o max tokens defaults to 4096. |
5353
| Max number of custom headers in API requests<sup>1</sup> | 10 |
54+
| Max number requests per minute<br/><br/>Current rate limits for real time audio (`gpt-4o-realtime-preview`) are defined as the number of new websocket connections per minute. For example, 6 request per minute (RPM) means 6 new connections per minute. At this time, the usage limits for `gpt-4o-realtime-preview` are suitable for test and development. | 6 new connections per minute |
5455

55-
<sup>1</sup> Our current APIs allow up to 10 custom headers, which are passed through the pipeline, and returned. We have noticed some customers now exceed this header count resulting in HTTP 431 errors. There is no solution for this error, other than to reduce header volume. **In future API versions we will no longer pass through custom headers**. We recommend customers not depend on custom headers in future system architectures.
56+
<sup>1</sup> Our current APIs allow up to 10 custom headers, which are passed through the pipeline, and returned. Some customers now exceed this header count resulting in HTTP 431 errors. There's no solution for this error, other than to reduce header volume. **In future API versions we will no longer pass through custom headers**. We recommend customers not depend on custom headers in future system architectures.
5657

5758
## Regional quota limits
5859

@@ -180,7 +181,7 @@ To minimize issues related to rate limits, it's a good idea to use the following
180181

181182
### How to request increases to the default quotas and limits
182183

183-
Quota increase requests can be submitted from the [Quotas](./how-to/quota.md) page of Azure AI Studio. Note that due to overwhelming demand, quota increase requests are being accepted and will be filled in the order they are received. Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request might be denied if this condition isn't met.
184+
Quota increase requests can be submitted from the [Quotas](./how-to/quota.md) page of Azure AI Studio. Due to high demand, quota increase requests are being accepted and will be filled in the order they're received. Priority is given to customers who generate traffic that consumes the existing quota allocation, and your request might be denied if this condition isn't met.
184185

185186
For other rate limits, [submit a service request](../cognitive-services-support-options.md?context=/azure/ai-services/openai/context/context).
186187

0 commit comments

Comments
 (0)