articles/ai-services/openai/concepts/model-retirements.md
Lines changed: 22 additions & 20 deletions
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
 description: Learn about the model deprecations and retirements in Azure OpenAI.
 ms.service: azure-ai-openai
 ms.topic: conceptual
-ms.date: 08/08/2024
+ms.date: 08/14/2024
 ms.custom:
 manager: nitinme
 author: mrbullwinkle
@@ -74,9 +74,11 @@ Azure OpenAI notifies those who are members of the following roles for each subs

 ## How to get ready for model retirements and version upgrades

-To prepare for model retirements and version upgrades, we recommend that customers evaluate their applications with the new models and versions and evaluate their behavior. We also recommend that customers update their applications to use the new models and versions before the retirement date.
+To prepare for model retirements and version upgrades, we recommend that customers test their applications with the new models and versions and evaluate their behavior. We also recommend that customers update their applications to use the new models and versions before the retirement date.

-For more information, see [How to upgrade to a new model or version](./model-versions.md).
+For more information on the model evaluation process, see the [Getting started with model evaluation guide](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/how-to-evaluate-amp-upgrade-model-versions-in-the-azure-openai/ba-p/4218880).
+
+For information on the model upgrade process, see [How to upgrade to a new model or version](./model-versions.md).

 ## Current models

@@ -87,22 +89,22 @@ For more information, see [How to upgrade to a new model or version](./model-ver

 These models are currently available for use in Azure OpenAI Service.

-| Model | Version | Retirement date |
-| ---- | ---- | ---- |
-|`gpt-35-turbo`| 0301 | No earlier than October 1, 2024 |
-|`gpt-35-turbo`<br>`gpt-35-turbo-16k`| 0613 | November 1, 2024 |
-|`gpt-35-turbo`| 1106 | No earlier than Nov 17, 2024 |
-|`gpt-35-turbo`| 0125 | No earlier than Feb 22, 2025 |
-|`gpt-4`<br>`gpt-4-32k`| 0314 |**Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |
-|`gpt-4`<br>`gpt-4-32k`| 0613 |**Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |
-|`gpt-4`| 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|
-|`gpt-4`| 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|
-|`gpt-4`| vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|
-|`gpt-3.5-turbo-instruct`| 0914 | No earlier than Sep 14, 2025 |
-|`text-embedding-ada-002`| 2 | No earlier than April 3, 2025 |
-|`text-embedding-ada-002`| 1 | No earlier than April 3, 2025 |
-|`text-embedding-3-small`|| No earlier than Feb 2, 2025 |
-|`text-embedding-3-large`|| No earlier than Feb 2, 2025 |
+| Model | Version | Retirement date | Suggested replacement |
+| ---- | ---- | ---- | --- |
+|`gpt-35-turbo`| 0301 | No earlier than October 1, 2024 |`gpt-4o-mini`|
+|`gpt-35-turbo`<br>`gpt-35-turbo-16k`| 0613 | November 1, 2024 |`gpt-4o-mini`|
+|`gpt-35-turbo`| 1106 | No earlier than Nov 17, 2024 |`gpt-4o-mini`|
+|`gpt-35-turbo`| 0125 | No earlier than Feb 22, 2025 |`gpt-4o-mini`|
+|`gpt-4`<br>`gpt-4-32k`| 0314 |**Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |`gpt-4o`|
+|`gpt-4`<br>`gpt-4-32k`| 0613 |**Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |`gpt-4o`|
+|`gpt-4`| 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|`gpt-4o`|
+|`gpt-4`| 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|`gpt-4o`|
+|`gpt-4`| vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>**|`gpt-4o`|
+|`gpt-3.5-turbo-instruct`| 0914 | No earlier than Sep 14, 2025 ||
+|`text-embedding-ada-002`| 2 | No earlier than April 3, 2025 |`text-embedding-3-small` or `text-embedding-3-large`|
+|`text-embedding-ada-002`| 1 | No earlier than April 3, 2025 |`text-embedding-3-small` or `text-embedding-3-large`|
+|`text-embedding-3-small`|| No earlier than Feb 2, 2025 ||
+|`text-embedding-3-large`|| No earlier than Feb 2, 2025 ||

 **<sup>1</sup>** We will notify all customers with these preview deployments at least 30 days before the start of the upgrades. We will publish an upgrade schedule detailing the order of regions and model versions that we will follow during the upgrades, and link to that schedule from here.

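As a rough illustration of how the new "Suggested replacement" column might be consumed, the sketch below copies the added rows into a lookup table. The names `SUGGESTED_REPLACEMENTS` and `suggested_replacements` are hypothetical and are not part of the commit or of any Azure SDK; rows with an empty replacement cell are simply left out.

```python
from typing import Dict, Tuple

# (model, version) -> suggested replacements, copied from the added table rows.
# Hypothetical structure for illustration only; rows whose
# "Suggested replacement" cell is empty are omitted.
SUGGESTED_REPLACEMENTS: Dict[Tuple[str, str], Tuple[str, ...]] = {
    ("gpt-35-turbo", "0301"): ("gpt-4o-mini",),
    ("gpt-35-turbo", "0613"): ("gpt-4o-mini",),
    ("gpt-35-turbo-16k", "0613"): ("gpt-4o-mini",),
    ("gpt-35-turbo", "1106"): ("gpt-4o-mini",),
    ("gpt-35-turbo", "0125"): ("gpt-4o-mini",),
    ("gpt-4", "0314"): ("gpt-4o",),
    ("gpt-4-32k", "0314"): ("gpt-4o",),
    ("gpt-4", "0613"): ("gpt-4o",),
    ("gpt-4-32k", "0613"): ("gpt-4o",),
    ("gpt-4", "1106-preview"): ("gpt-4o",),
    ("gpt-4", "0125-preview"): ("gpt-4o",),
    ("gpt-4", "vision-preview"): ("gpt-4o",),
    ("text-embedding-ada-002", "2"): ("text-embedding-3-small", "text-embedding-3-large"),
    ("text-embedding-ada-002", "1"): ("text-embedding-3-small", "text-embedding-3-large"),
}


def suggested_replacements(model: str, version: str) -> Tuple[str, ...]:
    """Return the replacements listed for a model version, or () if none is listed."""
    return SUGGESTED_REPLACEMENTS.get((model, version), ())
```

A caller could flag any deployment for which `suggested_replacements` returns a non-empty tuple and schedule it for testing against the listed model before the retirement date.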
@@ -114,7 +116,7 @@ These models were deprecated on July 6, 2023 and were retired on June 14, 2024.
 If you're an existing customer looking for information about these models, see [Legacy models](./legacy-models.md).

 | Model | Deprecation date | Retirement date | Suggested replacement |
@@ -335,7 +335,7 @@ These models can only be used with Embedding API requests.

 ### Assistants (Preview)

-For Assistants you need a combination of a supported model, and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](./provisioned-throughput.md). The listed models and regions can be used with both Assistants v1 and v2.
+For Assistants you need a combination of a supported model, and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](./provisioned-throughput.md). The listed models and regions can be used with both Assistants v1 and v2. You can use [global standard models](#global-standard-model-availability) if they are supported in the regions listed below.
articles/ai-services/openai/faq.yml
Lines changed: 2 additions & 2 deletions
@@ -7,7 +7,7 @@ metadata:
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: faq
-ms.date: 08/06/2024
+ms.date: 08/13/2024
 ms.author: mbullwin
 author: mrbullwinkle
 title: Azure OpenAI Service frequently asked questions
@@ -270,7 +270,7 @@ sections:
 answer: |
 * Inference cost (input and output) of the base model you're using for each Assistant (for example gpt-4-0125). If you've created multiple Assistants, you will be charged for the base model attached to each Assistant.
 * If you've enabled the Code Interpreter tool. For example if your assistant calls Code Interpreter simultaneously in two different threads, this would create two Code Interpreter sessions, each of which would be charged. Each session is active by default for one hour, which means that you would only pay this fee once if your user keeps giving instructions to Code Interpreter in the same thread for up to one hour.
-* File search is currently not billed.
+* File search is billed based on the vector storage used.

 For more information, see the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/).
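The Code Interpreter rule in the answer above (one session per thread, active for one hour by default) can be sketched as a small counting function. This is only an illustration under stated assumptions: the function name is hypothetical, the one-hour window is assumed to run from the first call of a session, and a call made exactly one hour later is assumed to start a new session. The actual billing logic is defined by the service and the pricing page, not by this code.

```python
from datetime import datetime, timedelta
from typing import Dict, Iterable, List, Optional, Tuple

# Default session lifetime stated in the FAQ answer.
SESSION_LENGTH = timedelta(hours=1)


def count_code_interpreter_sessions(calls: Iterable[Tuple[str, datetime]]) -> int:
    """Count sessions implied by (thread_id, call_time) pairs.

    Assumption: each thread has its own session, and a call opens a new
    session when the thread's current session started an hour or more ago.
    """
    by_thread: Dict[str, List[datetime]] = {}
    for thread_id, when in calls:
        by_thread.setdefault(thread_id, []).append(when)

    sessions = 0
    for times in by_thread.values():
        session_start: Optional[datetime] = None
        for when in sorted(times):
            if session_start is None or when - session_start >= SESSION_LENGTH:
                session_start = when  # a new billable session begins here
                sessions += 1
    return sessions
```

For example, two threads calling Code Interpreter at the same moment count as two sessions, while a follow-up call 30 minutes later in one of those threads adds none.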