Skip to content

Commit 2c0a45d

Browse files
authored
Merge pull request #284712 from MicrosoftDocs/main
8/14/2024 PM Publish
2 parents fecd532 + 62277f6 commit 2c0a45d

File tree

59 files changed

+542
-380
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

59 files changed

+542
-380
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 22 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the model deprecations and retirements in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/08/2024
7+
ms.date: 08/14/2024
88
ms.custom:
99
manager: nitinme
1010
author: mrbullwinkle
@@ -74,9 +74,11 @@ Azure OpenAI notifies those who are members of the following roles for each subs
7474

7575
## How to get ready for model retirements and version upgrades
7676

77-
To prepare for model retirements and version upgrades, we recommend that customers evaluate their applications with the new models and versions and evaluate their behavior. We also recommend that customers update their applications to use the new models and versions before the retirement date.
77+
To prepare for model retirements and version upgrades, we recommend that customers test their applications with the new models and versions and evaluate their behavior. We also recommend that customers update their applications to use the new models and versions before the retirement date.
7878

79-
For more information, see [How to upgrade to a new model or version](./model-versions.md).
79+
For more information on the model evaluation process, see the [Getting started with model evaluation guide](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/how-to-evaluate-amp-upgrade-model-versions-in-the-azure-openai/ba-p/4218880).
80+
81+
For information on the model upgrade process, see [How to upgrade to a new model or version](./model-versions.md).
8082

8183
## Current models
8284

@@ -87,22 +89,22 @@ For more information, see [How to upgrade to a new model or version](./model-ver
8789
8890
These models are currently available for use in Azure OpenAI Service.
8991

90-
| Model | Version | Retirement date |
91-
| ---- | ---- | ---- |
92-
| `gpt-35-turbo` | 0301 | No earlier than October 1, 2024 |
93-
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | November 1, 2024 |
94-
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 |
95-
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 |
96-
| `gpt-4`<br>`gpt-4-32k` | 0314 | **Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |
97-
| `gpt-4`<br>`gpt-4-32k` | 0613 | **Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 |
98-
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** |
99-
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** |
100-
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** |
101-
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 |
102-
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 |
103-
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 |
104-
| `text-embedding-3-small` | | No earlier than Feb 2, 2025 |
105-
| `text-embedding-3-large` | | No earlier than Feb 2, 2025 |
92+
| Model | Version | Retirement date | Suggested replacement |
93+
| ---- | ---- | ---- | --- |
94+
| `gpt-35-turbo` | 0301 | No earlier than October 1, 2024 | `gpt-4o-mini` |
95+
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | November 1, 2024 | `gpt-4o-mini` |
96+
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 | `gpt-4o-mini` |
97+
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 | `gpt-4o-mini` |
98+
| `gpt-4`<br>`gpt-4-32k` | 0314 | **Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
99+
| `gpt-4`<br>`gpt-4-32k` | 0613 | **Deprecation:** October 1, 2024 <br> **Retirement:** June 6, 2025 | `gpt-4o` |
100+
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
101+
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o` |
102+
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on November 15, 2024, or later **<sup>1</sup>** | `gpt-4o`|
103+
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 | |
104+
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
105+
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
106+
| `text-embedding-3-small` | | No earlier than Feb 2, 2025 | |
107+
| `text-embedding-3-large` | | No earlier than Feb 2, 2025 | |
106108

107109
**<sup>1</sup>** We will notify all customers with these preview deployments at least 30 days before the start of the upgrades. We will publish an upgrade schedule detailing the order of regions and model versions that we will follow during the upgrades, and link to that schedule from here.
108110

@@ -114,7 +116,7 @@ These models were deprecated on July 6, 2023 and were retired on June 14, 2024.
114116
If you're an existing customer looking for information about these models, see [Legacy models](./legacy-models.md).
115117

116118
| Model | Deprecation date | Retirement date | Suggested replacement |
117-
| --------- | --------------------- | ------------------- | -------------------- |
119+
| --------- | --------------------- | ------------------- | -------------------- |
118120
| ada | July 6, 2023 | June 14, 2024 | babbage-002 |
119121
| babbage | July 6, 2023 | June 14, 2024 | babbage-002 |
120122
| curie | July 6, 2023 | June 14, 2024 | davinci-002 |

articles/ai-services/openai/concepts/models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/07/2024
7+
ms.date: 08/14/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -335,7 +335,7 @@ These models can only be used with Embedding API requests.
335335

336336
### Assistants (Preview)
337337

338-
For Assistants you need a combination of a supported model, and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](./provisioned-throughput.md). The listed models and regions can be used with both Assistants v1 and v2.
338+
For Assistants you need a combination of a supported model, and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](./provisioned-throughput.md). The listed models and regions can be used with both Assistants v1 and v2. You can use [global standard models](#global-standard-model-availability) if they are supported in the regions listed below.
339339

340340
| Region | `gpt-35-turbo (0613)` | `gpt-35-turbo (1106)`| `fine tuned gpt-3.5-turbo-0125` | `gpt-4 (0613)` | `gpt-4 (1106)` | `gpt-4 (0125)` | `gpt-4o (2024-05-13)` | `gpt-4o-mini (2024-07-18)` |
341341
|-----|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|

articles/ai-services/openai/faq.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ metadata:
77
manager: nitinme
88
ms.service: azure-ai-openai
99
ms.topic: faq
10-
ms.date: 08/06/2024
10+
ms.date: 08/13/2024
1111
ms.author: mbullwin
1212
author: mrbullwinkle
1313
title: Azure OpenAI Service frequently asked questions
@@ -270,7 +270,7 @@ sections:
270270
answer: |
271271
* Inference cost (input and output) of the base model you're using for each Assistant (for example gpt-4-0125). If you've created multiple Assistants, you will be charged for the base model attached to each Assistant.
272272
* If you've enabled the Code Interpreter tool. For example if your assistant calls Code Interpreter simultaneously in two different threads, this would create two Code Interpreter sessions, each of which would be charged. Each session is active by default for one hour, which means that you would only pay this fee once if your user keeps giving instructions to Code Interpreter in the same thread for up to one hour.
273-
* File search is currently not billed.
273+
* File search is billed based on the vector storage used.
274274
275275
For more information, see the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/).
276276
- question: |

0 commit comments

Comments
 (0)