Skip to content

Commit 5925257

Browse files
committed
Merge branch 'main' of github.com:MicrosoftDocs/azure-ai-docs-pr into 245675-data-aoai-sec
2 parents ebf5636 + 8415bc2 commit 5925257

17 files changed

+170
-42
lines changed

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 9 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the model deprecations and retirements in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 09/09/2024
7+
ms.date: 09/12/2024
88
ms.custom:
99
manager: nitinme
1010
author: mrbullwinkle
@@ -91,9 +91,9 @@ These models are currently available for use in Azure OpenAI Service.
9191

9292
| Model | Version | Retirement date | Suggested replacements |
9393
| ---- | ---- | ---- | --- |
94-
| `gpt-35-turbo` | 0301 | January 27, 2025<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
95-
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | January 27, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
96-
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
94+
| `gpt-35-turbo` | 0301 | January 27, 2025<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
95+
| `gpt-35-turbo`<br>`gpt-35-turbo-16k` | 0613 | January 27, 2025 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
96+
| `gpt-35-turbo` | 1106 | No earlier than Nov 17, 2024 <br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. | `gpt-35-turbo` (0125) <br><br> `gpt-4o-mini` |
9797
| `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 | `gpt-4o-mini` |
9898
| `gpt-4`<br>`gpt-4-32k` | 0314 | June 6, 2025 | `gpt-4o` |
9999
| `gpt-4`<br>`gpt-4-32k` | 0613 | June 6, 2025 | `gpt-4o` |
@@ -115,7 +115,7 @@ These models are currently available for use in Azure OpenAI Service.
115115

116116
| Model | Current default version | New default version | Default upgrade date |
117117
|---|---|---|---|
118-
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 15, 2024.|
118+
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024.|
119119

120120

121121

@@ -158,6 +158,10 @@ If you're an existing customer looking for information about these models, see [
158158

159159
## Retirement and deprecation history
160160

161+
## September 12, 2024
162+
163+
* `gpt-35-turbo` (0301), (0613), (1106) and `gpt-35-turbo-16k` (0613) auto-update to default upgrade date updated to November 13, 2024.
164+
161165
## September 9, 2024
162166

163167
* `gpt-35-turbo` (0301) and (0613) retirement changed to January 27, 2025.

articles/ai-services/openai/concepts/models.md

Lines changed: 18 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 09/09/2024
7+
ms.date: 09/12/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -26,6 +26,23 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
2626
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
2727
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
2828

29+
## o1-preview and o1-mini models limited access
30+
31+
The Azure OpenAI `o1-preview` and `o1-mini` models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
32+
33+
### Availability
34+
35+
The `o1-preview` and `o1-mini` models are available in the East US2 region for limited access through the [AI Studio](https://ai.azure.com) early access playground. Data processing for the `o1` models may occur in a different region than where they are available for use.
36+
37+
To try the `o1-preview` and `o1-mini` models in the early access playground, **registration is required, and access will be granted based on Microsoft’s eligibility criteria**.
38+
39+
Request access: [limited access model application](https://aka.ms/oai/modelaccess)
40+
41+
Once access has been granted, you will need to:
42+
43+
1. Navigate to https://ai.azure.com/resources and select a resource in the `eastus2` region. If you do not have an Azure OpenAI resource in this region you will need to [create one](https://portal.azure.com/#create/Microsoft.CognitiveServicesOpenAI).
44+
2. Once the `eastus2` Azure OpenAI resource is selected, in the upper left-hand panel under **Playgrounds** select **Early access playground (preview)**.
45+
2946
## GPT-4o and GPT-4 Turbo
3047

3148
GPT-4o integrates text and images in a single model, enabling it to handle multiple data types simultaneously. This multimodal approach enhances accuracy and responsiveness in human-computer interactions. GPT-4o matches GPT-4 Turbo in English text and coding tasks while offering superior performance in non-English languages and vision tasks, setting new benchmarks for AI capabilities.

articles/ai-services/openai/how-to/provisioned-get-started.md

Lines changed: 18 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -75,7 +75,17 @@ After you purchase a commitment on your quota, you can create a deployment. To c
7575

7676
Important things to note:
7777
* The deployment dialog contains a reminder that you can purchase an Azure Reservation for Azure OpenAI Provisioned to obtain a significant discount for a term commitment.
78-
* There is a message that tells you the list, hourly price of the deployment that you would be charged if this deployment is not covered by a reservation. This is a list price that does not include any negotiated discounts for your company.
78+
79+
Once you have entered the deployment settings, click **Confirm Pricing** to continue. A pricing confirmation dialog will appear that will display the list price for the deployment, if you choose to pay for it on an hourly basis, with no Azure Reservation to provide a term discount.
80+
81+
If you are unsure of the costs, cancel the deployment and proceed once you understand the payment model and underlying costs for provisioned deployment. This step may prevent unexpected, high charges on your payment invoice. Resources to educate yourself include:
82+
83+
* [Azure Pricing Portal](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/)
84+
* [Understanding the provisioned throughput purchase model](provisioned-throughput-onboarding.md#understanding-the-provisioned-throughput-purchase-model)
85+
86+
The image below shows the pricing confirmation you will see. The price shown is an example only.
87+
88+
:::image type="content" source="../media/provisioned/confirm-pricing.png" alt-text="Screenshot showing the pricing confirmation screen." lightbox="../media/provisioned/confirm-pricing.png":::
7989

8090
If you wish to create your deployment programmatically, you can do so with the following Azure CLI command. Update the `sku-capacity` with the desired number of provisioned throughput units.
8191

@@ -110,16 +120,17 @@ Things to notice:
110120

111121
Selecting a resource and clicking **Switch resource** will cause the deployment dialog to redisplay using the selected resource. You can then proceed to create your deployment in the new region.
112122

113-
Learn more about the purchase model and how to purchase a reservation:
114-
115-
* [Azure OpenAI provisioned onboarding guide](./provisioned-throughput-onboarding.md)
116-
* [Guide for Azure OpenAI provisioned reservations](../concepts/provisioned-throughput.md)
117-
118123
## Optionally purchase a reservation
119124

120125
Following the creation of your deployment, you might want to purchase a term discount via an Azure Reservation. An Azure Reservation can provide a substantial discount on the hourly rate for users intending to use the deployment beyond a few days.
121126

122-
For more information on purchasing a reservation, see [Save costs with Microsoft Azure OpenAI service Provisioned Reservations](/azure/cost-management-billing/reservations/azure-openai).
127+
For more information on the purchase model and reservations, see:
128+
* [Save costs with Microsoft Azure OpenAI service provisioned reservations](/azure/cost-management-billing/reservations/azure-openai).
129+
* [Azure OpenAI provisioned onboarding guide](./provisioned-throughput-onboarding.md)
130+
* [Guide for Azure OpenAI provisioned reservations](../concepts/provisioned-throughput.md)
131+
132+
> [!IMPORTANT]
133+
> Capacity availability for model deployments is dynamic and changes frequently across regions and models. To prevent you from purchasing a reservation for more PTUs than you can use, create deployments first, and then purchase the Azure Reservation to cover the PTUs you have deployed. This best practice will ensure that you can take full advantage of the reservation discount and prevent you from purchasing a term commitment that you cannot use.
123134
124135
## Make your first inferencing calls
125136
The inferencing code for provisioned deployments is the same a standard deployment type. The following code snippet shows a chat completions call to a GPT-4 model. For your first time using these models programmatically, we recommend starting with our [quickstart guide](../quickstart.md). Our recommendation is to use the OpenAI library with version 1.0 or greater since this includes retry logic within the library.

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Azure OpenAI Service Provisioned Throughput Units (PTU) onboarding
33
description: Learn about provisioned throughput units onboarding and Azure OpenAI.
44
ms.service: azure-ai-openai
55
ms.topic: conceptual
6-
ms.date: 08/07/2024
6+
ms.date: 09/12/2024
77
manager: nitinme
88
author: mrbullwinkle
99
ms.author: mbullwin
@@ -81,6 +81,7 @@ Customers that require long-term usage of provisioned deployments, however, migh
8181
> It is not recommended to scale production deployments according to incoming traffic and pay for them purely on an hourly basis. There are two reasons for this:
8282
> * The cost savings achieved by purchasing an Azure Reservation for Azure OpenAI Provisioned are significant, and it will be less expensive in many cases to maintain a deployment sized for full production volume paid for via a reservation than it would be to scale the deployment with incoming traffic.
8383
> * Having unused provisioned quota (PTUs) does not guarentee that capacity will be available to support increasing the size of the deployment when required. Quota limits the maximum number of PTUs that can be deployed, but it is not a capacity guarantee. Provisioned capacity for each region and modal dynamically changes throughout the day and might not be available when required. As a result, it is recommended to maintain a permanant deployment to cover your traffic needs (paid for via a reservation).
84+
> * Charges for deployments on a deleted resource will continue until the resource is purged. To prevent this, delete a resource’s deployment before deleting the resource. For more information, see [Recover or purge deleted Azure AI services resources](../../recover-purge-resources.md).
8485
8586
## Azure Reservations for Azure OpenAI Provisioned  
8687

@@ -103,9 +104,11 @@ Discounts on top of the hourly usage price can be obtained by purchasing an Azur
103104
* If the size of provisioned deployments within the scope of a reservation exceeds the amount of the reservation, the excess is charged at the hourly rate. For example, if deployments amounting to 250 PTUs exist within the scope of a 200 PTU reservation, 50 PTUs will be charged on an hourly basis until the deployment sizes are reduced to 200 PTUs, or a new reservation is created to cover the remaining 50.
104105

105106
* Reservations guarantee a discounted price for the selected term.  They do not reserve capacity on the service or guarantee that it will be available when a deployment is created. It is highly recommended that customers create deployments prior to purchasing a reservation to prevent from over-purchasing a reservation.
106-
107-
> [!NOTE]
108-
> The Azure role and tenant policy requirements to purchase a reservation are different than those required to create a deployment or Azure OpenAI resource. See Azure OpenAI [Provisioned reservation documentation](https://aka.ms/oai/docs/ptum-reservations) for more details.
107+
108+
> [!IMPORTANT]
109+
> * Capacity availability for model deployments is dynamic and changes frequently across regions and models. To prevent you from purchasing a reservation for more PTUs than you can use, create deployments first, and then purchase the Azure Reservation to cover the PTUs you have deployed. This best practice will ensure that you can take full advantage of the reservation discount and prevent you from purchasing a term commitment that you cannot use.
110+
>
111+
> * The Azure role and tenant policy requirements to purchase a reservation are different than those required to create a deployment or Azure OpenAI resource. Verify authorization to purchase reservations in advance of needing to do so. See Azure OpenAI [Provisioned reservation documentation](https://aka.ms/oai/docs/ptum-reservations) for more details.
109112
110113
## Important: Sizing Azure OpenAI Provisioned Reservations
111114

@@ -119,7 +122,7 @@ To assist customers with purchasing the correct reservation amounts. The total n
119122

120123
Managing Azure Reservations
121124

122-
After a reservation is created, it is a best practice monitor it to ensure it is receiving the usage you are expecting. This may be done via the Azure Reservation Portal or Azure Monitor. Details on these topics and others can be found here:
125+
After a reservation is created, it is a best practice monitor it to ensure it is receiving the usage you are expecting. This can be done via the Azure Reservation Portal or Azure Monitor. Details on these topics and others can be found here:
123126

124127
* [View Azure reservation utilization](/azure/cost-management-billing/reservations/reservation-utilization)
125128
* [View Azure Reservation purchase and refund transactions](/azure/cost-management-billing/reservations/view-purchase-refunds)
90.8 KB
Loading
-28.4 KB
Loading

articles/ai-services/openai/whats-new.md

Lines changed: 44 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,14 +10,57 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: whats-new
13-
ms.date: 9/03/2024
13+
ms.date: 9/12/2024
1414
recommendations: false
1515
---
1616

1717
# What's new in Azure OpenAI Service
1818

1919
This article provides a summary of the latest releases and major documentation updates for Azure OpenAI.
2020

21+
## September 2024
22+
23+
### NEW o1-preview and o1-mini models available for limited access
24+
25+
The Azure OpenAI `o1-preview` and `o1-mini` models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
26+
27+
### Key capabilities of the o1 series
28+
29+
- Complex Code Generation: Capable of generating algorithms and handling advanced coding tasks to support developers.
30+
- Advanced Problem Solving: Ideal for comprehensive brainstorming sessions and addressing multifaceted challenges.
31+
- Complex Document Comparison: Perfect for analyzing contracts, case files, or legal documents to identify subtle differences.
32+
- Instruction Following and Workflow Management: Particularly effective for managing workflows requiring shorter contexts.
33+
34+
### Model variants
35+
36+
- `o1-preview`: `o1-preview` is the more capable of the `o1` series models.
37+
- `o1-mini`: `o1-mini` is the faster and cheaper of the `o1` series models.
38+
39+
Model version: `2024-09-12`
40+
41+
Request access: [limited access model application](https://aka.ms/oai/modelaccess)
42+
43+
### Limitations
44+
45+
The `o1` series models are currently in preview and do not include some features available in other models, such as image understanding and structured outputs which are available in the latest GPT-4o model. For many tasks, the generally available GPT-4o models may still be more suitable.
46+
47+
### Safety
48+
49+
OpenAI has incorporated additional safety measures into the `o1` models, including new techniques to help the models refuse unsafe requests. These advancements make the `o1` series some of the most robust models available.
50+
51+
### Availability
52+
53+
The `o1-preview` and `o1-mini` are available in the East US2 region for limited access through the [AI Studio](https://ai.azure.com) early access playground. Data processing for the `o1` models may occur in a different region than where they are available for use.
54+
55+
To try the `o1-preview` and `o1-mini` models in the early access playground **registration is required, and access will be granted based on Microsoft’s eligibility criteria.**
56+
57+
Request access: [limited access model application](https://aka.ms/oai/modelaccess)
58+
59+
Once access has been granted, you will need to:
60+
61+
1. Navigate to https://ai.azure.com/resources and select a resource in the `eastus2` region. If you do not have an Azure OpenAI resource in this region you will need to [create one](https://portal.azure.com/#create/Microsoft.CognitiveServicesOpenAI).
62+
2. Once the `eastus2` Azure OpenAI resource is selected, in the upper left-hand panel under **Playgrounds** select **Early access playground (preview)**.
63+
2164
## August 2024
2265

2366
### GPT-4o 2024-08-06 structured outputs

articles/ai-services/recover-purge-resources.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-services
88
ms.topic: how-to
9-
ms.date: 11/15/2023
9+
ms.date: 09/12/2024
1010
ms.author: eur
1111
---
1212

@@ -17,7 +17,9 @@ This article provides instructions on how to recover or purge an Azure AI servic
1717
Once you delete a resource, you won't be able to create another one with the same name for 48 hours. To create a resource with the same name, you need to purge the deleted resource.
1818

1919
> [!NOTE]
20-
> The instructions in this article are applicable to both a multi-service resource and a single-service resource. A multi-service resource enables access to multiple Azure AI services using a single key and endpoint. On the other hand, a single-service resource enables access to just that specific Azure AI service for which the resource was created.
20+
> * The instructions in this article are applicable to both a multi-service resource and a single-service resource. A multi-service resource enables access to multiple Azure AI services using a single key and endpoint. On the other hand, a single-service resource enables access to just that specific Azure AI service for which the resource was created.
21+
>
22+
> * Charges for provisioned deployments on a deleted resource will continue until the resource is purged. To prevent this, delete a resource's deployment before deleting the resource.
2123
2224
## Recover a deleted resource
2325

0 commit comments

Comments
 (0)