Commit 264d2cf

update
1 parent 91fd444 commit 264d2cf

1 file changed

+5
-5
lines changed

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 5 additions & 5 deletions
@@ -3,7 +3,7 @@ title: Azure OpenAI Service provisioned throughput
 description: Learn about provisioned throughput and Azure OpenAI.
 ms.service: azure-ai-openai
 ms.topic: conceptual
-ms.date: 07/32/2024
+ms.date: 07/23/2024
 manager: nitinme
 author: mrbullwinkle #ChrisHMSFT
 ms.author: mbullwin #chrhoder
@@ -79,7 +79,7 @@ Provisioned quota is granted on a per subscription/region basis, and unlike Stan

 The new quota shows up in the AI Studio and Azure OpenAI Studio as a quota item named **Provisioned Managed Throughput Unit**. In the Studio Quota pane, expanding the quota item will show the deployments contributing to usage of the quota.

-:::image type="content" source="../media/provisioned/quota.png" alt-text="Screenshot of new quota UI for Azure OpenAI provisioned." lightbox="../media/provisioned/quota.png":::
+:::image type="content" source="../media/provisioned/quota.png" alt-text="Screenshot of quota UI for Azure OpenAI provisioned." lightbox="../media/provisioned/quota.png":::

 ## Capacity transparency and quota definitions

@@ -96,17 +96,17 @@ To assist users to find the capacity needed for their deployments, customers wil

 In AI Studio and Azure OpenAI Studio, the deployment experience will identify when a region lacks the capacity to support the desired model, version and number of PTUs, and will direct the user to a select an alternative region when needed.

-:::image type="content" source="../media/provisioned/check-capacity.png" alt-text="Screenshot of new quota UI for Azure OpenAI provisioned." lightbox="./media/provisioned/check-capacity.png":::
+:::image type="content" source="../media/provisioned/check-capacity.png" alt-text="Screenshot of the check capacity experience for quota for Azure OpenAI provisioned." lightbox="../media/provisioned/check-capacity.png":::

 Details on the new deployment experience can be found in the updated Azure OpenAI [provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md).

-The new [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP) can also be used to programmatically identify the maximum sized deployment of a specified model that can be created in each region based on the availability of both quota in the subscription and service capacity in the region.
+The new [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP&preserve-view=true) can also be used to programmatically identify the maximum sized deployment of a specified model that can be created in each region based on the availability of both quota in the subscription and service capacity in the region.

 If an acceptable region isn't available to support the desire model, version and/or PTUs, customers can also try the following steps:

 - Attempt the deployment with a smaller number of PTUs.
 - Attempt the deployment at a different time. Capacity availability changes dynamically based on customer demand and more capacity may become available later.
-- Ensure that quota is available in all acceptable regions. The [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP) and Studio experience consider quota availability in returning alternative regions for creating a deployment.
+- Ensure that quota is available in all acceptable regions. The [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP&preserve-view=true) and Studio experience consider quota availability in returning alternative regions for creating a deployment.

 ### Determining the number of PTUs needed for a workload

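The changed article text says the model capacities API can report the largest deployment that can be created in each region. The sketch below is not part of this commit. It illustrates one way a caller might build the list request and reduce a response to a per-region maximum. The URL shape, the query parameter names, the response field names (`value`, `location`, `properties.skuName`, `properties.availableCapacity`), and the `ProvisionedManaged` SKU name are assumptions to check against the linked REST reference. The sample payload is made up, and the helper names are hypothetical.

```python
from urllib.parse import urlencode

# Assumed Azure Resource Manager host; api-version taken from the link in the diff.
ARM = "https://management.azure.com"
API_VERSION = "2024-04-01-preview"


def model_capacities_url(subscription_id, model_name, model_version,
                         model_format="OpenAI"):
    """Build the (assumed) list URL for the model capacities API."""
    query = urlencode({
        "api-version": API_VERSION,
        "modelFormat": model_format,
        "modelName": model_name,
        "modelVersion": model_version,
    })
    return (f"{ARM}/subscriptions/{subscription_id}"
            f"/providers/Microsoft.CognitiveServices/modelCapacities?{query}")


def max_capacity_by_region(payload, sku_name="ProvisionedManaged"):
    """Return {region: largest availableCapacity} for one SKU.

    Field names are assumptions about the response shape.
    Regions reporting zero capacity are omitted.
    """
    best = {}
    for item in payload.get("value", []):
        props = item.get("properties", {})
        if props.get("skuName") != sku_name:
            continue
        region = item.get("location")
        capacity = props.get("availableCapacity", 0)
        if capacity > best.get(region, 0):
            best[region] = capacity
    return best


# Made-up sample payload, for illustration only.
sample = {"value": [
    {"location": "eastus",
     "properties": {"skuName": "ProvisionedManaged", "availableCapacity": 300}},
    {"location": "westus",
     "properties": {"skuName": "ProvisionedManaged", "availableCapacity": 100}},
    {"location": "westus",
     "properties": {"skuName": "Standard", "availableCapacity": 900}},
    {"location": "northeurope",
     "properties": {"skuName": "ProvisionedManaged", "availableCapacity": 0}},
]}

capacities = max_capacity_by_region(sample)
# Regions that could host a hypothetical 200-PTU deployment.
fits = sorted(r for r, c in capacities.items() if c >= 200)
```

A real call would send an authenticated GET to the built URL. Authentication and paging are left out of this sketch.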