Skip to content

Commit 3d10ef5

Browse files
committed
update info on RBAC prereqs
1 parent 98b6db6 commit 3d10ef5

File tree

1 file changed

+7
-1
lines changed
  • articles/ai-services/openai/how-to

1 file changed

+7
-1
lines changed

articles/ai-services/openai/how-to/quota.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,14 +8,20 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: openai
1010
ms.topic: how-to
11-
ms.date: 07/18/2023
11+
ms.date: 07/20/2023
1212
ms.author: mbullwin
1313
---
1414

1515
# Manage Azure OpenAI Service quota
1616

1717
Quota provides the flexibility to actively manage the allocation of rate limits across the deployments within your subscription. This article walks through the process of managing your Azure OpenAI quota.
1818

19+
## Prerequisites
20+
21+
**Cognitive Services Usages Reader** - This role provides the minimal access necessary to properly view quota usage.
22+
23+
This role can be found in the Azure portal under **Subscriptions** > **Access control (IAM)** > **Add role assignment** > search for **Cognitive Services Usages Reader**.
24+
1925
## Introduction to quota
2026

2127
Azure OpenAI's quota feature enables assignment of rate limits to your deployments, up-to a global limit called your “quota.” Quota is assigned to your subscription on a per-region, per-model basis in units of **Tokens-per-Minute (TPM)**. When you onboard a subscription to Azure OpenAI, you'll receive default quota for most available models. Then, you'll assign TPM to each deployment as it is created, and the available quota for that model will be reduced by that amount. You can continue to create deployments and assign them TPM until you reach your quota limit. Once that happens, you can only create new deployments of that model by reducing the TPM assigned to other deployments of the same model (thus freeing TPM for use), or by requesting and being approved for a model quota increase in the desired region.

0 commit comments

Comments
 (0)