articles/ai-services/openai/how-to/quota.md
5 additions & 7 deletions
@@ -44,13 +44,11 @@ The flexibility to distribute TPM globally within a subscription and region has
 
 When you create a model deployment, you have the option to assign Tokens-Per-Minute (TPM) to that deployment. TPM can be modified in increments of 1,000, and will map to the TPM and RPM rate limits enforced on your deployment, as discussed above.
 
-To create a new deployment from within the Azure AI Studio under **Management** select **Deployments** > **Create new deployment**.
-
-The option to set the TPM is under the **Advanced options** drop-down:
+To create a new deployment from within the Azure AI Studio under **Shared Resources** select **Deployments** > **Deploy model** > **Deploy base model** > **Select Model** > **Confirm**.
 
 :::image type="content" source="../media/quota/deployment.png" alt-text="Screenshot of the deployment UI of Azure AI Studio" lightbox="../media/quota/deployment.png":::
 
-Post deployment you can adjust your TPM allocation by selecting **Edit deployment** under **Management** > **Deployments** in Azure AI Studio. You can also modify this selection within the new quota management experience under **Management** > **Quotas**.
+Post deployment you can adjust your TPM allocation by selecting **Edit** under **Shared resources** > **Deployments** in Azure OpenAI Studio. You can also modify this selection within the new quota management experience under **Management** > **Quotas**.
 
 > [!IMPORTANT]
 > Quotas and limits are subject to change, for the most up-date-information consult our [quotas and limits article](../quotas-limits.md).
@@ -70,10 +68,10 @@ For an all up view of your quota allocations across deployments in a given regio
 
 :::image type="content" source="../media/quota/quota.png" alt-text="Screenshot of the quota UI of Azure AI Studio" lightbox="../media/quota/quota.png":::
 
-- **Quota Name**: There's one quota value per region for each model type. The quota covers all versions of that model. The quota name can be expanded in the UI to show the deployments that are using the quota.
 - **Deployment**: Model deployments divided by model class.
-- **Usage/Limit**: For the quota name, this shows how much quota is used by deployments and the total quota approved for this subscription and region. This amount of quota used is also represented in the bar graph.
-- **Request Quota**: The icon in this field navigates to a form where requests to increase quota can be submitted.
+- **Quota type**: There's one quota value per region for each model type. The quota covers all versions of that model.
+- **Quota allocation**: For the quota name, this shows how much quota is used by deployments and the total quota approved for this subscription and region. This amount of quota used is also represented in the bar graph.
+- **Request Quota**: The icon navigates to a form where requests to increase quota can be submitted.