Commit 6bf11c6

Merge pull request #385 from mrbullwinkle/mrb_09_19_2024_quota
[Azure OpenAI] Quota UX updates
2 parents 5c1ebda + 395f8da commit 6bf11c6

3 files changed: +5 -7 lines changed

articles/ai-services/openai/how-to/quota.md

Lines changed: 5 additions & 7 deletions
@@ -44,13 +44,11 @@ The flexibility to distribute TPM globally within a subscription and region has
 
 When you create a model deployment, you have the option to assign Tokens-Per-Minute (TPM) to that deployment. TPM can be modified in increments of 1,000, and will map to the TPM and RPM rate limits enforced on your deployment, as discussed above.
 
-To create a new deployment from within the Azure AI Studio under **Management** select **Deployments** > **Create new deployment**.
-
-The option to set the TPM is under the **Advanced options** drop-down:
+To create a new deployment from within the Azure AI Studio under **Shared Resources** select **Deployments** > **Deploy model** > **Deploy base model** > **Select Model** > **Confirm**.
 
 :::image type="content" source="../media/quota/deployment.png" alt-text="Screenshot of the deployment UI of Azure AI Studio" lightbox="../media/quota/deployment.png":::
 
-Post deployment you can adjust your TPM allocation by selecting **Edit deployment** under **Management** > **Deployments** in Azure AI Studio. You can also modify this selection within the new quota management experience under **Management** > **Quotas**.
+Post deployment you can adjust your TPM allocation by selecting **Edit** under **Shared resources** > **Deployments** in Azure OpenAI Studio. You can also modify this selection within the new quota management experience under **Management** > **Quotas**.
 
 > [!IMPORTANT]
 > Quotas and limits are subject to change, for the most up-date-information consult our [quotas and limits article](../quotas-limits.md).
@@ -70,10 +68,10 @@ For an all up view of your quota allocations across deployments in a given regio
 
 :::image type="content" source="../media/quota/quota.png" alt-text="Screenshot of the quota UI of Azure AI Studio" lightbox="../media/quota/quota.png":::
 
-- **Quota Name**: There's one quota value per region for each model type. The quota covers all versions of that model. The quota name can be expanded in the UI to show the deployments that are using the quota.
 - **Deployment**: Model deployments divided by model class.
-- **Usage/Limit**: For the quota name, this shows how much quota is used by deployments and the total quota approved for this subscription and region. This amount of quota used is also represented in the bar graph.
-- **Request Quota**: The icon in this field navigates to a form where requests to increase quota can be submitted.
+- **Quota type**: There's one quota value per region for each model type. The quota covers all versions of that model.
+- **Quota allocation**: For the quota name, this shows how much quota is used by deployments and the total quota approved for this subscription and region. This amount of quota used is also represented in the bar graph.
+- **Request Quota**: The icon navigates to a form where requests to increase quota can be submitted.
 
 ## Migrating existing deployments
 
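
Illustrative note, not part of the commit: the changed text says TPM is assigned in increments of 1,000 and that the quota view shows how much quota deployments use against the total approved for a subscription and region. The bookkeeping this implies might be sketched in Python as below. The `Deployment` class, the function names, and the sample numbers are all hypothetical, and nothing here calls an Azure API.

```python
from dataclasses import dataclass

# From the article text: TPM is modified in increments of 1,000.
TPM_INCREMENT = 1_000


@dataclass
class Deployment:
    """Hypothetical record of one model deployment."""
    name: str
    tpm: int  # Tokens-Per-Minute assigned to this deployment


def validate_tpm(tpm: int) -> int:
    """Return tpm unchanged if it is a positive multiple of 1,000, else raise."""
    if tpm <= 0 or tpm % TPM_INCREMENT != 0:
        raise ValueError(
            f"TPM must be a positive multiple of {TPM_INCREMENT}, got {tpm}"
        )
    return tpm


def quota_allocation(deployments: list[Deployment], limit_tpm: int) -> tuple[int, int]:
    """Return (used, remaining) TPM for one quota type in one region.

    `limit_tpm` stands for the total quota approved for the subscription
    and region (an assumed input; the real value comes from the quota UI).
    """
    used = sum(validate_tpm(d.tpm) for d in deployments)
    if used > limit_tpm:
        raise ValueError(f"Allocated {used} TPM exceeds the limit of {limit_tpm} TPM")
    return used, limit_tpm - used


if __name__ == "__main__":
    # Hypothetical deployments sharing one regional quota.
    deployments = [Deployment("chat-prod", 30_000), Deployment("chat-dev", 50_000)]
    used, remaining = quota_allocation(deployments, limit_tpm=240_000)
    print(f"used={used} remaining={remaining}")
```

Validating each allocation before summing keeps an invalid value (for example 1,500 TPM) from silently skewing the usage total.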

2 binary files changed (27.6 KB and 107 KB); contents not shown.
