Skip to content

Commit 626ff85

Browse files
committed
update quota information in several online endpoint articles
1 parent 50cb788 commit 626ff85

File tree

3 files changed

+14
-3
lines changed

3 files changed

+14
-3
lines changed

articles/machine-learning/concept-endpoints-online.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -140,9 +140,9 @@ To learn how to deploy online endpoints using the CLI, SDK, studio, and ARM temp
140140

141141
[!INCLUDE [quota-allocation-online-deployment](includes/quota-allocation-online-deployment.md)]
142142

143-
Azure Machine Learning provides a [shared quota](how-to-manage-quotas.md#azure-machine-learning-shared-quota) pool from which all users can access quota to perform testing for a limited time. For some models in the model catalog (such as Llama models), when you use the studio for model deployment to a managed online endpoint, Azure Machine Learning allows you to access this shared quota for a short time.
143+
[!INCLUDE [machine-learning-shared-quota](includes/machine-learning-shared-quota.md)]
144144

145-
For models in the model catalog, such as _Llama-2-70b_ or _Llama-2-70b-chat_, you must have an [Enterprise Agreement subscription](../cost-management-billing/manage/create-enterprise-subscription.md) before you can deploy using the shared quota. For more information on how to use the shared quota for online endpoint deployment, see [How to deploy foundation models using the studio](how-to-use-foundation-models.md#shared-quota).
145+
To deploy Llama-2, Phi, Nemotron, Mistral, Dolly, and Deci-DeciLM models from the model catalog via the shared quota, you must have an [Enterprise Agreement subscription](../cost-management-billing/manage/create-enterprise-subscription.md). For more information on how to use the shared quota for online endpoint deployment, see [How to deploy foundation models using the studio](how-to-use-foundation-models.md#shared-quota).
146146

147147
For more information on quotas and limits for resources in Azure Machine Learning, see [Manage and increase quotas and limits for resources with Azure Machine Learning](how-to-manage-quotas.md).
148148

articles/machine-learning/how-to-deploy-online-endpoints.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -71,7 +71,8 @@ Before following the steps in this article, make sure you have the following pre
7171

7272
* Ensure that you have enough virtual machine (VM) quota allocated for deployment. Azure Machine Learning reserves 20% of your compute resources for performing upgrades on some VM SKUs. For example, if you request 10 instances in a deployment, you must have a quota for 12 for each number of cores for the VM SKU. Failure to account for the extra compute resources results in an error. There are some VM SKUs that are exempt from the extra quota reservation. For more information on quota allocation, see [virtual machine quota allocation for deployment](how-to-manage-quotas.md#virtual-machine-quota-allocation-for-deployment).
7373

74-
* Alternatively, you could use quota from Azure Machine Learning's shared quota pool for a limited time. Users can access quota from this pool to perform testing for a limited time. When you use the studio to deploy Llama-2, Phi, Nemotron, Mistral, Dolly, and Deci-DeciLM models from the model catalog to a managed online endpoint, Azure Machine Learning allows you to access its shared quota pool for a short time so that you can perform testing. For more information on the shared quota pool, see [Azure Machine Learning shared quota](how-to-manage-quotas.md#azure-machine-learning-shared-quota).
74+
* Alternatively, you could use quota from Azure Machine Learning's shared quota pool for a limited time.
75+
[!INCLUDE [machine-learning-shared-quota](includes/machine-learning-shared-quota.md)]
7576

7677

7778
## Prepare your system
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
---
2+
author: msakande
3+
ms.service: machine-learning
4+
ms.topic: include
5+
ms.date: 08/14/2024
6+
ms.author: mopeakande
7+
---
8+
9+
Azure Machine Learning provides a shared quota pool from which users across various regions can access quota to perform testing for a limited time, depending upon availability.
10+
When you use the studio to deploy Llama-2, Phi, Nemotron, Mistral, Dolly, and Deci-DeciLM models from the model catalog to a managed online endpoint, Azure Machine Learning allows you to access its shared quota pool for a short time so that you can perform testing. For more information on the shared quota pool, see [Azure Machine Learning shared quota](how-to-manage-quotas.md#azure-machine-learning-shared-quota).

0 commit comments

Comments
 (0)