You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Customer intent: As an admin, I want to understand how to manage and increase quotas and limits for resources with Azure Machine Learning.
14
15
---
15
16
16
17
# Manage and increase quotas and limits for resources with Azure Machine Learning
17
18
18
19
Azure uses quotas and limits to prevent budget overruns due to fraud, and to honor Azure capacity constraints. Consider these limits as you scale for production workloads. In this article, you learn about:
19
20
20
-
> [!div class="checklist"]
21
-
> + Default limits on Azure resources related to [Azure Machine Learning](overview-what-is-azure-machine-learning.md).
22
-
> + Creating workspace-level quotas.
23
-
> + Viewing your quotas and limits.
24
-
> + Requesting quota increases.
21
+
- Default limits on Azure resources related to [Azure Machine Learning](overview-what-is-azure-machine-learning.md).
22
+
- Creating workspace-level quotas.
23
+
- Viewing your quotas and limits.
24
+
- Requesting quota increases.
25
25
26
26
Along with managing quotas and limits, you can learn how to [plan and manage costs for Azure Machine Learning](concept-plan-manage-cost.md) or learn about the [service limits in Azure Machine Learning](resource-limits-capacity.md).
27
27
@@ -83,7 +83,7 @@ To raise the limits for the following items, [Request a quota increase](#request
83
83
* Other resources in this section
84
84
85
85
Available resources:
86
-
+**Dedicated cores per region** have a default limit of 24 to 300, depending on your subscription offer type. You can increase the number of dedicated cores per subscription for each VM family. Specialized VM families like NCv2, NCv3, or ND series start with a default of zero cores. GPUs also default to zero cores.
86
+
+**Dedicated cores per region** have a default limit of 24 to 300, depending on your subscription offer type. You can increase the number of dedicated cores per subscription for each VM family. Specialized VM families like NCv2, NCv3, or ND series start with a default of zero cores. GPUs also default to zero cores.
87
87
88
88
+**Low-priority cores per region** have a default limit of 100 to 3,000, depending on your subscription offer type. The number of low-priority cores per subscription can be increased and is a single value across VM families.
89
89
@@ -121,7 +121,7 @@ You should use the shared quota only for creating temporary test endpoints, not
121
121
Azure Machine Learning online endpoints and batch endpoints have resource limits described in the following table.
122
122
123
123
> [!IMPORTANT]
124
-
> These limits are _regional_, meaning that you can use up to these limits per each region you're using. For example, if your current limit for number of endpoints per subscription is 100, you can create 100 endpoints in the East US region, 100 endpoints in the West US region, and 100 endpoints in each of the other supported regions in a single subscription. Same principle applies to all the other limits.
124
+
> These limits are *regional*, meaning that you can use up to these limits per each region you're using. For example, if your current limit for number of endpoints per subscription is 100, you can create 100 endpoints in the East US region, 100 endpoints in the West US region, and 100 endpoints in each of the other supported regions in a single subscription. Same principle applies to all the other limits.
125
125
126
126
To determine the current usage for an endpoint, [view the metrics](how-to-monitor-online-endpoints.md#metrics).
127
127
@@ -144,11 +144,11 @@ To request an exception from the Azure Machine Learning product team, use the st
144
144
| Total connections active at endpoint level for all deployments | 500 <sup>5</sup> | Yes | Managed online endpoint |
145
145
| Total bandwidth at endpoint level for all deployments | 5 MBPS <sup>5</sup> | Yes | Managed online endpoint |
146
146
147
-
<sup>1</sup> This is a regional limit. For example, if current limit on number of endpoint is 100, you can create 100 endpoints in the East US region, 100 endpoints in the West US region, and 100 endpoints in each of the other supported regions in a single subscription. Same principle applies to all the other limits.
147
+
<sup>1</sup> This is a regional limit. For example, if current limit on number of endpoints is 100, you can create 100 endpoints in the East US region, 100 endpoints in the West US region, and 100 endpoints in each of the other supported regions in a single subscription. Same principle applies to all the other limits.
148
148
149
149
<sup>2</sup> Single dashes like, `my-endpoint-name`, are accepted in endpoint and deployment names.
150
150
151
-
<sup>3</sup> Endpoints and deployments can be of different types, but limits apply to the sum of all types. For example, the sum of managed online endpoints, Kubernetes online endpoint and batch endpoint under each subscription can't exceed 100 per region by default. Similarly, the sum of managed online deployments, Kubernetes online deployments and batch deployments under each subscription can't exceed 500 per region by default.
151
+
<sup>3</sup> Endpoints and deployments can be of different types, but limits apply to the sum of all types. For example, the sum of managed online endpoints, Kubernetes online endpoint and batch endpoint under each subscription can't exceed 100 per region by default. Similarly, the sum of managed online deployments, Kubernetes online deployments, and batch deployments under each subscription can't exceed 500 per region by default.
152
152
153
153
<sup>4</sup> We reserve 20% extra compute resources for performing upgrades. For example, if you request 10 instances in a deployment, you must have a quota for 12. Otherwise, you receive an error. There are some VM SKUs that are exempt from extra quota. For more information on quota allocation, see [virtual machine quota allocation for deployment](#virtual-machine-quota-allocation-for-deployment).
154
154
@@ -231,47 +231,47 @@ To view your quota for various Azure resources like virtual machines, storage, o
231
231
232
232
1. On the left pane, select **All services** and then select **Subscriptions** under the **General** category.
233
233
234
-
2. From the list of subscriptions, select the subscription whose quota you're looking for.
234
+
1. From the list of subscriptions, select the subscription whose quota you're looking for.
235
235
236
-
3. Select **Usage + quotas** to view your current quota limits and usage. Use the filters to select the provider and locations.
236
+
1. Select **Usage + quotas** to view your current quota limits and usage. Use the filters to select the provider and locations.
237
237
238
-
You manage the Azure Machine Learning compute quota on your subscription separately from other Azure quotas:
238
+
You manage the Azure Machine Learning compute quota on your subscription separately from other Azure quotas:
239
239
240
240
1. Go to your **Azure Machine Learning** workspace in the Azure portal.
241
241
242
-
2. On the left pane, in the **Support + troubleshooting** section, select **Usage + quotas** to view your current quota limits and usage.
242
+
1. On the left pane, in the **Support + troubleshooting** section, select **Usage + quotas** to view your current quota limits and usage.
243
243
244
-
[](./media/how-to-manage-quotas/portal-view-quota.png)
244
+
[](./media/how-to-manage-quotas/portal-view-quota.png)
245
245
246
-
3. Select a subscription to view the quota limits. Filter to the region you're interested in.
246
+
1. Select a subscription to view the quota limits. Filter to the region you're interested in.
247
247
248
-
4. You can switch between a subscription-level view and a workspace-level view.
248
+
1. You can switch between a subscription-level view and a workspace-level view.
249
249
250
250
251
251
## Request quota and limit increases
252
252
253
-
VM quota increase is to increase the number of cores per VM family per region. Endpoint limit increase is to increase the endpoint-specific limits per subscription per region. Make sure to choose the right category when you are submitting the quota increase request, as described in the next section.
253
+
VM quota increase is to increase the number of cores per VM family per region. Endpoint limit increase is to increase the endpoint-specific limits per subscription per region. Make sure to choose the right category when you're submitting the quota increase request, as described in the next section.
254
254
255
255
### VM quota increases
256
256
257
257
To raise the limit for Azure Machine Learning VM quota above the default limit, you can request for quota increase from the above **Usage + quotas** view or submit a quota increase request from Azure Machine Learning studio.
258
258
259
259
1. Navigate to the **Usage + quotas** page by following the above instructions. View the current quota limits. Select the SKU for which you'd like to request an increase.
260
260
261
-
[](./media/how-to-manage-quotas/mlstudio-request-quota.png)
261
+
[](./media/how-to-manage-quotas/mlstudio-request-quota.png)
262
262
263
-
2. Provide the quota you'd like to increase and the new limit value. Finally, select __Submit__ to continue.
263
+
1. Provide the quota you'd like to increase and the new limit value. Finally, select **Submit** to continue.
264
264
265
-
[](./media/how-to-manage-quotas/mlstudio-new-quota-limit.png)
265
+
[](./media/how-to-manage-quotas/mlstudio-new-quota-limit.png)
266
266
267
267
### Endpoint limit increases
268
268
269
269
To raise endpoint limit, [open an online customer support request](https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/newsupportrequest/). When requesting for endpoint limit increase, provide the following information:
270
270
271
-
1. When opening the support request, select __Service and subscription limits (quotas)__ as the __Issue type__.
271
+
1. When opening the support request, select **Service and subscription limits (quotas)** as the **Issue type**.
272
272
1. Select the subscription of your choice.
273
-
1. Select __Machine Learning Service: Endpoint Limits__ as the __Quota type__.
274
-
1. On the __Additional details__ tab, you need to provide detailed reasons for the limit increase in order for your request to be processed. Select __Enter details__ and then provide the limit you'd like to increase and the new value for each limit, the reason for the limit increase request, and __location(s)__ where you need the limit increase.
273
+
1. Select **Machine Learning Service: Endpoint Limits** as the **Quota type**.
274
+
1. On the **Additional details** tab, you need to provide detailed reasons for the limit increase in order for your request to be processed. Select **Enter details** and then provide the limit you'd like to increase and the new value for each limit, the reason for the limit increase request, and **location(s)** where you need the limit increase.
275
275
Be sure to add the following information into the reason for limit increase:
276
276
1. Description of your scenario and workload (such as text, image, and so on).
277
277
1. Rationale for the requested increase.
@@ -281,35 +281,35 @@ Be sure to add the following information into the reason for limit increase:
281
281
1. Confirm if you have a benchmark test that indicates the selected VM SKU and the number of instances that would meet your throughput and latency requirement.
282
282
1. Provide the type of the payload and size of a single payload. Network bandwidth should align with the payload size and requests per second.
283
283
1. Provide planned time plan (by when you need increased limits - provide staged plan if possible) and confirm if (1) the cost of running it at that scale is reflected in your budget and (2) the target VM SKUs are approved.
284
-
1. Finally, select __Save and continue__ to continue.
284
+
1. Finally, select **Save and continue** to continue.
285
285
286
-
[](./media/how-to-manage-quotas/quota-details.png)
286
+
[](./media/how-to-manage-quotas/quota-details.png)
287
287
288
-
> [!NOTE]
289
-
> This endpoint limit increase request is different from VM quota increase request. If your request is related to VM quota increase, follow the instructions in the [VM quota increases](#vm-quota-increases) section.
288
+
> [!NOTE]
289
+
> This endpoint limit increase request is different from VM quota increase request. If your request is related to VM quota increase, follow the instructions in the [VM quota increases](#vm-quota-increases) section.
290
290
291
291
### Compute limit increases
292
292
293
293
In order to increase the total compute limit, [open an online customer support request](https://ms.portal.azure.com/#view/Microsoft_Azure_Support/NewSupportRequestV3Blade/callerWorkflowId/5088c408-f627-4398-9aa3-c41cdd93a6eb/callerName/Microsoft_Azure_Support%2FHelpAndSupportOverview.ReactView). Provide the following information:
294
294
295
-
1. When opening the support request, select __Technical__ as the __Issue type__.
295
+
1. When opening the support request, select **Technical** as the **Issue type**.
296
296
1. Select the subscription of your choice
297
-
1. Select __Machine Learning__ as the __Service__.
297
+
1. Select **Machine Learning** as the **Service**.
298
298
1. Select the resource of your choice
299
299
1. In the summary, mention "Increase total compute limits"
300
-
1. Select __Compute Cluster__ as the __Problem type__ and __Cluster does not scale up or is stuck in resizing__ as the __Problem subtype__.
300
+
1. Select **Compute Cluster** as the **Problem type** and **Cluster does not scale up or is stuck in resizing** as the **Problem subtype**.
301
301
302
302
:::image type="content" source="media/how-to-manage-quotas/problem-description.png" alt-text="Screenshot of the problem description tab.":::
303
303
304
-
1. On the __Additional details__ tab, provide the subscription ID, region, new limit (between 500 and 2500) and business justification if you would like to increase the total compute limits in this region.
304
+
1. On the **Additional details** tab, provide the subscription ID, region, new limit (between 500 and 2500) and business justification if you would like to increase the total compute limits in this region.
305
305
306
306
:::image type="content" source="media/how-to-manage-quotas/additional-details.png" alt-text="Screenshot of the additional details tab.":::
307
307
308
-
1. Finally, select __Create__ to create a support request ticket.
308
+
1. Finally, select **Create** to create a support request ticket.
309
309
310
310
311
-
## Next steps
311
+
## Related content
312
312
313
-
+[Plan and manage costs for Azure Machine Learning](concept-plan-manage-cost.md)
314
-
+[Service limits in Azure Machine Learning](resource-limits-capacity.md)
315
-
+[Troubleshooting managed online endpoints deployment and scoring](./how-to-troubleshoot-online-endpoints.md)
313
+
-[Plan and manage costs for Azure Machine Learning](concept-plan-manage-cost.md)
314
+
-[Service limits in Azure Machine Learning](resource-limits-capacity.md)
315
+
-[Troubleshooting managed online endpoints deployment and scoring](./how-to-troubleshoot-online-endpoints.md)
0 commit comments