You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/concept-plan-manage-cost.md
+9-8Lines changed: 9 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -44,7 +44,7 @@ The following screenshot shows the cost estimation by using the calculator:
44
44
45
45
As you add new resources to your workspace, return to this calculator and add the same resource here to update your cost estimates.
46
46
47
-
While the Enterprise edition is in preview, there is no ML surcharge. When the Enterprise edition becomes generally available, it will have a machine learning surcharge (for training and inferencing). For more details,[Azure Machine Learning pricing](https://azure.microsoft.com/pricing/details/machine-learning/).
47
+
While the Enterprise edition is in preview, there is no ML surcharge. When the Enterprise edition becomes generally available, it will have a surcharge (for training and inferencing). For more information, see[Azure Machine Learning pricing](https://azure.microsoft.com/pricing/details/machine-learning/).
48
48
49
49
## Get cost alerts
50
50
@@ -54,26 +54,27 @@ Create [budgets](../cost-management/tutorial-acm-create-budgets.md) to manage co
54
54
55
55
As you use resources with Azure Machine Learning, you incur costs. Azure resource usage unit costs vary by time intervals (seconds, minutes, hours, and days) or by request unit usage. As soon as usage of Azure Machine Learning starts, costs are incurred. View these costs in the [cost analysis](../cost-management/quick-acm-cost-analysis.md) pane in the Azure portal.
56
56
57
-
View costs in graphs and tables for different time intervals. Some examples are by day, current, prior month, and year. Also view costs against budgets and forecasted costs. Switching to longer views over time helps you identify spending trends and see where overspending might have occurred. If you've created budgets, see where they exceeded.
57
+
You can view costs in graphs and tables for different time intervals. You can also view costs against budgets and forecasted costs. Switching to longer views over time helps identify spending trends and see where overspending might have occurred. If you've created budgets, see where they exceeded.
58
58
59
59
You won't see a separate service area for Machine Learning. Instead you'll see the various resources you've added to your Machine Learning workspaces.
60
60
61
61
## Use Azure Machine Learning compute cluster (AmlCompute)
62
62
63
63
With constantly changing data, you need fast and streamlined model training and retraining to maintain accurate models. However, continuous training comes at a cost, especially for deep learning models on GPUs.
64
64
65
-
Azure Machine Learning users can use the managed Azure Machine Learning compute cluster, also called AmlCompute. AmlCompute supports a variety of GPU and CPU options. The AmlCompute is internally hosted on behalf of your subscription by Azure Machine Learning, but provides the same enterprise grade security, compliance and governance at Azure IaaS cloud scale.
65
+
Azure Machine Learning users can use the managed Azure Machine Learning compute cluster, also called AmlCompute. AmlCompute supports a variety of GPU and CPU options. The AmlCompute is internally hosted on behalf of your subscription by Azure Machine Learning. It provides the same enterprise grade security, compliance and governance at Azure IaaS cloud scale.
66
66
67
67
Because these compute pools are inside of Azure's IaaS infrastructure, you can deploy, scale, and manage your training with the same security and compliance requirements as the rest of your infrastructure. These deployments occur in your subscription and obey your governance rules. Learn more about [Azure Machine Learning Compute](how-to-set-up-training-targets.md#amlcompute).
68
68
69
69
## Configure training clusters for autoscaling
70
70
71
-
Autoscaling clusters based on the requirements of your workload helps reduce your costs so you only use what you need.
72
-
AmlCompute clusters are designed to autoscale dynamically based on the requirements of your workload. The cluster can be scaled up to the maximum number of nodes provisioned and within the quota designated for the subscription. As each run completes, the cluster will release nodes and autoscale to your designated minimum node count.
71
+
Autoscaling clusters based on the requirements of your workload helps reduce your costs so you only use what you need.
72
+
73
+
AmlCompute clusters are designed to scale dynamically based on your workload. The cluster can be scaled up to the maximum number of nodes you configure. As each run completes, the cluster will release nodes and scale to your configured minimum node count.
In addition to setting the minimum and maximum number of nodes, tweak the amount of time the node is idle before scale down. By default, idle time before scale down is set to 120 seconds.
77
+
You can also configure the amount of time the node is idle before scale down. By default, idle time before scale down is set to 120 seconds.
77
78
78
79
+ If you perform less iterative experimentation, reduce this time to save costs.
79
80
+ If you perform highly iterative dev/test experimentation, you might need to increase the time so you aren't paying for constant scaling up and down after each change to your training script or environment.
@@ -103,7 +104,7 @@ Here are a few options that you have:
103
104
104
105
## Use low-priority VMs
105
106
106
-
Azure allows you to use excess unutilized capacity as Low-Priority VMs across virtual machine scale sets, Batch, and the Machine Learning service. These allocations are pre-emptible but come at a reduced price compared to dedicated VMs. In general, we recommend using Low-Priority VMs for Batch workloads or where interruptions are recoverable either through resubmits (for Batch Inferencing) or through restarts (for deep learning training with checkpointing).
107
+
Azure allows you to use excess unutilized capacity as Low-Priority VMs across virtual machine scale sets, Batch, and the Machine Learning service. These allocations are pre-emptible but come at a reduced price compared to dedicated VMs. In general, we recommend using Low-Priority VMs for Batch workloads. You should also use them where interruptions are recoverable either through resubmits (for Batch Inferencing) or through restarts (for deep learning training with checkpointing).
107
108
108
109
Low-Priority VMs have a single quota separate from the dedicated quota value, which is by VM family. Learn [more about AmlCompute quotas](how-to-manage-quotas.md).
109
110
@@ -131,7 +132,7 @@ Set the priority of your VM in any of these ways:
131
132
132
133
Azure Reserved VM Instance provides another way to get huge savings on compute resources by committing to one-year or three-year terms. These discounts range up to 72% of the pay-as-you-go prices and are applied directly to your monthly Azure bill.
133
134
134
-
Azure Machine Learning Compute supports reserved instances inherently. So ifyou have purchased a one-year or three-year reserved instance we will automatically apply that reserved instance discount against the managed compute that isused within Azure Machine Learning without requiringany additional setup from your end.
135
+
Azure Machine Learning Compute supports reserved instances inherently. If you have purchased a one-year or three-year reserved instance we will automatically apply that reserved instance discount against the managed compute used within Azure Machine Learning. This does not requireany additional setup from your end.
0 commit comments