Skip to content

Commit f095f7f

Browse files
committed
v1 train models
1 parent 1bbea9d commit f095f7f

File tree

1 file changed

+37
-31
lines changed

1 file changed

+37
-31
lines changed

articles/machine-learning/v1/concept-train-machine-learning-model.md

Lines changed: 37 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.author: manashg
99
ms.reviewer: ssalgado
1010
ms.subservice: core
1111
ms.topic: concept-article
12-
ms.date: 08/30/2022
12+
ms.date: 09/10/2025
1313
ms.custom: UpdateFrequency5, devx-track-python, devx-track-azurecli
1414
ms.devlang: azurecli
1515
---
@@ -20,35 +20,36 @@ ms.devlang: azurecli
2020

2121
[!INCLUDE [v1 deprecation](../includes/sdk-v1-deprecation.md)]
2222

23-
Azure Machine Learning provides several ways to train your models, from code-first solutions using the SDK to low-code solutions such as automated machine learning and the visual designer. Use the following list to determine which training method is right for you:
23+
Azure Machine Learning offers multiple ways to train models, including code-first solutions with the SDK and low-code options like automated machine learning and the visual designer. Use the following list to determine which training method fits your needs:
2424

2525
+ [Azure Machine Learning SDK for Python](#python-sdk): The Python SDK provides several ways to train models, each with different capabilities.
2626

2727
| Training method | Description |
2828
| ----- | ----- |
29-
| [Run configuration](#run-configuration) | A **typical way to train models** is to use a training script and job configuration. The job configuration provides the information needed to configure the training environment used to train your model. You can specify your training script, compute target, and Azure Machine Learning environment in your job configuration and run a training job. |
30-
| [Automated machine learning](#automated-machine-learning) | Automated machine learning allows you to **train models without extensive data science or programming knowledge**. For people with a data science and programming background, it provides a way to save time and resources by automating algorithm selection and hyperparameter tuning. You don't have to worry about defining a job configuration when using automated machine learning. |
31-
| [Machine learning pipeline](#machine-learning-pipeline) | Pipelines are not a different training method, but a **way of defining a workflow using modular, reusable steps** that can include training as part of the workflow. Machine learning pipelines support using automated machine learning and run configuration to train models. Since pipelines are not focused specifically on training, the reasons for using a pipeline are more varied than the other training methods. Generally, you might use a pipeline when:<br>* You want to **schedule unattended processes** such as long running training jobs or data preparation.<br>* Use **multiple steps** that are coordinated across heterogeneous compute resources and storage locations.<br>* Use the pipeline as a **reusable template** for specific scenarios, such as retraining or batch scoring.<br>* **Track and version data sources, inputs, and outputs** for your workflow.<br>* Your workflow is **implemented by different teams that work on specific steps independently**. Steps can then be joined together in a pipeline to implement the workflow. |
29+
| [Run configuration](#run-configuration) | A **common way to train models** is to use a training script and job configuration. The job configuration defines the training environment, including your script, compute target, and Azure Machine Learning environment. You can run a training job by specifying these details. |
30+
| [Automated machine learning](#automated-machine-learning) | Automated machine learning lets you **train models without deep data science or programming expertise**. For experienced users, it saves time by automating algorithm selection and hyperparameter tuning. Job configuration is not required when using automated machine learning. |
31+
| [Machine learning pipeline](#machine-learning-pipeline) | Pipelines are not a separate training method, but a **way to define workflows using modular, reusable steps** that can include training. Pipelines support both automated machine learning and run configuration. Use a pipeline when you want to:<br>* **Schedule unattended processes** like long-running training jobs or data preparation.<br>* Coordinate **multiple steps** across different compute resources and storage locations.<br>* Create a **reusable template** for scenarios such as retraining or batch scoring.<br>* **Track and version data sources, inputs, and outputs** for your workflow.<br>* Enable **different teams to work on specific steps independently** and combine them in a pipeline. |
3232

33-
+ **Designer**: Azure Machine Learning designer provides an easy entry-point into machine learning for building proof of concepts, or for users with little coding experience. It allows you to train models using a drag and drop web-based UI. You can use Python code as part of the design, or train models without writing any code.
33+
+ **Designer**: Azure Machine Learning designer is an easy entry point for building proof of concepts or for users with limited coding experience. Train models using a drag-and-drop web UI. You can include Python code or train models without writing any code.
3434

35-
+ **Azure CLI**: The machine learning CLI provides commands for common tasks with Azure Machine Learning, and is often used for **scripting and automating tasks**. For example, once you've created a training script or pipeline, you might use the Azure CLI to start a training job on a schedule or when the data files used for training are updated. For training models, it provides commands that submit training jobs. It can submit jobs using run configurations or pipelines.
35+
+ **Azure CLI**: The machine learning CLI offers commands for common Azure Machine Learning tasks and is often used for **scripting and automation**. For example, after creating a training script or pipeline, you can use the CLI to start a training job on a schedule or when training data is updated. The CLI can submit jobs using run configurations or pipelines.
3636

37-
Each of these training methods can use different types of compute resources for training. Collectively, these resources are referred to as [__compute targets__](concept-azure-machine-learning-architecture.md#compute-targets). A compute target can be a local machine or a cloud resource, such as an Azure Machine Learning Compute, Azure HDInsight, or a remote virtual machine.
37+
Each training method can use different types of compute resources, called [__compute targets__](concept-azure-machine-learning-architecture.md#compute-targets). A compute target can be a local machine or a cloud resource, such as Azure Machine Learning Compute, Azure HDInsight, or a remote virtual machine.
3838

3939
## Python SDK
4040

41-
The Azure Machine Learning SDK for Python allows you to build and run machine learning workflows with Azure Machine Learning. You can interact with the service from an interactive Python session, Jupyter Notebooks, Visual Studio Code, or other IDE.
41+
The Azure Machine Learning SDK for Python lets you build and run machine learning workflows. You can interact with the service from an interactive Python session, Jupyter Notebooks, Visual Studio Code, or other IDE.
4242

4343
* [What is the Azure Machine Learning SDK for Python](/python/api/overview/azure/ml/intro)
4444
* [Install/update the SDK](/python/api/overview/azure/ml/install)
4545
* [Configure a development environment for Azure Machine Learning](how-to-configure-environment.md)
4646

4747
### Run configuration
4848

49-
A generic training job with Azure Machine Learning can be defined using the [ScriptRunConfig](/python/api/azureml-core/azureml.core.scriptrunconfig). The script run configuration is then used, along with your training script(s) to train a model on a compute target.
5049

51-
You may start with a run configuration for your local computer, and then switch to one for a cloud-based compute target as needed. When changing the compute target, you only change the run configuration you use. A run also logs information about the training job, such as the inputs, outputs, and logs.
50+
A typical training job in Azure Machine Learning is defined using [ScriptRunConfig](/python/api/azureml-core/azureml.core.scriptrunconfig). The script run configuration, together with your training script(s), is used to train a model on a compute target.
51+
52+
You can start with a run configuration for your local computer and switch to a cloud-based compute target as needed. To change the compute target, update the run configuration. Each run logs information about the training job, including inputs, outputs, and logs.
5253

5354
* [What is a run configuration?](concept-azure-machine-learning-architecture.md#run-configurations)
5455
* [Tutorial: Train your first ML model](tutorial-1st-experiment-sdk-train.md)
@@ -57,10 +58,11 @@ You may start with a run configuration for your local computer, and then switch
5758

5859
### Automated Machine Learning
5960

60-
Define the iterations, hyperparameter settings, featurization, and other settings. During training, Azure Machine Learning tries different algorithms and parameters in parallel. Training stops once it hits the exit criteria you defined.
61+
Define iterations, hyperparameter settings, featurization, and other options. During training, Azure Machine Learning tests different algorithms and parameters in parallel. Training stops when it meets the exit criteria you set.
62+
6163

6264
> [!TIP]
63-
> In addition to the Python SDK, you can also use Automated ML through [Azure Machine Learning studio](https://ml.azure.com).
65+
> You can also use Automated ML through [Azure Machine Learning studio](https://ml.azure.com), in addition to the Python SDK.
6466
6567
* [What is automated machine learning?](../concept-automated-ml.md)
6668
* [Tutorial: Create your first classification model with automated machine learning](../tutorial-first-experiment-automated-ml.md)
@@ -71,7 +73,8 @@ Define the iterations, hyperparameter settings, featurization, and other setting
7173

7274
### Machine learning pipeline
7375

74-
Machine learning pipelines can use the previously mentioned training methods. Pipelines are more about creating a workflow, so they encompass more than just the training of models. In a pipeline, you can train a model using automated machine learning or run configurations.
76+
77+
Machine learning pipelines can use the training methods described above. Pipelines focus on creating workflows, so they cover more than just model training. In a pipeline, you can train a model using automated machine learning or run configurations.
7578

7679
* [What are ML pipelines in Azure Machine Learning?](../concept-ml-pipelines.md)
7780
* [Create and run machine learning pipelines with Azure Machine Learning SDK](how-to-create-machine-learning-pipelines.md)
@@ -81,40 +84,43 @@ Machine learning pipelines can use the previously mentioned training methods. Pi
8184

8285
### Understand what happens when you submit a training job
8386

84-
The Azure training lifecycle consists of:
87+
88+
The Azure training lifecycle includes:
8589

8690
1. Zipping the files in your project folder, ignoring those specified in _.amlignore_ or _.gitignore_
87-
1. Scaling up your compute cluster
88-
1. Building or downloading the dockerfile to the compute node
89-
1. The system calculates a hash of:
90-
- The base image
91+
2. Scaling up your compute cluster
92+
3. Building or downloading the Docker image to the compute node
93+
1. The system calculates a hash of:
94+
- The base image
9195
- The conda definition YAML (see [Create & use software environments in Azure Machine Learning](how-to-use-environments.md))
92-
1. The system uses this hash as the key in a lookup of the workspace Azure Container Registry (ACR)
93-
1. If it is not found, it looks for a match in the global ACR
94-
1. If it is not found, the system builds a new image (which will be cached and registered with the workspace ACR)
95-
1. Downloading your zipped project file to temporary storage on the compute node
96-
1. Unzipping the project file
97-
1. The compute node executing `python <entry script> <arguments>`
98-
1. Saving logs, model files, and other files written to `./outputs` to the storage account associated with the workspace
99-
1. Scaling down compute, including removing temporary storage
96+
2. The system uses this hash to look up the workspace Azure Container Registry (ACR)
97+
3. If not found, it checks the global ACR
98+
4. If still not found, the system builds a new image (which is cached and registered with the workspace ACR)
99+
4. Downloading your zipped project file to temporary storage on the compute node
100+
5. Unzipping the project file
101+
6. The compute node executes `python <entry script> <arguments>`
102+
7. Saving logs, model files, and other files written to `./outputs` to the storage account associated with the workspace
103+
8. Scaling down compute, including removing temporary storage
100104

101-
If you choose to train on your local machine ("configure as local run"), you do not need to use Docker. You may use Docker locally if you choose (see the section [Configure ML pipeline](how-to-debug-pipelines.md) for an example).
105+
If you train on your local machine ("configure as local run"), Docker is not required. You can use Docker locally if you prefer (see [Configure ML pipeline](how-to-debug-pipelines.md) for an example).
102106

103107
## Azure Machine Learning designer
104108

105-
The designer lets you train models using a drag and drop interface in your web browser.
109+
110+
The designer lets you train models using a drag-and-drop interface in your web browser.
106111

107112
+ [What is the designer?](concept-designer.md)
108113
+ [Tutorial: Predict automobile price](tutorial-designer-automobile-price-train-score.md)
109114

110115
## Azure CLI
111116

112-
The machine learning CLI is an extension for the Azure CLI. It provides cross-platform CLI commands for working with Azure Machine Learning. Typically, you use the CLI to automate tasks, such as training a machine learning model.
117+
118+
The machine learning CLI is an extension for the Azure CLI. It provides cross-platform commands for working with Azure Machine Learning. Typically, you use the CLI to automate tasks, such as training a machine learning model.
113119

114120
* [Use the CLI extension for Azure Machine Learning](reference-azure-machine-learning-cli.md)
115121
* [MLOps on Azure](https://github.com/microsoft/MLOps)
116122

117123

118124
## Next steps
119125

120-
Learn how to [Configure a training run](how-to-set-up-training-targets.md).
126+
Learn how to [configure a training run](how-to-set-up-training-targets.md).

0 commit comments

Comments
 (0)