You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this how-to article, you learn how to use Azure Machine Learning designer to retrain a machine learning model. Find out how to use published pipelines to automate machine learning workflows for retraining.
17
+
In this how-to article, you learn how to use Azure Machine Learning designer to retrain a machine learning model. You will use published pipelines to automate your workflow and set parameters to train your model on new data.
18
18
19
19
In this article, you learn how to:
20
20
21
21
> [!div class="checklist"]
22
22
> * Train a machine learning model.
23
23
> * Create a pipeline parameter.
24
24
> * Publish your training pipeline.
25
-
> * Retrain your model.
25
+
> * Retrain your model with new parameters.
26
26
27
27
## Prerequisites
28
28
29
-
* An Azure subscription. If you don't have an Azure subscription, create a [free account](https://aka.ms/AMLFree).
30
29
* An Azure Machine Learning workspace with the Enterprise SKU.
30
+
* A dataset accessible to the designer. This can be one of the following:
31
+
* An Azure Machine Learning registered dataset
32
+
33
+
**-or-**
34
+
* A data file stored in an Azure Machine Learning datastore.
35
+
36
+
For information on data access using the designer see [How to import data into the designer](how-to-designer-import-data.md).
31
37
32
-
This article assumes that you have basic knowledge of building pipelines in the designer. For a guided introduction to the designer, complete the [tutorial](tutorial-designer-automobile-price-train-score.md).
38
+
This article also assumes that you have basic knowledge of building pipelines in the designer. For a guided introduction, complete the [tutorial](tutorial-designer-automobile-price-train-score.md).
33
39
34
40
### Sample pipeline
35
41
36
-
The pipeline used in this article is an altered version of the one found in [Sample 3: Income prediction](how-to-designer-sample-classification-predict-income.md). It uses the [Import Data](algorithm-module-reference/import-data.md) module instead of the sample dataset to show you how to train a model by using your own data.
42
+
The pipeline used in this article is an altered version of [Sample 3: Income prediction](samples-designer.md#classification-samples). The pipeline uses the [Import Data](algorithm-module-reference/import-data.md) module instead of the sample dataset to show you how to train models using your own data.
37
43
38
44

39
45
40
-
## Train a machine learning model
46
+
## Create a pipeline parameter
41
47
42
-
To retrain a model, you need an initial model. In this section, you learn how to train a model and access the saved model by using the designer.
48
+
Create pipeline parameters to dynamically set variables at runtime. For this example, you will change the training data path from a fixed value to a parameter, so that you can retrain your model on different data.
43
49
44
50
1. Select the **Import Data** module.
45
-
1. On the properties pane, specify a data source.
46
-
47
-

48
51
49
-
For this example, the data is stored in an [Azure datastore](how-to-access-data.md). If you don't already have a datastore, you can create one now by selecting **New datastore**.
52
+
> [!NOTE]
53
+
> This example uses the Import Data module to access data in a registered datastore. However, you can follow similar steps if you use alternative data access patterns.
50
54
51
-
1. Specify the path to your data. You can also select **Browse path** to browse to your datastore.
52
-
1. Select **Submit** at the top of the canvas.
53
-
54
-
> [!NOTE]
55
-
> If you have already set a default compute for this pipeline draft, the pipeline will run automatically. Otherwise, you can follow the prompts on the settings pane to set one now.
55
+
1. In the module detail pane, to the right of the canvas, select your data source.
56
56
57
-
### Find your trained model
57
+
1. Enter the path to your data. You can also select **Browse path** to browse your file tree.
58
58
59
-
The designer saves all pipeline outputs, including trained models, to the default storage account. However, you can also access trained models directly in the designer:
59
+
1. Mouseover the **Path** field, and select the ellipses above the **Path** field that appear.
60
60
61
-
1. Wait for the pipeline to finish running.
62
-
1. Select the **Train Model** module.
63
-
1. On the settings pane, select **Outputs+logs**.
64
-
1. Select the **View output** icon, and follow the instruction in the pop-up window to find the trained model.
65
-
66
-

67
-
68
-
## Create a pipeline parameter
61
+

69
62
70
-
Add pipeline parameters to dynamically set variables at runtime. For this pipeline, add a parameter for the training data path so that you can retrain your model on a new dataset.
63
+
1. Select **Add to pipelineparameter**.
71
64
72
-
1. Select the **Import Data** module.
73
-
1. In the settings pane, select the ellipses above the **Path** field.
74
-
1. Select **Add to pipeline parameter**.
75
65
1. Provide a parameter name and a default value.
76
66
77
67
> [!NOTE]
78
68
> You can inspect and edit your pipeline parameters by selecting the **Settings** gear icon next to the title of your pipeline draft.
79
69
80
-

70
+
1. Select **Save**.
71
+
72
+
1. Submit the pipeline run.
73
+
74
+
## Find a trained model
75
+
76
+
The designer saves all pipeline output, including trained models, to the default workspace storage account. You can also access trained models directly in the designer:
77
+
78
+
1. Wait for the pipeline to finish running.
79
+
1. Select the **Train Model** module.
80
+
1. In the module details pane, to the right of the canvas, select **Outputs + logs**.
81
+
1. You can find your model in **Other outputs** along with run logs.
82
+
1. Alternatively, select the **View output** icon. From here, you can follow the instruction in the dialog to navigate directly to your datastore.
83
+
84
+

81
85
82
86
## Publish a training pipeline
83
87
84
-
When you publish a pipeline, it creates a pipeline endpoint. Pipeline endpoints let you reuse and manage your pipelines for repeatability and automation. In this example, you have set up your pipeline for retraining.
88
+
Publish a pipeline to a pipeline endpoint to easily reuse your pipelines in the future. A pipeline endpoint creates a REST endpoint to invoke pipeline in the future. In this example, your pipeline endpoint lets you reuse your pipeline to retrain a model on different data.
85
89
86
90
1. Select **Publish** above the designer canvas.
87
91
1. Select or create a pipeline endpoint.
88
92
89
93
> [!NOTE]
90
-
> You can publish multiple pipelines to a single endpoint. Each pipeline in the endpoint is given a version number, which you can specify when you call the pipeline endpoint.
94
+
> You can publish multiple pipelines to a single endpoint. Each pipeline in a given endpoint is given a version number, which you can specify when you call the pipeline endpoint.
91
95
92
96
1. Select **Publish**.
93
97
94
98
## Retrain your model
95
99
96
-
Now that you have a published training pipeline, you can use it to retrain your model by using new data. You can submit runs from a pipeline endpoint from the Azure portal or submit them programmatically.
100
+
Now that you have a published training pipeline, you can use it to retrain your model on new data. You can submit runs from a pipeline endpoint from the studio workspace or programmatically.
97
101
98
102
### Submit runs by using the designer
99
103
100
-
Use the following steps to submit a pipeline endpoint run from the designer:
104
+
Use the following steps to submit a parameterized pipeline endpoint run from the designer:
101
105
102
-
1. Go to the **Endpoints** page.
103
-
1. Select the **Pipeline endpoints** tab.
104
-
1. Select your pipeline endpoint.
105
-
1. Select the **Published pipelines** tab.
106
-
1. Select the pipeline that you want to run.
106
+
1. Go to the **Endpoints** page in your studio workspace.
107
+
1. Select the **Pipeline endpoints** tab. Then, select your pipeline endpoint.
108
+
1. Select the **Published pipelines** tab. Then, select the pipeline version that you want to run.
107
109
1. Select **Submit**.
108
-
1. In the setup dialog box, you can specify a new value for the input data path value. This value points to your new dataset.
110
+
1. In the setup dialog box, you can specify the parameters values for the run. For this example, update the data path to train your model using a non-US dataset.
109
111
110
112

111
113
@@ -117,4 +119,6 @@ To make a REST call, you need an OAuth 2.0 bearer-type authentication header. Fo
117
119
118
120
## Next steps
119
121
120
-
Follow the [designer tutorial](tutorial-designer-automobile-price-train-score.md) to train and deploy a regression model.
122
+
In this article, you learned how to create a parameterized training pipeline endpoint using the designer.
123
+
124
+
For a complete walkthrough of how you can deploy a model to make predictions, see the [designer tutorial](tutorial-designer-automobile-price-train-score.md) to train and deploy a regression model.
0 commit comments