Commit b0160c3

Merge pull request #110477 from PeterCLu/plu-retrain-update
Plu retrain update
2 parents 58a3bdc + 55be724 commit b0160c3

4 files changed, with 47 additions and 43 deletions

articles/machine-learning/how-to-retrain-designer.md

Lines changed: 47 additions & 43 deletions
@@ -8,104 +8,106 @@ ms.subservice: core
88
ms.topic: how-to
99
ms.author: keli19
1010
author: likebupt
11-
ms.date: 02/24/2020
11+
ms.date: 04/06/2020
1212
---
1313

1414
# Retrain models with Azure Machine Learning designer (preview)
1515
[!INCLUDE [applies-to-skus](../../includes/aml-applies-to-enterprise-sku.md)]
1616

17-
In this how-to article, you learn how to use Azure Machine Learning designer to retrain a machine learning model. Find out how to use published pipelines to automate machine learning workflows for retraining.
17+
In this how-to article, you learn how to use Azure Machine Learning designer to retrain a machine learning model. You will use published pipelines to automate your workflow and set parameters to train your model on new data.
1818

1919
In this article, you learn how to:
2020

2121
> [!div class="checklist"]
2222
> * Train a machine learning model.
2323
> * Create a pipeline parameter.
2424
> * Publish your training pipeline.
25-
> * Retrain your model.
25+
> * Retrain your model with new parameters.
2626
2727
## Prerequisites
2828

29-
* An Azure subscription. If you don't have an Azure subscription, create a [free account](https://aka.ms/AMLFree).
3029
* An Azure Machine Learning workspace with the Enterprise SKU.
30+
* A dataset accessible to the designer. This can be one of the following:
31+
* An Azure Machine Learning registered dataset
32+
33+
**-or-**
34+
* A data file stored in an Azure Machine Learning datastore.
35+
36+
For information on data access using the designer, see [How to import data into the designer](how-to-designer-import-data.md).
3137

32-
This article assumes that you have basic knowledge of building pipelines in the designer. For a guided introduction to the designer, complete the [tutorial](tutorial-designer-automobile-price-train-score.md).
38+
This article also assumes that you have basic knowledge of building pipelines in the designer. For a guided introduction, complete the [tutorial](tutorial-designer-automobile-price-train-score.md).
3339

3440
### Sample pipeline
3541

36-
The pipeline used in this article is an altered version of the one found in [Sample 3: Income prediction](how-to-designer-sample-classification-predict-income.md). It uses the [Import Data](algorithm-module-reference/import-data.md) module instead of the sample dataset to show you how to train a model by using your own data.
42+
The pipeline used in this article is an altered version of [Sample 3: Income prediction](samples-designer.md#classification-samples). The pipeline uses the [Import Data](algorithm-module-reference/import-data.md) module instead of the sample dataset to show you how to train models using your own data.
3743

3844
![Screenshot that shows the modified sample pipeline with a box highlighting the Import Data module](./media/how-to-retrain-designer/modified-sample-pipeline.png)
3945

40-
## Train a machine learning model
46+
## Create a pipeline parameter
4147

42-
To retrain a model, you need an initial model. In this section, you learn how to train a model and access the saved model by using the designer.
48+
Create pipeline parameters to dynamically set variables at runtime. For this example, you will change the training data path from a fixed value to a parameter, so that you can retrain your model on different data.
4349

4450
1. Select the **Import Data** module.
45-
1. On the properties pane, specify a data source.
46-
47-
![Screenshot that shows a sample configuration of the Import Data module](./media/how-to-retrain-designer/import-data-settings.png)
4851

49-
For this example, the data is stored in an [Azure datastore](how-to-access-data.md). If you don't already have a datastore, you can create one now by selecting **New datastore**.
52+
> [!NOTE]
53+
> This example uses the Import Data module to access data in a registered datastore. However, you can follow similar steps if you use alternative data access patterns.
5054
51-
1. Specify the path to your data. You can also select **Browse path** to browse to your datastore.
52-
1. Select **Submit** at the top of the canvas.
53-
54-
> [!NOTE]
55-
> If you have already set a default compute for this pipeline draft, the pipeline will run automatically. Otherwise, you can follow the prompts on the settings pane to set one now.
55+
1. In the module detail pane, to the right of the canvas, select your data source.
5656

57-
### Find your trained model
57+
1. Enter the path to your data. You can also select **Browse path** to browse your file tree.
5858

59-
The designer saves all pipeline outputs, including trained models, to the default storage account. However, you can also access trained models directly in the designer:
59+
1. Hover over the **Path** field, and select the ellipsis that appears above it.
6060

61-
1. Wait for the pipeline to finish running.
62-
1. Select the **Train Model** module.
63-
1. On the settings pane, select **Outputs+logs**.
64-
1. Select the **View output** icon, and follow the instruction in the pop-up window to find the trained model.
65-
66-
![Screenshot that shows how to download the trained model](./media/how-to-retrain-designer/trained-model-view-output.png)
67-
68-
## Create a pipeline parameter
61+
![Screenshot that shows how to create a pipeline parameter](media/how-to-retrain-designer/add-pipeline-parameter.png)
6962

70-
Add pipeline parameters to dynamically set variables at runtime. For this pipeline, add a parameter for the training data path so that you can retrain your model on a new dataset.
63+
1. Select **Add to pipeline parameter**.
7164

72-
1. Select the **Import Data** module.
73-
1. In the settings pane, select the ellipses above the **Path** field.
74-
1. Select **Add to pipeline parameter**.
7565
1. Provide a parameter name and a default value.
7666

7767
> [!NOTE]
7868
> You can inspect and edit your pipeline parameters by selecting the **Settings** gear icon next to the title of your pipeline draft.
7969
80-
![Screenshot that shows how to create a pipeline parameter](media/how-to-retrain-designer/add-pipeline-parameter.png)
70+
1. Select **Save**.
71+
72+
1. Submit the pipeline run.
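
As a rough illustration of what a pipeline parameter with a default value means, the sketch below models the idea in plain Python. It is not how the designer implements parameters. The function `resolve_parameters`, the parameter name `training_data_path`, and both file paths are hypothetical and chosen only for this example.

```python
from typing import Dict, Optional


def resolve_parameters(defaults: Dict[str, str],
                       overrides: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Return the values a run would use: defaults, replaced by any overrides."""
    overrides = overrides or {}
    # Reject names that were never declared as pipeline parameters.
    unknown = set(overrides) - set(defaults)
    if unknown:
        raise KeyError(f"Unknown pipeline parameter(s): {sorted(unknown)}")
    return {**defaults, **overrides}


# Hypothetical parameter declared with a default value.
defaults = {"training_data_path": "data/us-income.csv"}

# The first run uses the default; a later retraining run overrides it.
first_run = resolve_parameters(defaults)
retrain_run = resolve_parameters(
    defaults, {"training_data_path": "data/non-us-income.csv"}
)

print(first_run)
print(retrain_run)
```

The point of the sketch is that the pipeline definition stays the same between runs; only the parameter value supplied at submission time changes.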
73+
74+
## Find a trained model
75+
76+
The designer saves all pipeline output, including trained models, to the default workspace storage account. You can also access trained models directly in the designer:
77+
78+
1. Wait for the pipeline to finish running.
79+
1. Select the **Train Model** module.
80+
1. In the module details pane, to the right of the canvas, select **Outputs + logs**.
81+
1. You can find your model in **Other outputs** along with run logs.
82+
1. Alternatively, select the **View output** icon. From here, you can follow the instructions in the dialog to navigate directly to your datastore.
83+
84+
![Screenshot that shows how to download the trained model](./media/how-to-retrain-designer/trained-model-view-output.png)
8185

8286
## Publish a training pipeline
8387

84-
When you publish a pipeline, it creates a pipeline endpoint. Pipeline endpoints let you reuse and manage your pipelines for repeatability and automation. In this example, you have set up your pipeline for retraining.
88+
Publish a pipeline to a pipeline endpoint so that you can easily reuse it later. A pipeline endpoint exposes a REST endpoint that you can call to invoke the pipeline. In this example, your pipeline endpoint lets you reuse your pipeline to retrain a model on different data.
8589

8690
1. Select **Publish** above the designer canvas.
8791
1. Select or create a pipeline endpoint.
8892

8993
> [!NOTE]
90-
> You can publish multiple pipelines to a single endpoint. Each pipeline in the endpoint is given a version number, which you can specify when you call the pipeline endpoint.
94+
> You can publish multiple pipelines to a single endpoint. Each pipeline in an endpoint is assigned a version number, which you can specify when you call the pipeline endpoint.
9195
9296
1. Select **Publish**.
9397

9498
## Retrain your model
9599

96-
Now that you have a published training pipeline, you can use it to retrain your model by using new data. You can submit runs from a pipeline endpoint from the Azure portal or submit them programmatically.
100+
Now that you have a published training pipeline, you can use it to retrain your model on new data. You can submit runs from a pipeline endpoint from the studio workspace or programmatically.
97101

98102
### Submit runs by using the designer
99103

100-
Use the following steps to submit a pipeline endpoint run from the designer:
104+
Use the following steps to submit a parameterized pipeline endpoint run from the designer:
101105

102-
1. Go to the **Endpoints** page.
103-
1. Select the **Pipeline endpoints** tab.
104-
1. Select your pipeline endpoint.
105-
1. Select the **Published pipelines** tab.
106-
1. Select the pipeline that you want to run.
106+
1. Go to the **Endpoints** page in your studio workspace.
107+
1. Select the **Pipeline endpoints** tab. Then, select your pipeline endpoint.
108+
1. Select the **Published pipelines** tab. Then, select the pipeline version that you want to run.
107109
1. Select **Submit**.
108-
1. In the setup dialog box, you can specify a new value for the input data path value. This value points to your new dataset.
110+
1. In the setup dialog box, you can specify the parameter values for the run. For this example, update the data path to train your model using a non-US dataset.
109111

110112
![Screenshot that shows how to set up a parameterized pipeline run in the designer](./media/how-to-retrain-designer/published-pipeline-run.png)
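
For the programmatic route, the following sketch shows one way a parameterized run request might be assembled with the Python standard library. It makes several assumptions: the JSON body shape (`ExperimentName` and `ParameterAssignments`) is assumed rather than taken from this article, and the endpoint URL, the token placeholder, the experiment name, and the parameter name `training_data_path` are all hypothetical. The request is built but deliberately not sent, so the sketch runs offline.

```python
import json
import urllib.request


def build_retrain_request(endpoint_url, bearer_token, experiment_name, parameters):
    """Build (but do not send) a POST request for a parameterized pipeline run."""
    # Assumed body shape; check the REST endpoint's own documentation.
    body = {
        "ExperimentName": experiment_name,
        "ParameterAssignments": parameters,
    }
    return urllib.request.Request(
        endpoint_url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # OAuth 2.0 bearer-type authentication header.
            "Authorization": f"Bearer {bearer_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# All values below are placeholders for illustration only.
request = build_retrain_request(
    endpoint_url="https://example.invalid/pipelines/placeholder-id",
    bearer_token="<token>",
    experiment_name="retrain-demo",
    parameters={"training_data_path": "data/non-us-income.csv"},
)

# To actually submit the run, you would pass `request` to
# urllib.request.urlopen; that call is left out here on purpose.
print(request.get_method(), request.full_url)
```

Keeping the request construction separate from sending it makes the payload easy to inspect before any run is triggered.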
111113

@@ -117,4 +119,6 @@ To make a REST call, you need an OAuth 2.0 bearer-type authentication header. Fo
117119

118120
## Next steps
119121

120-
Follow the [designer tutorial](tutorial-designer-automobile-price-train-score.md) to train and deploy a regression model.
122+
In this article, you learned how to create a parameterized training pipeline endpoint using the designer.
123+
124+
For a complete walkthrough of how you can deploy a model to make predictions, see the [designer tutorial](tutorial-designer-automobile-price-train-score.md) to train and deploy a regression model.