Commit fd45efb

Merge pull request #102815 from PeterCLu/plu-designer-tutorial
Update tutorial
2 parents 49ebf71 + 48e02d0 commit fd45efb

File tree

6 files changed: +72 -77 lines changed


articles/machine-learning/tutorial-designer-automobile-price-deploy.md

Lines changed: 2 additions & 0 deletions
Original file line number | Diff line number | Diff line change
@@ -35,6 +35,8 @@ To deploy your pipeline, you must first convert the training pipeline into a rea
3535

3636
1. Above the pipeline canvas, select **Create inference pipeline** > **Real-time inference pipeline**.
3737

38+
![Screenshot showing where to find the create pipeline button](./media/tutorial-designer-automobile-price-deploy/create-inference-pipeline.png)
39+
3840
Your pipeline should now look like this:
3941

4042
![Screenshot showing the expected configuration of the pipeline after preparing it for deployment](./media/tutorial-designer-automobile-price-deploy/real-time-inference-pipeline.png)

articles/machine-learning/tutorial-designer-automobile-price-train-score.md

Lines changed: 70 additions & 30 deletions
Original file line number | Diff line number | Diff line change
@@ -9,13 +9,13 @@ services: machine-learning
99
ms.service: machine-learning
1010
ms.subservice: core
1111
ms.topic: tutorial
12-
ms.date: 11/04/2019
12+
ms.date: 01/30/2020
1313
---
1414

1515
# Tutorial: Predict automobile price with the designer (preview)
1616
[!INCLUDE [applies-to-skus](../../includes/aml-applies-to-enterprise-sku.md)]
1717

18-
In this two-part tutorial, you learn how to use the Azure Machine Learning designer to develop and deploy a predictive analytics solution that predicts the price of any car.
18+
In this two-part tutorial, you learn how to use the Azure Machine Learning designer to train and deploy a machine learning model that predicts the price of any car. The designer is a drag-and-drop tool that lets you create machine learning models without a single line of code.
1919

2020
In part one of the tutorial, you'll learn how to:
2121

@@ -41,13 +41,15 @@ To create an Azure Machine Learning pipeline, you need an Azure Machine Learning
4141

4242
### Create a new workspace
4343

44+
To use the designer, you first need an Azure Machine Learning workspace. The workspace is the top-level resource for Azure Machine Learning; it provides a centralized place to work with all the artifacts you create in Azure Machine Learning.
45+
4446
If you have an Azure Machine Learning workspace with an Enterprise edition, [skip to the next section](#create-the-pipeline).
4547

4648
[!INCLUDE [aml-create-portal](../../includes/aml-create-in-portal-enterprise.md)]
4749

4850
### Create the pipeline
4951

50-
1. Sign in to [ml.azure.com](https://ml.azure.com), and select the workspace you want to work with.
52+
1. Sign in to <a href="https://ml.azure.com?tabs=jre" target="_blank">ml.azure.com</a>, and select the workspace you want to work with.
5153

5254
1. Select **Designer**.
5355

@@ -57,6 +59,30 @@ If you have an Azure Machine Learning workspace with an Enterprise edition, [ski
5759

5860
1. At the top of the canvas, select the default pipeline name **Pipeline-Created-on**. Rename it to *Automobile price prediction*. The name doesn't need to be unique.
5961

62+
## Set the default compute target
63+
64+
A pipeline runs on a compute target, which is a compute resource that's attached to your workspace. After you create a compute target, you can reuse it for future runs.
65+
66+
You can set a **Default compute target** for the entire pipeline, which tells every module to use the same compute target by default. You can still override it by specifying a compute target on a per-module basis.
67+
68+
1. Next to the pipeline name, select the **Gear icon** ![Screenshot of the gear icon](./media/tutorial-designer-automobile-price-train-score/gear-icon.png) at the top of the canvas to open the **Settings** pane.
69+
70+
1. In the **Settings** pane to the right of the canvas, select **Select compute target**.
71+
72+
If you already have an available compute target, you can select it to run this pipeline.
73+
74+
> [!NOTE]
75+
> The designer can run experiments only on Azure Machine Learning Compute targets. Other compute targets won't be shown.
76+
77+
1. Enter a name for the compute resource.
78+
79+
1. Select **Save**.
80+
81+
> [!NOTE]
82+
> It takes approximately five minutes to create a compute resource. After the resource is created, you can reuse it and skip this wait time for future runs.
83+
>
84+
> The compute resource autoscales to zero nodes when it's idle to save cost. When you use it again after a delay, you might experience approximately five minutes of wait time while it scales back up.
85+
6086
## Import data
6187

6288
There are several sample datasets included in the designer for you to experiment with. For this tutorial, use **Automobile price data (Raw)**.
@@ -73,7 +99,7 @@ You can visualize the data to understand the dataset that you'll use.
7399

74100
1. Select the **Automobile price data (Raw)** module.
75101

76-
1. In the properties pane to the right of the canvas, select **Outputs**.
102+
1. In the module details pane to the right of the canvas, select **Outputs**.
77103

78104
1. Select the graph icon to visualize the data.
79105

@@ -89,9 +115,9 @@ Datasets typically require some preprocessing before analysis. You might have no
89115

90116
### Remove a column
91117

92-
When you train a model, you have to do something about the data that's missing. In this dataset, the **normalized-losses** column is missing many values, so you exclude that column from the model altogether.
118+
When you train a model, you have to do something about the data that's missing. In this dataset, the **normalized-losses** column is missing many values, so you will exclude that column from the model altogether.
93119

94-
1. Enter **Select** in the search box at the top of the palette to find the **Select Columns in Dataset** module.
120+
1. In the module palette to the left of the canvas, expand the **Data Transformation** section and find the **Select Columns in Dataset** module.
95121

96122
1. Drag the **Select Columns in Dataset** module onto the canvas. Drop the module below the dataset module.
97123

@@ -105,7 +131,7 @@ When you train a model, you have to do something about the data that's missing.
105131

106132
1. Select the **Select Columns in Dataset** module.
107133

108-
1. In the properties pane to the right of the canvas, select **All columns**.
134+
1. In the module details pane to the right of the canvas, select **All columns**.
109135

110136
1. Select the **+** to add a new rule.
111137

@@ -119,7 +145,7 @@ When you train a model, you have to do something about the data that's missing.
119145

120146
1. Select the **Select Columns in Dataset** module.
121147

122-
1. In the properties pane, select the **Comment** text box and enter *Exclude normalized losses*.
148+
1. In the module details pane to the right of the canvas, select the **Comment** text box and enter *Exclude normalized losses*.
123149

124150
Comments will appear on the graph to help you organize your pipeline.
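
For readers who think in code, the column exclusion above can be sketched in plain Python. This is only an illustration of the idea, not what the designer runs internally; the rows, their values, and the helper name `exclude_columns` are hypothetical.

```python
# Hypothetical rows standing in for the sample dataset; the values are made up.
rows = [
    {"make": "audi", "normalized-losses": 164, "price": 13950},
    {"make": "bmw", "normalized-losses": None, "price": 16430},
]


def exclude_columns(rows, excluded):
    """Return copies of the rows without the excluded column names."""
    return [
        {name: value for name, value in row.items() if name not in excluded}
        for row in rows
    ]


# Drop the column that is missing many values; the original rows are untouched.
trimmed = exclude_columns(rows, {"normalized-losses"})
```

Returning new dictionaries instead of mutating the input keeps the original data available, much as the dataset module on the canvas is unchanged by modules connected below it.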
125151

@@ -130,13 +156,15 @@ Your dataset still has missing values after you remove the **normalized-losses**
130156
> [!TIP]
131157
> Cleaning the missing values from input data is a prerequisite for using most of the modules in the designer.
132158
133-
1. Enter **Clean** in the search box to find the **Clean Missing Data** module.
159+
1. In the module palette to the left of the canvas, expand the section **Data Transformation**, and find the **Clean Missing Data** module.
134160

135161
1. Drag the **Clean Missing Data** module to the pipeline canvas. Connect it to the **Select Columns in Dataset** module.
136162

137-
1. In the properties pane, select **Remove entire row** under **Cleaning mode**.
163+
1. Select the **Clean Missing Data** module.
164+
165+
1. In the module details pane to the right of the canvas, select **Remove entire row** under **Cleaning mode**.
138166

139-
1. In the properties pane **Comment** box, enter *Remove missing value rows*.
167+
1. In the module details pane to the right of the canvas, select the **Comment** box, and enter *Remove missing value rows*.
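
As a rough code analogue of the **Remove entire row** cleaning mode, the sketch below drops every row that has at least one missing value. It assumes missing values are represented as `None`; the function name and sample rows are hypothetical and are not taken from the designer.

```python
def remove_rows_with_missing(rows):
    """Split rows into those with no missing values and those with any."""
    kept = [row for row in rows if all(v is not None for v in row.values())]
    discarded = [row for row in rows if any(v is None for v in row.values())]
    return kept, discarded


# Made-up rows: the second one is missing its price.
sample = [
    {"make": "audi", "price": 13950},
    {"make": "bmw", "price": None},
    {"make": "volvo", "price": 16845},
]

cleaned, dropped = remove_rows_with_missing(sample)
```

The second list is returned only so you can inspect how much data the cleaning step removes.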
140168

141169
Your pipeline should now look something like this:
142170

@@ -152,26 +180,28 @@ Because you want to predict price, which is a number, you can use a regression a
152180

153181
Splitting data is a common task in machine learning. You will split your data into two separate datasets. One dataset will train the model and the other will test how well the model performed.
154182

155-
1. Enter **split data** in the search box to find the **Split Data** module. Connect the left port of the **Clean Missing Data** module to the **Split Data** module.
183+
1. In the module palette, expand the section **Data Transformation** and find the **Split Data** module.
184+
185+
1. Drag the **Split Data** module to the pipeline canvas.
186+
187+
1. Connect the left port of the **Clean Missing Data** module to the **Split Data** module.
156188

157189
> [!IMPORTANT]
158190
> Be sure that the left output port of **Clean Missing Data** connects to **Split Data**. The left port contains the cleaned data. The right port contains the discarded data.
159191
160192
1. Select the **Split Data** module.
161193

162-
1. In the properties pane, set the **Fraction of rows in the first output dataset** to 0.7.
194+
1. In the module details pane to the right of the canvas, set the **Fraction of rows in the first output dataset** to 0.7.
163195

164196
This option uses 70 percent of the data to train the model and 30 percent to test it. The 70 percent dataset will be accessible through the left output port. The remaining data will be available through the right output port.
165197

166-
1. In the properties pane **Comment** box, enter *Split the dataset into training set (0.7) and test set (0.3)*.
198+
1. In the module details pane to the right of the canvas, select the **Comment** box, and enter *Split the dataset into training set (0.7) and test set (0.3)*.
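
The 0.7 fraction above corresponds to a 70/30 split. A minimal Python sketch of such a split follows; it assumes the rows are shuffled before splitting, which may or may not match how the **Split Data** module is configured, and the function name and `seed` parameter are hypothetical.

```python
import random


def split_rows(rows, fraction=0.7, seed=0):
    """Shuffle a copy of the rows and split it at the given fraction."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    cut = round(len(shuffled) * fraction)
    return shuffled[:cut], shuffled[cut:]


# Ten placeholder rows: seven go to training, three to testing.
train, test = split_rows(range(10), fraction=0.7)
```

Fixing the seed means repeated runs produce the same training and test sets, which makes results easier to compare between runs.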
167199

168200
### Train the model
169201

170202
Train the model by giving it a dataset that includes the price. The algorithm constructs a model that explains the relationship between the features and the price as presented by the training data.
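
To make "constructs a model" concrete, here is a deliberately small sketch of least-squares linear regression with a single feature. It is an assumption-laden illustration: the designer's regression module works with many features and its internals are not shown in this tutorial, and the function names and toy numbers below are hypothetical.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Apply a fitted (slope, intercept) pair to a new feature value."""
    slope, intercept = model
    return slope * x + intercept


# Toy training data: one feature and the label (a stand-in for price).
features = [1.0, 2.0, 3.0, 4.0]
labels = [3.0, 5.0, 7.0, 9.0]

model = fit_line(features, labels)
```

The fitted pair plays the role of the trained model: once it exists, `predict` can be applied to rows the model has never seen.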
171203

172-
1. To select the learning algorithm, clear your module palette search box.
173-
174-
1. Expand **Machine Learning Algorithms**.
204+
1. In the module palette, expand **Machine Learning Algorithms**.
175205

176206
This option displays several categories of modules that you can use to initialize learning algorithms.
177207

@@ -188,9 +218,11 @@ Train the model by giving it a dataset that includes the price. The algorithm co
188218
189219
![Screenshot showing the correct configuration of the Train Model module. The Linear Regression module connects to left port of Train Model module and the Split Data module connects to right port of Train Model](./media/tutorial-designer-automobile-price-train-score/pipeline-train-model.png)
190220

221+
1. In the module palette, expand the section **Module training**, and drag the **Train Model** module to the canvas.
222+
191223
1. Select the **Train Model** module.
192224

193-
1. In the properties pane, select the **Edit column** selector.
225+
1. In the module details pane to the right of the canvas, select the **Edit column** selector.
194226

195227
1. In the **Label column** dialog box, expand the drop-down menu and select **Column names**.
196228

@@ -200,15 +232,15 @@ Train the model by giving it a dataset that includes the price. The algorithm co
200232

201233
![Screenshot showing the correct configuration of the pipeline after adding the Train Model module.](./media/tutorial-designer-automobile-price-train-score/pipeline-train-graph.png)
202234

203-
## Score a machine learning model
235+
### Add the Score Model module
204236

205237
After you train your model by using 70 percent of the data, you can use it to score the other 30 percent to see how well your model functions.
206238

207239
1. Enter *score model* in the search box to find the **Score Model** module. Drag the module to the pipeline canvas.
208240

209241
1. Connect the output of the **Train Model** module to the left input port of **Score Model**. Connect the test data output (right port) of the **Split Data** module to the right input port of **Score Model**.
210242

211-
## Evaluate a machine learning model
243+
### Add the Evaluate Model module
212244

213245
Use the **Evaluate Model** module to evaluate how well your model scored the test dataset.
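
For intuition about what evaluating a regression model involves, the sketch below computes three common regression statistics from actual and predicted values. These are standard definitions, offered as an illustration rather than as the exact set or implementation used by the **Evaluate Model** module; the sample numbers are made up.

```python
import math


def mean_absolute_error(actual, predicted):
    """Average of the absolute differences between actual and predicted."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_squared_error(actual, predicted):
    """Square root of the average squared difference."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def coefficient_of_determination(actual, predicted):
    """R squared: 1 minus residual variation over total variation."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


# Made-up actual and predicted values for three test rows.
actual = [10.0, 20.0, 30.0]
predicted = [12.0, 18.0, 33.0]

mae = mean_absolute_error(actual, predicted)
rmse = root_mean_squared_error(actual, predicted)
r2 = coefficient_of_determination(actual, predicted)
```

For the two error statistics, smaller values mean the predictions sit closer to the actual values; for the coefficient of determination, values closer to 1 indicate a better fit.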
214246

@@ -222,15 +254,28 @@ Use the **Evaluate Model** module to evaluate how well your model scored the tes
222254

223255
## Run the pipeline
224256

225-
[!INCLUDE [aml-ui-create-training-compute](../../includes/aml-ui-create-training-compute.md)]
257+
Now that your pipeline is all set up, you can submit a pipeline run.
258+
259+
1. At the top of the canvas, select **Run**.
260+
261+
1. In the **Set up pipeline run** dialog box, select **+ New experiment** for the **Experiment**.
262+
263+
> [!NOTE]
264+
> Experiments group similar pipeline runs together. If you run a pipeline multiple times, you can select the same experiment for successive runs.
265+
266+
1. Enter a descriptive name for **Experiment Name**.
267+
268+
1. Select **Run**.
269+
270+
You can view run status and details at the top right of the canvas.
226271

227272
### View scored labels
228273

229274
After the run completes, you can view the results of the pipeline run. First, look at the predictions generated by the regression model.
230275

231276
1. Select the **Score Model** module to view its output.
232277

233-
1. In the properties pane, select **Outputs** > graph icon ![visualize icon](./media/tutorial-designer-automobile-price-train-score/visualize-icon.png) to view results.
278+
1. In the module details pane to the right of the canvas, select **Outputs** > graph icon ![visualize icon](./media/tutorial-designer-automobile-price-train-score/visualize-icon.png) to view results.
234279

235280
Here you can see the predicted prices and the actual prices from the testing data.
236281

@@ -242,7 +287,7 @@ Use the **Evaluate Model** to see how well the trained model performed on the te
242287

243288
1. Select the **Evaluate Model** module to view its output.
244289

245-
1. In the properties pane, select **Output** > graph icon ![visualize icon](./media/tutorial-designer-automobile-price-train-score/visualize-icon.png) to view results.
290+
1. In the module details pane to the right of the canvas, select **Output** > graph icon ![visualize icon](./media/tutorial-designer-automobile-price-train-score/visualize-icon.png) to view results.
246291

247292
The following statistics are shown for your model:
248293

@@ -256,17 +301,12 @@ For each of the error statistics, smaller is better. A smaller value indicates t
256301

257302
## Clean up resources
258303

304+
Skip this section if you want to continue on with part 2 of the tutorial, [deploying models](tutorial-designer-automobile-price-deploy.md).
305+
259306
[!INCLUDE [aml-ui-cleanup](../../includes/aml-ui-cleanup.md)]
260307

261308
## Next steps
262309

263-
In part one of this tutorial, you completed the following tasks:
264-
265-
* Create a pipeline
266-
* Prepare the data
267-
* Train the model
268-
* Score and evaluate the model
269-
270310
In part two, you'll learn how to deploy your model as a real-time endpoint.
271311

272312
> [!div class="nextstepaction"]

includes/aml-ui-create-training-compute.md

Lines changed: 0 additions & 47 deletions
This file was deleted.
