You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert

39
+
38
40
Your pipeline should now look like this:
39
41
40
42

In this two-part tutorial, you learn how to use the Azure Machine Learning designer to develop and deploy a predictive analytics solution that predicts the price of any car.
18
+
In this two-part tutorial, you learn how to use the Azure Machine Learning designer to train and deploy a machine learning model that predicts the price of any car. The designer is a drag-and-drop tool that lets you create machine learning models without a single line of code.
19
19
20
20
In part one of the tutorial, you'll learn how to:
21
21
@@ -41,13 +41,15 @@ To create an Azure Machine Learning pipeline, you need an Azure Machine Learning
41
41
42
42
### Create a new workspace
43
43
44
+
In order to use the designer, you first need an Azure Machine Learning workspace. The workspace is the top-level resource for Azure Machine Learning, it provides a centralized place to work with all the artifacts you create in Azure Machine Learning.
45
+
44
46
If you have an Azure Machine Learning workspace with an Enterprise edition, [skip to the next section](#create-the-pipeline).
1. Sign in to [ml.azure.com](https://ml.azure.com), and select the workspace you want to work with.
52
+
1. Sign in to <ahref="https://ml.azure.com?tabs=jre"target="_blank">ml.azure.com</a>, and select the workspace you want to work with.
51
53
52
54
1. Select **Designer**.
53
55
@@ -57,6 +59,30 @@ If you have an Azure Machine Learning workspace with an Enterprise edition, [ski
57
59
58
60
1. At the top of the canvas, select the default pipeline name **Pipeline-Created-on**. Rename it to *Automobile price prediction*. The name doesn't need to be unique.
59
61
62
+
## Set the default compute target
63
+
64
+
A pipeline runs on a compute target, which is a compute resource that's attached to your workspace. After you create a compute target, you can reuse it for future runs.
65
+
66
+
You can set a **Default compute target** for the entire pipeline, which will tell every module to use the same compute target by default. However, you can specify compute targets on a per-module basis.
67
+
68
+
1. Next to the pipeline name, select the **Gear icon** at the top of the canvas to open the **Settings** pane.
69
+
70
+
1. In the **Settings** pane to the right of the canvas, select **Select compute target**.
71
+
72
+
If you already have an available compute target, you can select it to run this pipeline.
73
+
74
+
> [!NOTE]
75
+
> The designer can run experiments only on Azure Machine Learning Compute targets. Other compute targets won't be shown.
76
+
77
+
1. Enter a name for the compute resource.
78
+
79
+
1. Select **Save**.
80
+
81
+
> [!NOTE]
82
+
> It takes approximately five minutes to create a compute resource. After the resource is created, you can reuse it and skip this wait time for future runs.
83
+
>
84
+
> The compute resource autoscales to zero nodes when it's idle to save cost. When you use it again after a delay, you might experience approximately five minutes of wait time while it scales back up.
85
+
60
86
## Import data
61
87
62
88
There are several sample datasets included in the designer for you to experiment with. For this tutorial, use **Automobile price data (Raw)**.
@@ -73,7 +99,7 @@ You can visualize the data to understand the dataset that you'll use.
73
99
74
100
1. Select the **Automobile price data (Raw)** module.
75
101
76
-
1. In the properties pane to the right of the canvas, select **Outputs**.
102
+
1. In the module details pane to the right of the canvas, select **Outputs**.
77
103
78
104
1. Select the graph icon to visualize the data.
79
105
@@ -89,9 +115,9 @@ Datasets typically require some preprocessing before analysis. You might have no
89
115
90
116
### Remove a column
91
117
92
-
When you train a model, you have to do something about the data that's missing. In this dataset, the **normalized-losses** column is missing many values, so you exclude that column from the model altogether.
118
+
When you train a model, you have to do something about the data that's missing. In this dataset, the **normalized-losses** column is missing many values, so you will exclude that column from the model altogether.
93
119
94
-
1.Enter **Select** in the search box at the top of the palette to find the **Select Columns in Dataset** module.
120
+
1.In the module palette to the left of the canvas, expand the **Data Transformation** section and find the **Select Columns in Dataset** module.
95
121
96
122
1. Drag the **Select Columns in Dataset** module onto the canvas. Drop the module below the dataset module.
97
123
@@ -105,7 +131,7 @@ When you train a model, you have to do something about the data that's missing.
105
131
106
132
1. Select the **Select Columns in Dataset** module.
107
133
108
-
1. In the properties pane to the right of the canvas, select **All columns**.
134
+
1. In the module details pane to the right of the canvas, select **All columns**.
109
135
110
136
1. Select the **+** to add a new rule.
111
137
@@ -119,7 +145,7 @@ When you train a model, you have to do something about the data that's missing.
119
145
120
146
1. Select the **Select Columns in Dataset** module.
121
147
122
-
1. In the properties pane, select the **Comment** text box and enter *Exclude normalized losses*.
148
+
1. In the module details pane to the right of the canvas, select the **Comment** text box and enter *Exclude normalized losses*.
123
149
124
150
Comments will appear on the graph to help you organize your pipeline.
125
151
@@ -130,13 +156,15 @@ Your dataset still has missing values after you remove the **normalized-losses**
130
156
> [!TIP]
131
157
> Cleaning the missing values from input data is a prerequisite for using most of the modules in the designer.
132
158
133
-
1.Enter **Clean** in the search box to find the **Clean Missing Data** module.
159
+
1.In the module palette to the left of the canvas, expand the section **Data Transformation**, and find the **Clean Missing Data** module.
134
160
135
161
1. Drag the **Clean Missing Data** module to the pipeline canvas. Connect it to the **Select Columns in Dataset** module.
136
162
137
-
1. In the properties pane, select **Remove entire row** under **Cleaning mode**.
163
+
1. Select the **Clean Missing Data** module.
164
+
165
+
1. In the module details pane to the right of the canvas, select **Remove entire row** under **Cleaning mode**.
138
166
139
-
1. In the properties pane **Comment** box, enter *Remove missing value rows*.
167
+
1. In the module details pane to the right of the canvas, select the **Comment** box, and enter *Remove missing value rows*.
140
168
141
169
Your pipeline should now look something like this:
142
170
@@ -152,26 +180,28 @@ Because you want to predict price, which is a number, you can use a regression a
152
180
153
181
Splitting data is a common task in machine learning. You will split your data into two separate datasets. One dataset will train the model and the other will test how well the model performed.
154
182
155
-
1. Enter **split data** in the search box to find the **Split Data** module. Connect the left port of the **Clean Missing Data** module to the **Split Data** module.
183
+
1. In the module palette, expand the section **Data Transformation** and find the **Split Data** module.
184
+
185
+
1. Drag the **Split Data** module to the pipeline canvas.
186
+
187
+
1. Connect the left port of the **Clean Missing Data** module to the **Split Data** module.
156
188
157
189
> [!IMPORTANT]
158
190
> Be sure that the left output ports of **Clean Missing Data** connects to **Split Data**. The left port contains the the cleaned data. The right port contains the discarted data.
159
191
160
192
1. Select the **Split Data** module.
161
193
162
-
1. In the properties pane, set the **Fraction of rows in the first output dataset** to 0.7.
194
+
1. In the module details pane to the right of the canvas, set the **Fraction of rows in the first output dataset** to 0.7.
163
195
164
196
This option splits 70 percent of the data to train the model and 30 percent for testing it. The 70 percent dataset will be accessible through the left output port. The remaining data will be available through the right output port.
165
197
166
-
1. In the properties pane **Comment** box, enter *Split the dataset into training set (0.7) and test set (0.3)*.
198
+
1. In the module details pane to the right of the canvas, select the **Comment** box, and enter *Split the dataset into training set (0.7) and test set (0.3)*.
167
199
168
200
### Train the model
169
201
170
202
Train the model by giving it a dataset that includes the price. The algorithm constructs a model that explains the relationship between the features and the price as presented by the training data.
171
203
172
-
1. To select the learning algorithm, clear your module palette search box.
173
-
174
-
1. Expand **Machine Learning Algorithms**.
204
+
1. In the module palette, expand **Machine Learning Algorithms**.
175
205
176
206
This option displays several categories of modules that you can use to initialize learning algorithms.
177
207
@@ -188,9 +218,11 @@ Train the model by giving it a dataset that includes the price. The algorithm co
188
218
189
219

190
220
221
+
1. In the module palette, expand the section **Module training**, and drag the **Train Model** module to the canvas.
222
+
191
223
1. Select the **Train Model** module.
192
224
193
-
1. In the properties pane, select **Edit column** selector.
225
+
1. In the module details pane to the right of the canvas, select **Edit column** selector.
194
226
195
227
1. In the **Label column** dialog box, expand the drop-down menu and select **Column names**.
196
228
@@ -200,15 +232,15 @@ Train the model by giving it a dataset that includes the price. The algorithm co
200
232
201
233

202
234
203
-
##Score a machine learning model
235
+
### Add the Score Model module
204
236
205
237
After you train your model by using 70 percent of the data, you can use it to score the other 30 percent to see how well your model functions.
206
238
207
239
1. Enter *score model* in the search box to find the **Score Model** module. Drag the module to the pipeline canvas.
208
240
209
241
1. Connect the output of the **Train Model** module to the left input port of **Score Model**. Connect the test data output (right port) of the **Split Data** module to the right input port of **Score Model**.
210
242
211
-
##Evaluate a machine learning model
243
+
### Add the Evaluate Model module
212
244
213
245
Use the **Evaluate Model** module to evaluate how well your model scored the test dataset.
214
246
@@ -222,15 +254,28 @@ Use the **Evaluate Model** module to evaluate how well your model scored the tes
Now that your pipeline is all setup, you can submit a pipeline run.
258
+
259
+
1. At the top of the canvas, select **Run**.
260
+
261
+
1. In the **Set up pipeline run** dialog box, select **+ New experiment** for the **Experiment**.
262
+
263
+
> [!NOTE]
264
+
> Experiments group similar pipeline runs together. If you run a pipeline multiple times, you can select the same experiment for successive runs.
265
+
266
+
1. Enter a descriptive name for **Experiment Name**.
267
+
268
+
1. Select **Run**.
269
+
270
+
You can view run status and details at the top right of the canvas.
226
271
227
272
### View scored labels
228
273
229
274
After the run completes, you can view the results of the pipeline run. First, look at the predictions generated by the regression model.
230
275
231
276
1. Select the **Score Model** module to view its output.
232
277
233
-
1. In the properties pane, select **Outputs** > graph icon  to view results.
278
+
1. In the module details pane to the right of the canvas, select **Outputs** > graph icon  to view results.
234
279
235
280
Here you can see the predicted prices and the actual prices from the testing data.
236
281
@@ -242,7 +287,7 @@ Use the **Evaluate Model** to see how well the trained model performed on the te
242
287
243
288
1. Select the **Evaluate Model** module to view its output.
244
289
245
-
1. In the properties pane, select **Output** > graph icon  to view results.
290
+
1. In the module details pane to the right of the canvas, select **Output** > graph icon  to view results.
246
291
247
292
The following statistics are shown for your model:
248
293
@@ -256,17 +301,12 @@ For each of the error statistics, smaller is better. A smaller value indicates t
256
301
257
302
## Clean up resources
258
303
304
+
Skip this section if you want to continue on with part 2 of the tutorial, [deploying models](tutorial-designer-automobile-price-deploy.md).
0 commit comments