You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-designer-transform-data.md
+11-7Lines changed: 11 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,19 +37,21 @@ Use the following steps to import the sample dataset.
37
37
38
38
1. Sign in to <ahref="https://ml.azure.com?tabs=jre"target="_blank">ml.azure.com</a>, and select the workspace you want to work with.
39
39
40
-
1. Go to the designer and create a new pipeline by selecting **Easy-to-use prebuilt modules**.
40
+
1. Go to the designer. Select **Easy-to-use-prebuild modules** to create a new pipeline.
41
41
42
42
1. Select a default compute target to run the pipeline.
43
43
44
-
1. To the left of the pipeline canvas is a palette of datasets and modules. Select **Datasets**, and then view the **Samples** section.
44
+
1. To the left of the pipeline canvas is a palette of datasets and modules. Select **Datasets**. Then view the **Samples** section.
45
45
46
46
1. Drag and drop the **Adult Census Income Binary classification** dataset onto the canvas.
47
47
48
48
1. Select the **Adult Census Income** dataset module.
49
49
50
-
1. In the details pane that appears to the right of the canvas, select **Outputs**. Select the visualize icon .
50
+
1. In the details pane that appears to the right of the canvas, select **Outputs**.
51
51
52
-
1. Use the data preview window to explore the dataset. Take note of the "native-country" column values.
52
+
1. Select the visualize icon .
53
+
54
+
1. Use the data preview window to explore the dataset. Take special note of the "native-country" column values.
53
55
54
56
### Split the data
55
57
@@ -90,7 +92,7 @@ Now that your pipeline is set up to split the data, you need to specify where to
90
92
91
93
1. Select the **Export Data** module that is connected to the *left*-most port of the **Split Data** module.
92
94
93
-
The order of the output ports matter for the **Split Data** module. The first output port contains the rows where the regular expression is true. In this case, the first port contains rows for the US-based income, and the second port contains rows for the non-US based income.
95
+
The order of the output ports matter for the **Split Data** module. The first output port contains the rows where the regular expression is true. In this case, the first port contains rows for US-based income, and the second port contains rows for non-US based income.
94
96
95
97
1. In the module details pane to the right of the canvas, set the following options:
96
98
@@ -133,15 +135,17 @@ Now that your pipeline is setup to split and export the data, submit a pipeline
133
135
134
136
1. At the top of the canvas, select **Submit**.
135
137
136
-
1. In the **Set up pipeline run** dialog, select **Create new**.
138
+
1. In the **Set up pipeline run** dialog, select **Create new** to createa n experiment.
139
+
140
+
Experiments logically group together related pipeline runs. If you run this pipeline in the future, you should use the same experiment for logging and tracking purposes.
137
141
138
142
1. Provide a descriptive experiment name like "split-census-data".
139
143
140
144
1. Select **Submit**.
141
145
142
146
## View results
143
147
144
-
After the pipeline finishes running, you can view your results by navigating to your blob storage in the Azure portal. You can also view the intermediary results of the **Split Data** module to confirm that your data has split correctly.
148
+
After the pipeline finishes running, you can view your results by navigating to your blob storage in the Azure portal. You can also view the intermediary results of the **Split Data** module to confirm that your data has been split correctly.
0 commit comments