Skip to content

Commit 376f2cb

Browse files
committed
Edit pass
1 parent c79e160 commit 376f2cb

File tree

1 file changed

+11
-7
lines changed

1 file changed

+11
-7
lines changed

articles/machine-learning/how-to-designer-transform-data.md

Lines changed: 11 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -37,19 +37,21 @@ Use the following steps to import the sample dataset.
3737

3838
1. Sign in to <a href="https://ml.azure.com?tabs=jre" target="_blank">ml.azure.com</a>, and select the workspace you want to work with.
3939

40-
1. Go to the designer and create a new pipeline by selecting **Easy-to-use prebuilt modules**.
40+
1. Go to the designer. Select **Easy-to-use-prebuild modules** to create a new pipeline.
4141

4242
1. Select a default compute target to run the pipeline.
4343

44-
1. To the left of the pipeline canvas is a palette of datasets and modules. Select **Datasets**, and then view the **Samples** section.
44+
1. To the left of the pipeline canvas is a palette of datasets and modules. Select **Datasets**. Then view the **Samples** section.
4545

4646
1. Drag and drop the **Adult Census Income Binary classification** dataset onto the canvas.
4747

4848
1. Select the **Adult Census Income** dataset module.
4949

50-
1. In the details pane that appears to the right of the canvas, select **Outputs**. Select the visualize icon ![visualize icon](media/how-to-designer-transform-data/visualize-icon.png).
50+
1. In the details pane that appears to the right of the canvas, select **Outputs**.
5151

52-
1. Use the data preview window to explore the dataset. Take note of the "native-country" column values.
52+
1. Select the visualize icon ![visualize icon](media/how-to-designer-transform-data/visualize-icon.png).
53+
54+
1. Use the data preview window to explore the dataset. Take special note of the "native-country" column values.
5355

5456
### Split the data
5557

@@ -90,7 +92,7 @@ Now that your pipeline is set up to split the data, you need to specify where to
9092

9193
1. Select the **Export Data** module that is connected to the *left*-most port of the **Split Data** module.
9294

93-
The order of the output ports matter for the **Split Data** module. The first output port contains the rows where the regular expression is true. In this case, the first port contains rows for the US-based income, and the second port contains rows for the non-US based income.
95+
The order of the output ports matter for the **Split Data** module. The first output port contains the rows where the regular expression is true. In this case, the first port contains rows for US-based income, and the second port contains rows for non-US based income.
9496

9597
1. In the module details pane to the right of the canvas, set the following options:
9698

@@ -133,15 +135,17 @@ Now that your pipeline is setup to split and export the data, submit a pipeline
133135

134136
1. At the top of the canvas, select **Submit**.
135137

136-
1. In the **Set up pipeline run** dialog, select **Create new**.
138+
1. In the **Set up pipeline run** dialog, select **Create new** to createa n experiment.
139+
140+
Experiments logically group together related pipeline runs. If you run this pipeline in the future, you should use the same experiment for logging and tracking purposes.
137141

138142
1. Provide a descriptive experiment name like "split-census-data".
139143

140144
1. Select **Submit**.
141145

142146
## View results
143147

144-
After the pipeline finishes running, you can view your results by navigating to your blob storage in the Azure portal. You can also view the intermediary results of the **Split Data** module to confirm that your data has split correctly.
148+
After the pipeline finishes running, you can view your results by navigating to your blob storage in the Azure portal. You can also view the intermediary results of the **Split Data** module to confirm that your data has been split correctly.
145149

146150
1. Select the **Split Data** module.
147151

0 commit comments

Comments
 (0)