You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/algorithm-module-reference/export-data.md
+10-4Lines changed: 10 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,13 +9,13 @@ ms.topic: reference
9
9
10
10
author: likebupt
11
11
ms.author: keli19
12
-
ms.date: 10/22/2019
12
+
ms.date: 02/22/2020
13
13
---
14
14
# Export Data module
15
15
16
16
This article describes a module in Azure Machine Learning designer (preview).
17
17
18
-
Use this module to save results, intermediate data, and working data from your pipelines into cloud storage destinations outside Azure Machine Learning.
18
+
Use this module to save results, intermediate data, and working data from your pipelines into cloud storage destinations.
19
19
20
20
This module supports exporting your data to the following cloud data services:
21
21
@@ -24,7 +24,7 @@ This module supports exporting your data to the following cloud data services:
24
24
- Azure Data Lake
25
25
- Azure Data Lake Gen2
26
26
27
-
Before exporting your data, you need to first register a datastore in your Azure Machine Learning workspace first. For more information, see [Access data in Azure storage services](../how-to-access-data.md).
27
+
Before exporting your data, you need to first register a datastore in your Azure Machine Learning workspace. For more information, see [Access data in Azure storage services](../how-to-access-data.md).
28
28
29
29
## How to configure Export Data
30
30
@@ -36,7 +36,13 @@ Before exporting your data, you need to first register a datastore in your Azure
36
36
37
37
1. For **Datastore**, select an existing datastore from the dropdown list. You can also create a new datastore. Check how by visiting [Access data in Azure storage services](../how-to-access-data.md).
38
38
39
-
1. Define the path in the datastore to write the data to.
39
+
1. The checkbox, **Regenerate output**, decides whether to execute the module to regenerate output at running time.
40
+
41
+
It's by default unselected, which means if the module has been executed with the same parameters previously, the system will reuse the output from last run to reduce run time.
42
+
43
+
If it is selected, the system will execute the module again to regenerate output.
44
+
45
+
1. Define the path in the datastore where the data is. The path is a relative path. The empty paths or a URL paths are not allowed.
40
46
41
47
42
48
1. For **File format**, select the format in which data should be stored.
Copy file name to clipboardExpand all lines: articles/machine-learning/algorithm-module-reference/import-data.md
+6-3Lines changed: 6 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -43,9 +43,7 @@ If your source data changes, you can refresh the dataset and add new data by rer
43
43
44
44
1. Add the **Import Data** module to your pipeline. You can find this module in the **Data Input and Output** category in the designer.
45
45
46
-
1. Click **Launch Data Import Wizard** to configure the data source using a wizard.
47
-
48
-
The wizard gets the account name and credentials, and help you configure other options. If you are editing an existing configuration, it loads the current values first.
46
+
1. Select the module to open the right pane.
49
47
50
48
1. Select **Data source**, and choose the data source type. It could be HTTP or datastore.
51
49
@@ -56,6 +54,11 @@ If your source data changes, you can refresh the dataset and add new data by rer
1. The checkbox, **Regenerate output**, decides whether to execute the module to regenerate output at running time.
58
+
59
+
It's by default unselected, which means if the module has been executed with the same parameters previously, the system will reuse the output from last run to reduce run time.
60
+
61
+
If it is selected, the system will execute the module again to regenerate output. So select this option when underlying data in storage is updated, it can help to get the latest data.
0 commit comments