Skip to content

Commit 3d54fbd

Browse files
authored
Merge pull request #168395 from Clare-Zheng82/0805-Update_CopyDataTool_Gen1-Gen2
Update Copy Data Tool (from gen1 to gen2) Document
2 parents bc90b72 + dbff3cc commit 3d54fbd

12 files changed

+32
-36
lines changed

articles/data-factory/load-azure-data-lake-storage-gen2-from-gen1.md

Lines changed: 32 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: data-factory
77
ms.subservice: data-movement
88
ms.topic: conceptual
99
ms.custom: seo-lt-2019
10-
ms.date: 07/05/2021
10+
ms.date: 08/06/2021
1111
---
1212

1313
# Copy data from Azure Data Lake Storage Gen1 to Gen2 with Azure Data Factory
@@ -58,75 +58,71 @@ This article shows you how to use the Data Factory copy data tool to copy data f
5858
1. On the home page, select the **Ingest** tile to launch the copy data tool.
5959

6060
![Screenshot that shows the ADF home page.](./media/doc-common-process/get-started-page.png )
61-
2. On the **Properties** page, specify **CopyFromADLSGen1ToGen2** for the **Task name** field. Select **Next**.
6261

63-
![Properties page](./media/load-azure-data-lake-storage-gen2-from-gen1/copy-data-tool-properties-page.png)
64-
3. On the **Source data store** page, select **+ Create new connection**.
62+
2. On the **Properties** page, choose **Built-in copy task** under **Task type**, and choose **Run once now** under **Task cadence or task schedule**, then select **Next**.
6563

66-
![Source data store page](./media/load-azure-data-lake-storage-gen2-from-gen1/source-data-store-page.png)
64+
3. On the **Source data store** page, select **+ New connection**.
6765

6866
4. Select **Azure Data Lake Storage Gen1** from the connector gallery, and select **Continue**.
6967

7068
![Source data store Azure Data Lake Storage Gen1 page](./media/load-azure-data-lake-storage-gen2-from-gen1/source-data-store-page-adls-gen1.png)
7169

72-
5. On the **Specify Azure Data Lake Storage Gen1 connection** page, follow these steps:
73-
74-
a. Select your Data Lake Storage Gen1 for the account name, and specify or validate the **Tenant**.
75-
76-
b. Select **Test connection** to validate the settings. Then select **Finish**.
70+
5. On the **New connection (Azure Data Lake Storage Gen1)** page, follow these steps:
71+
1. Select your Data Lake Storage Gen1 for the account name, and specify or validate the **Tenant**.
72+
1. Select **Test connection** to validate the settings. Then select **Create**.
7773

78-
c. You see that a new connection was created. Select **Next**.
79-
8074
> [!IMPORTANT]
8175
> In this walk-through, you use a managed identity for Azure resources to authenticate your Azure Data Lake Storage Gen1. To grant the managed identity the proper permissions in Azure Data Lake Storage Gen1, follow [these instructions](connector-azure-data-lake-store.md#managed-identity).
8276
8377
![Specify Azure Data Lake Storage Gen1 account](./media/load-azure-data-lake-storage-gen2-from-gen1/specify-adls-gen1-account.png)
8478

85-
6. On the **Choose the input file or folder** page, browse to the folder and file that you want to copy over. Select the folder or file, and select **Choose**.
86-
87-
![Choose input file or folder](./media/load-azure-data-lake-storage-gen2-from-gen1/choose-input-folder.png)
88-
89-
7. Specify the copy behavior by selecting the **Copy files recursively** and **Binary copy** options. Select **Next**.
90-
91-
![Screenshot shows the Choose the input file or folder where you can select Copy file recursively and Binary Copy.](./media/load-azure-data-lake-storage-gen2-from-gen1/specify-binary-copy.png)
79+
6. On the **Source data store** page, complete the following steps.
80+
1. Select the newly created connection in the **Connection** section.
81+
1. Under **File or folder**, browse to the folder and file that you want to copy over. Select the folder or file, and select **OK**.
82+
1. Specify the copy behavior by selecting the **Recursively** and **Binary copy** options. Select **Next**.
9283

93-
8. On the **Destination data store** page, select **+ Create new connection** > **Azure Data Lake Storage Gen2** > **Continue**.
84+
:::image type="content" source="./media/load-azure-data-lake-storage-gen2-from-gen1/source-data-store-page.png" alt-text="Screenshot showing the source data store page.":::
85+
86+
7. On the **Destination data store** page, select **+ New connection** > **Azure Data Lake Storage Gen2** > **Continue**.
9487

95-
![Destination data store page](./media/load-azure-data-lake-storage-gen2-from-gen1/destination-data-storage-page.png)
88+
![Destination data store page](./media/load-azure-data-lake-storage-gen2-from-gen1/destination-data-store-page-adls-gen2.png)
9689

97-
9. On the **Specify Azure Data Lake Storage Gen2 connection** page, follow these steps:
90+
8. On the **New connection (Azure Data Lake Storage Gen2)** page, follow these steps:
91+
1. Select your Data Lake Storage Gen2 capable account from the **Storage account name** drop-down list.
92+
1. Select **Create** to create the connection.
9893

99-
a. Select your Data Lake Storage Gen2 capable account from the **Storage account name** drop-down list.
100-
101-
b. Select **Finish** to create the connection. Then select **Next**.
102-
10394
![Specify Azure Data Lake Storage Gen2 account](./media/load-azure-data-lake-storage-gen2-from-gen1/specify-adls-gen2-account.png)
10495

105-
10. On the **Choose the output file or folder** page, enter **copyfromadlsgen1** as the output folder name, and select **Next**. Data Factory creates the corresponding Azure Data Lake Storage Gen2 file system and subfolders during copy if they don't exist.
96+
9. On the **Destination data store** page, complete the following steps.
97+
1. Select the newly created connection in the **Connection** block.
98+
1. Under **Folder path**, enter **copyfromadlsgen1** as the output folder name, and select **Next**. Data Factory creates the corresponding Azure Data Lake Storage Gen2 file system and subfolders during copy if they don't exist.
10699

107-
![Screenshot shows the folder path you enter.](./media/load-azure-data-lake-storage-gen2-from-gen1/specify-adls-gen2-path.png)
100+
:::image type="content" source="./media/load-azure-data-lake-storage-gen2-from-gen1/destination-data-store-page.png" alt-text="Screenshot showing the destination data store page.":::
108101

109-
11. On the **Settings** page, select **Next** to use the default settings.
102+
10. On the **Settings** page, specify **CopyFromADLSGen1ToGen2** for the **Task name** field, then select **Next** to use the default settings.
110103

111-
12. On the **Summary** page, review the settings, and select **Next**.
104+
105+
11. On the **Summary** page, review the settings, and select **Next**.
112106

113107
![Summary page](./media/load-azure-data-lake-storage-gen2-from-gen1/copy-summary.png)
114-
13. On the **Deployment page**, select **Monitor** to monitor the pipeline.
108+
109+
12. On the **Deployment page**, select **Monitor** to monitor the pipeline.
115110

116111
![Deployment page](./media/load-azure-data-lake-storage-gen2-from-gen1/deployment-page.png)
117-
14. Notice that the **Monitor** tab on the left is automatically selected. The **Actions** column includes links to view activity run details and to rerun the pipeline.
112+
113+
13. Notice that the **Monitor** tab on the left is automatically selected. The **Pipeline name** column includes links to view activity run details and to rerun the pipeline.
118114

119115
![Monitor pipeline runs](./media/load-azure-data-lake-storage-gen2-from-gen1/monitor-pipeline-runs.png)
120116

121-
15. To view activity runs that are associated with the pipeline run, select the **View Activity Runs** link in the **Actions** column. There's only one activity (copy activity) in the pipeline, so you see only one entry. To switch back to the pipeline runs view, select the **Pipelines** link at the top. Select **Refresh** to refresh the list.
117+
14. To view activity runs that are associated with the pipeline run, select the link in the **Pipeline name** column. There's only one activity (copy activity) in the pipeline, so you see only one entry. To switch back to the pipeline runs view, select the **All pipeline runs** link in the breadcrumb menu at the top. Select **Refresh** to refresh the list.
122118

123119
![Monitor activity runs](./media/load-azure-data-lake-storage-gen2-from-gen1/monitor-activity-runs.png)
124120

125-
16. To monitor the execution details for each copy activity, select the **Details** link (eyeglasses image) under **Actions** in the activity monitoring view. You can monitor details like the volume of data copied from the source to the sink, data throughput, execution steps with corresponding duration, and used configurations.
121+
15. To monitor the execution details for each copy activity, select the **Details** link (eyeglasses image) under the **Activity name** column in the activity monitoring view. You can monitor details like the volume of data copied from the source to the sink, data throughput, execution steps with corresponding duration, and used configurations.
126122

127-
![Monitor activity run details](./media/load-azure-data-lake-storage-gen2-from-gen1/monitor-activity-run-details.png)
123+
:::image type="content" source="./media/load-azure-data-lake-storage-gen2-from-gen1/monitor-activity-run-details.png" alt-text="Screenshot showing the activity run details.":::
128124

129-
17. Verify that the data is copied into your Azure Data Lake Storage Gen2 account.
125+
16. Verify that the data is copied into your Azure Data Lake Storage Gen2 account.
130126

131127
## Best practices
132128

-16.7 KB
Loading
4.45 KB
Loading
24.4 KB
Loading
55.7 KB
Loading
26.5 KB
Loading
3.6 KB
Loading
-2.99 KB
Loading
-46.9 KB
Loading
681 Bytes
Loading

0 commit comments

Comments
 (0)