Merge pull request #298886 from whhender/patch-832205

denrea · web-flow · commit beb88405b086 · 2025-04-25T14:50:48.000-07:00
Editorial updates to tutorial-copy-data-portal.md
diff --git a/articles/data-factory/tutorial-copy-data-portal.md b/articles/data-factory/tutorial-copy-data-portal.md
@@ -1,14 +1,16 @@
 ---
-title: Use the Azure portal to create a data factory pipeline
-description: This tutorial provides step-by-step instructions for using the Azure portal to create a data factory with a pipeline. The pipeline uses the copy activity to copy data from Azure Blob storage to Azure SQL Database.
+title: 'Use the Azure portal to create a data factory pipeline'
+description: This tutorial provides instructions to create a data factory with a pipeline with a copy activity to copy data from Azure Blob storage to Azure SQL Database.
 author: jianleishen
 ms.topic: tutorial
-ms.date: 10/03/2024
+ms.date: 04/25/2025
 ms.subservice: data-movement
 ms.author: jianleishen
+
+#customer intent: As a new Azure Data Factory user I want to create a data factory and quickly create my first pipeline to move data between resources, so I can apply it to my own needs.
 ---
 
-# Copy data from Azure Blob storage to a database in Azure SQL Database by using Azure Data Factory
+# Tutorial: Copy data from Azure Blob storage to a database in Azure SQL Database by using Azure Data Factory
 
 [!INCLUDE[appliesto-adf-asa-md](includes/appliesto-adf-asa-md.md)]
 
@@ -20,14 +22,16 @@ In this tutorial, you create a data factory by using the Azure Data Factory user
 In this tutorial, you perform the following steps:
 
 > [!div class="checklist"]
-> * Create a data factory.
-> * Create a pipeline with a copy activity.
+> * [Create a data factory.](#create-a-data-factory)
+> * [Create a pipeline with a copy activity.](#create-a-pipeline)
 > * Test run the pipeline.
-> * Trigger the pipeline manually.
-> * Trigger the pipeline on a schedule.
+> * [Trigger the pipeline manually.](#trigger-the-pipeline-manually)
+> * [Trigger the pipeline on a schedule.](#trigger-the-pipeline-on-a-schedule)
 > * Monitor the pipeline and activity runs.
+> * [Disable or delete your scheduled trigger.](#disable-trigger)
 
 ## Prerequisites
+
 * **Azure subscription**. If you don't have an Azure subscription, create a [free Azure account](https://azure.microsoft.com/free/) before you begin.
 * **Azure storage account**. You use Blob storage as a *source* data store. If you don't have a storage account, see [Create an Azure storage account](../storage/common/storage-account-create.md) for steps to create one.
 * **Azure SQL Database**. You use the database as a *sink* data store. If you don't have a database in Azure SQL Database, see the [Create a database in Azure SQL Database](/azure/azure-sql/database/single-database-create-quickstart) for steps to create one.
@@ -38,15 +42,16 @@ Now, prepare your Blob storage and SQL database for the tutorial by performing t
 
 #### Create a source blob
 
-1. Launch Notepad. Copy the following text, and save it as an **emp.txt** file on your disk:
+1. Launch Notepad. Copy the following text, and save it as an **emp.txt** file:
 
     ```
     FirstName,LastName
     John,Doe
     Jane,Doe
     ```
 
-1. Create a container named **adftutorial** in your Blob storage. Create a folder named **input** in this container. Then, upload the **emp.txt** file to the **input** folder. Use the Azure portal or tools such as [Azure Storage Explorer](https://storageexplorer.com/) to do these tasks.
+1. Move that file into a folder called input.
+1. Create a container named **adftutorial** in your Blob storage. Upload your **input** folder with the **emp.txt** file to this container. You can use the Azure portal or tools such as [Azure Storage Explorer](https://storageexplorer.com/) to do these tasks.
 
 #### Create a sink SQL table
 
@@ -64,13 +69,14 @@ Now, prepare your Blob storage and SQL database for the tutorial by performing t
     CREATE CLUSTERED INDEX IX_emp_ID ON dbo.emp (ID);
     ```
 
-1. Allow Azure services to access SQL Server. Ensure that **Allow access to Azure services** is turned **ON** for your SQL Server so that Data Factory can write data to your SQL Server. To verify and turn on this setting, go to logical SQL server > Overview > Set server firewall> set the **Allow access to Azure services** option to **ON**.
+1. Allow Azure services to access SQL Server. Ensure that **Allow access to Azure services** is turned **ON** for your SQL Server so that Data Factory can write data to your SQL Server. To verify and turn on this setting, go to your SQL Server in the Azure portal, select **Security** > **Networking** > enable **Selected networks**> check **Allow Azure services and resources to access this server** under the **Exceptions**.
 
 ## Create a data factory
+
 In this step, you create a data factory and start the Data Factory UI to create a pipeline in the data factory.
 
 1. Open **Microsoft Edge** or **Google Chrome**. Currently, Data Factory UI is supported only in Microsoft Edge and Google Chrome web browsers.
-2. On the left menu, select **Create a resource** > **Integration** > **Data Factory**.
+2. On the left menu, select **Create a resource** > **Analytics** > **Data Factory**.
 3. On the **Create Data Factory** page, under **Basics** tab, select the Azure **Subscription** in which you want to create the data factory.
 4. For **Resource Group**, take one of the following steps:
 
@@ -79,28 +85,20 @@ In this step, you create a data factory and start the Data Factory UI to create
     b. Select **Create new**, and enter the name of a new resource group.
     
     To learn about resource groups, see [Use resource groups to manage your Azure resources](../azure-resource-manager/management/overview.md). 
-5. Under **Region**, select a location for the data factory. Only locations that are supported are displayed in the drop-down list. The data stores (for example, Azure Storage and SQL Database) and computes (for example, Azure HDInsight) used by the data factory can be in other regions.
-6. Under **Name**, enter **ADFTutorialDataFactory**.
-
-   The name of the Azure data factory must be *globally unique*. If you receive an error message about the name value, enter a different name for the data factory. (for example, yournameADFTutorialDataFactory). For naming rules for Data Factory artifacts, see [Data Factory naming rules](naming-rules.md).
+5. Under **Region**, select a location for the data factory. Your data stores can be in a different region than your data factory, if they need to be.
+6. Under **Name**, the name of the Azure data factory must be *globally unique*. If you receive an error message about the name value, enter a different name for the data factory. (for example, yournameADFDemo). For naming rules for Data Factory artifacts, see [Data Factory naming rules](naming-rules.md).
 
     :::image type="content" source="./media/doc-common-process/name-not-available-error.png" alt-text="New data factory error message for duplicate name.":::
 
 7. Under **Version**, select **V2**.
 8. Select **Git configuration** tab on the top, and select the **Configure Git later** check box.
 9. Select **Review + create**, and select **Create** after the validation is passed.
 10. After the creation is finished, you see the notice in Notifications center. Select **Go to resource** to navigate to the Data factory page.
-11. Select **Open** on the **Open Azure Data Factory Studio** tile to launch the Azure Data Factory UI in a separate tab.
-
+11. Select **Launch Studio** on the **Azure Data Factory Studio** tile.
 
 ## Create a pipeline
-In this step, you create a pipeline with a copy activity in the data factory. The copy activity copies data from Blob storage to SQL Database. In the [Quickstart tutorial](quickstart-create-data-factory-portal.md), you created a pipeline by following these steps:
-
-1. Create the linked service.
-1. Create input and output datasets.
-1. Create a pipeline.
 
-In this tutorial, you start with creating the pipeline. Then you create linked services and datasets when you need them to configure the pipeline.
+In this step, you create a pipeline with a copy activity in the data factory. The copy activity copies data from Blob storage to SQL Database.
 
 1. On the home page, select **Orchestrate**.
 
@@ -115,14 +113,14 @@ In this tutorial, you start with creating the pipeline. Then you create linked s
 ### Configure source
 
 >[!TIP]
->In this tutorial, you use *Account key* as the authentication type for your source data store, but you can choose other supported authentication methods: *SAS URI*,*Service Principal* and *Managed Identity* if needed. Refer to corresponding sections in [this article](./connector-azure-blob-storage.md#linked-service-properties) for details.
+>In this tutorial, you use *Account key* as the authentication type for your source data store, but you can choose other supported authentication methods: *SAS URI*, *Service Principal*, and *Managed Identity* if needed. Refer to corresponding sections in [this article](./connector-azure-blob-storage.md#linked-service-properties) for details.
 >To store secrets for data stores securely, it's also recommended to use an Azure Key Vault. Refer to [this article](./store-credentials-in-key-vault.md) for detailed illustrations.
 
 1. Go to the **Source** tab. Select **+ New** to create a source dataset.
 
 1. In the **New Dataset** dialog box, select **Azure Blob Storage**, and then select **Continue**. The source data is in Blob storage, so you select **Azure Blob Storage** for the source dataset.
 
-1. In the **Select Format** dialog box, choose the format type of your data, and then select **Continue**.
+1. In the **Select Format** dialog box, choose **Delimited Text**, and then select **Continue**.
 
 1. In the **Set Properties** dialog box, enter **SourceBlobDataset** for Name. Select the checkbox for **First row as header**. Under the **Linked service** text box, select **+ New**.
 
@@ -137,15 +135,16 @@ In this tutorial, you start with creating the pipeline. Then you create linked s
     :::image type="content" source="./media/tutorial-copy-data-portal/source-dataset-selected.png" alt-text="Source dataset":::
 
 ### Configure sink
+
 >[!TIP]
 >In this tutorial, you use *SQL authentication* as the authentication type for your sink data store, but you can choose other supported authentication methods: *Service Principal* and *Managed Identity* if needed. Refer to corresponding sections in [this article](./connector-azure-sql-database.md#linked-service-properties) for details.
 >To store secrets for data stores securely, it's also recommended to use an Azure Key Vault. Refer to [this article](./store-credentials-in-key-vault.md) for detailed illustrations.
 
 1. Go to the **Sink** tab, and select **+ New** to create a sink dataset.
 
-1. In the **New Dataset** dialog box, input "SQL" in the search box to filter the connectors, select **Azure SQL Database**, and then select **Continue**. In this tutorial, you copy data to a SQL database.
+1. In the **New Dataset** dialog box, input "SQL" in the search box to filter the connectors, select **Azure SQL Database**, and then select **Continue**.
 
-1. In the **Set Properties** dialog box, enter **OutputSqlDataset** for Name. From the **Linked service** dropdown list, select **+ New**. A dataset must be associated with a linked service. The linked service has the connection string that Data Factory uses to connect to SQL Database at runtime. The dataset specifies the container, folder, and the file (optional) to which the data is copied.
+1. In the **Set Properties** dialog box, enter **OutputSqlDataset** for Name. From the **Linked service** dropdown list, select **+ New**. A dataset must be associated with a linked service. The linked service has the connection string that Data Factory uses to connect to SQL Database at runtime, and specifies where the data will be copied to.
 
 1. In the **New Linked Service (Azure SQL Database)** dialog box, take the following steps:
 
@@ -165,7 +164,7 @@ In this tutorial, you start with creating the pipeline. Then you create linked s
 
     :::image type="content" source="./media/tutorial-copy-data-portal/new-azure-sql-linked-service-window.png" alt-text="Save new linked service":::
 
-1. It automatically navigates to the **Set Properties** dialog box. In **Table**, select **[dbo].[emp]**. Then select **OK**.
+1. It automatically navigates to the **Set Properties** dialog box. In **Table**, select **Enter manually**, and enter **[dbo].[emp]**. Then select **OK**.
 
 1. Go to the tab with the pipeline, and in **Sink Dataset**, confirm that **OutputSqlDataset** is selected.
 
@@ -174,42 +173,49 @@ In this tutorial, you start with creating the pipeline. Then you create linked s
 You can optionally map the schema of the source to corresponding schema of destination by following [Schema mapping in copy activity](copy-activity-schema-and-type-mapping.md).
 
 ## Validate the pipeline
+
 To validate the pipeline, select **Validate** from the tool bar.
 
 You can see the JSON code associated with the pipeline by clicking **Code** on the upper right.
 
 ## Debug and publish the pipeline
+
 You can debug a pipeline before you publish artifacts (linked services, datasets, and pipeline) to Data Factory or your own Azure Repos Git repository.
 
 1. To debug the pipeline, select **Debug** on the toolbar. You see the status of the pipeline run in the **Output** tab at the bottom of the window.
 
 1. Once the pipeline can run successfully, in the top toolbar, select **Publish all**. This action publishes entities (datasets, and pipelines) you created to Data Factory.
 
-1. Wait until you see the **Successfully published** message. To see notification messages, click the **Show Notifications**  on the top-right (bell button).
+1. Wait until you see the **Successfully published** notification message. To see notification messages, select the **Show Notifications**  on the top-right (bell button).
 
 ## Trigger the pipeline manually
+
 In this step, you manually trigger the pipeline you published in the previous step.
 
-1. Select **Trigger** on the toolbar, and then select **Trigger Now**. On the **Pipeline Run** page, select **OK**.  
+1. Select **Add trigger** on the toolbar, and then select **Trigger Now**.
+
+1. On the **Pipeline Run** page, select **OK**.  
 
 1. Go to the **Monitor** tab on the left. You see a pipeline run that is triggered by a manual trigger. You can use links under the **PIPELINE NAME** column to view activity details and to rerun the pipeline.
 
     :::image type="content" source="./media/tutorial-copy-data-portal/monitor-pipeline-inline-and-expended.png" alt-text="Monitor pipeline runs" lightbox="./media/tutorial-copy-data-portal/monitor-pipeline-inline-and-expended.png":::
 
-1. To see activity runs associated with the pipeline run, select the **CopyPipeline** link under the **PIPELINE NAME** column. In this example, there's only one activity, so you see only one entry in the list. For details about the copy operation, select the **Details** link (eyeglasses icon) under the **ACTIVITY NAME** column. Select **All pipeline runs** at the top to go back to the Pipeline Runs view. To refresh the view, select **Refresh**.
+1. To see activity runs associated with the pipeline run, select the **CopyPipeline** link under the **PIPELINE NAME** column. In this example, there's only one activity, so you see only one entry in the list. For details about the copy operation, hover over the activity and
+1. select the **Details** link (eyeglasses icon) under the **ACTIVITY NAME** column. Select **All pipeline runs** at the top to go back to the Pipeline Runs view. To refresh the view, select **Refresh**.
 
     :::image type="content" source="./media/tutorial-copy-data-portal/view-activity-runs-inline-and-expended.png#lightbox" alt-text="Monitor activity runs" lightbox="./media/tutorial-copy-data-portal/view-activity-runs-inline-and-expended.png":::
 
 1. Verify that two more rows are added to the **emp** table in the database.
 
 ## Trigger the pipeline on a schedule
+
 In this schedule, you create a schedule trigger for the pipeline. The trigger runs the pipeline on the specified schedule, such as hourly or daily. Here you set the trigger to run every minute until the specified end datetime.
 
 1. Go to the **Author** tab on the left above the monitor tab.
 
-1. Go to your pipeline, click **Trigger** on the tool bar, and select **New/Edit**.
+1. Go to your pipeline, select **Trigger** on the tool bar, and select **New/Edit**.
 
-1. In the **Add triggers** dialog box, select **+ New** for **Choose trigger** area.
+1. In the **Add triggers** dialog box, select **Choose trigger** and select **+ New**.
 
 1. In the **New Trigger** window, take the following steps:
 
@@ -232,7 +238,7 @@ In this schedule, you create a schedule trigger for the pipeline. The trigger ru
 
 1. On the **Edit trigger** page, review the warning, and then select **Save**. The pipeline in this example doesn't take any parameters.
 
-1. Click **Publish all** to publish the change.
+1. Select **Publish all** to publish the change.
 
 1. Go to the **Monitor** tab on the left to see the triggered pipeline runs.
 
@@ -244,7 +250,22 @@ In this schedule, you create a schedule trigger for the pipeline. The trigger ru
 
 1. Verify that two rows per minute (for each pipeline run) are inserted into the **emp** table until the specified end time.
 
+## Disable trigger
+
+To disable your every minute trigger that you created, follow these steps:
+
+1. Select the **Manage** pane on the left side.
+
+1. Under **Author** select **Triggers**.
+
+1. Hover over the **RunEveryMinute** trigger you created.
+    1. Select the **Stop** button to disable the trigger from running.
+    1. Select the **Delete** button to disable and delete the trigger.
+
+1. Select **Publish all** to save your changes.
+
 ## Related content
+
 The pipeline in this sample copies data from one location to another location in Blob storage. You learned how to:
 
 > [!div class="checklist"]
@@ -254,9 +275,15 @@ The pipeline in this sample copies data from one location to another location in
 > * Trigger the pipeline manually.
 > * Trigger the pipeline on a schedule.
 > * Monitor the pipeline and activity runs.
+> * Disable or delete your scheduled trigger.
 
 
 Advance to the following tutorial to learn how to copy data from on-premises to the cloud:
 
 > [!div class="nextstepaction"]
 >[Copy data from on-premises to the cloud](tutorial-hybrid-copy-portal.md)
+
+For more information on copying data to or from Azure Blob Storage and Azure SQL Database, see these connector guides:
+
+- [Copy and transform data in Azure Blob Storage](connector-azure-blob-storage.md)
+- [Copy and transform data in Azure SQL Database](connector-azure-sql-database.md)