Skip to content

Commit 076039a

Browse files
authored
Merge pull request #127684 from djpmsft/uxRelease
updating debug docs
2 parents cb7e641 + b3167f2 commit 076039a

17 files changed

+43
-29
lines changed
Lines changed: 43 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -1,73 +1,87 @@
11
---
22
title: Iterative development and debugging in Azure Data Factory
3-
description: Learn how to develop and debug Data Factory pipelines iteratively in the Azure portal.
4-
ms.date: 09/26/2018
3+
description: Learn how to develop and debug Data Factory pipelines iteratively in the ADF UX
4+
ms.date: 08/28/2020
55
ms.topic: conceptual
66
ms.service: data-factory
77
services: data-factory
88
documentationcenter: ''
99
ms.workload: data-services
10-
1110
author: djpmsft
1211
ms.author: daperlov
13-
manager: anandsub
12+
1413
---
1514
# Iterative development and debugging with Azure Data Factory
1615
[!INCLUDE[appliesto-adf-asa-md](includes/appliesto-adf-asa-md.md)]
1716

18-
Azure Data Factory lets you iteratively develop and debug Data Factory pipelines.
17+
Azure Data Factory lets you iteratively develop and debug Data Factory pipelines as you are developing your data integration solutions. These features allow you to test your changes before creating a pull request or publishing them to the data factory service.
1918

2019
For an eight-minute introduction and demonstration of this feature, watch the following video:
2120

2221
> [!VIDEO https://channel9.msdn.com/Shows/Azure-Friday/Iterative-development-and-debugging-with-Azure-Data-Factory/player]
2322
24-
## Iterative debugging features
25-
Create pipelines and do test runs using the **Debug** capability in the pipeline canvas without writing a single line of code.
23+
## Debugging a pipeline
2624

27-
![Debug capability on the pipeline canvas](media/iterative-development-debugging/iterative-development-image1.png)
25+
As you author using the pipeline canvas, you can test your activities using the **Debug** capability. When you do test runs, you don't have to publish your changes to the data factory before you select **Debug**. This feature is helpful in scenarios where you want to make sure that the changes work as expected before you update the data factory workflow.
2826

29-
View the results of your test runs in the **Output** window of the pipeline canvas.
27+
![Debug capability on the pipeline canvas](media/iterative-development-debugging/iterative-development-1.png)
3028

31-
![Output window of the pipeline canvas](media/iterative-development-debugging/iterative-development-image2.png)
29+
As the pipeline is running, you can see the results of each activity in the **Output** tab of the pipeline canvas.
3230

33-
After a test run succeeds, add more activities to your pipeline and continue debugging in an iterative manner. You can also **Cancel** a test run while it is in progress.
31+
View the results of your test runs in the **Output** window of the pipeline canvas.
3432

35-
![Cancel a test run](media/iterative-development-debugging/iterative-development-image3.png)
33+
![Output window of the pipeline canvas](media/iterative-development-debugging/iterative-development-2.png)
3634

37-
When you do test runs, you don't have to publish your changes to the data factory before you select **Debug**. This feature is helpful in scenarios where you want to make sure that the changes work as expected before you update the data factory workflow.
35+
After a test run succeeds, add more activities to your pipeline and continue debugging in an iterative manner. You can also **Cancel** a test run while it is in progress.
3836

3937
> [!IMPORTANT]
40-
> Selecting **Debug** actually runs the pipeline. So, for example, if the pipeline contains copy activity, the test run copies data from source to destination. As a result, we recommend that you use test folders in your copy activities and other activities when debugging. After you've debugged the pipeline, switch to the actual folders that you want to use in normal operations.
38+
> Selecting **Debug** actually runs the pipeline. For example, if the pipeline contains copy activity, the test run copies data from source to destination. As a result, we recommend that you use test folders in your copy activities and other activities when debugging. After you've debugged the pipeline, switch to the actual folders that you want to use in normal operations.
39+
40+
### Setting breakpoints
4141

42-
## Visualizing debug runs
42+
Azure Data Factory allows for you to debug a pipeline until you reach a particular activity on the pipeline canvas. Put a breakpoint on the activity until which you want to test, and select **Debug**. Data Factory ensures that the test runs only until the breakpoint activity on the pipeline canvas. This *Debug Until* feature is useful when you don't want to test the entire pipeline, but only a subset of activities inside the pipeline.
4343

44-
You can visualize all the debug runs that are in progress for your data factory in one place. Select **View debug runs** in the upper right corner of the page. This feature is useful in scenarios where you have master pipelines kicking off debug runs for child pipelines, and you want a single view to see all the active debug runs.
44+
![Breakpoints on the pipeline canvas](media/iterative-development-debugging/iterative-development-3.png)
4545

46-
![Select the View active debug runs icon](media/iterative-development-debugging/view-debug-runs-image1.png)
46+
To set a breakpoint, select an element on the pipeline canvas. A *Debug Until* option appears as an empty red circle at the upper right corner of the element.
4747

48-
![Sample list of active debug runs](media/iterative-development-debugging/view-debug-runs-image2.png)
48+
![Before setting a breakpoint on the selected element](media/iterative-development-debugging/iterative-development-4.png)
4949

50-
If you have active Data Flow debug sessions, those sessions will appear on the bottom part of the active debug window. You can select an active data flow session and stop the respective cluster.
50+
After you select the *Debug Until* option, it changes to a filled red circle to indicate the breakpoint is enabled.
5151

52-
![Sample list of active data flow debug runs](media/data-flow/dfsessions.png)
52+
![After setting a breakpoint on the selected element](media/iterative-development-debugging/iterative-development-5.png)
5353

5454
## Monitoring debug runs
5555

56-
The test runs initiated with the **Debug** capability are not available in the list on the **Monitor** tab. You can only see runs triggered with **Trigger Now**, **Schedule**, or **Tumbling Window** triggers in the **Monitor** tab. You can see the last test run initiated with the **Debug** capability in the **Output** window of the pipeline canvas.
56+
When you run a pipeline debug run, the results will appear in the **Output** window of the pipeline canvas. The output tab will only contain the most recent run that occurred during the current browser session.
5757

58-
## Setting breakpoints for debugging
58+
![Output window of the pipeline canvas](media/iterative-development-debugging/iterative-development-2.png)
5959

60-
Data Factory also lets you debug until you reach a particular activity on the pipeline canvas. Just put a breakpoint on the activity until which you want to test, and select **Debug**. Data Factory ensures that the test runs only until the breakpoint activity on the pipeline canvas. This *Debug Until* feature is useful when you don't want to test the entire pipeline, but only a subset of activities inside the pipeline.
60+
To view a historical view of debug runs or see a list of all active debug runs, you can go into the **Monitor** experience.
6161

62-
![Breakpoints on the pipeline canvas](media/iterative-development-debugging/iterative-development-image4.png)
62+
![Select the View active debug runs icon](media/iterative-development-debugging/view-debug-runs.png)
6363

64-
To set a breakpoint, select an element on the pipeline canvas. A *Debug Until* option appears as an empty red circle at the upper right corner of the element.
64+
> [!NOTE]
65+
> The Azure Data Factory service only persists debug run history for 15 days.
6566
66-
![Before setting a breakpoint on the selected element](media/iterative-development-debugging/iterative-development-image5.png)
67+
## Debugging mapping data flows
6768

68-
After you select the *Debug Until* option, it changes to a filled red circle to indicate the breakpoint is enabled.
69+
Mapping data flows allow you to build code-free data transformation logic that runs at scale. When building your logic, you can turn on a debug session to interactively work with your data using a live Spark cluster. To learn more, read about [mapping data flow debug mode](concepts-data-flow-debug-mode.md).
70+
71+
You can monitor active data flow debug sessions across a factory in the **Monitor** experience.
72+
73+
![View data flow debug sessions](media/iterative-development-debugging/view-dataflow-debug-sessions.png)
74+
75+
### Debugging a pipeline with a data flow activity
6976

70-
![After setting a breakpoint on the selected element](media/iterative-development-debugging/iterative-development-image6.png)
77+
When running a debug run with a data flow, you have two options on which compute to use. You can either use an existing debug cluster or spin up a new just-in-time cluster for your data flows.
78+
79+
Using an existing debug session will greatly reduce the data flow start up time as the cluster is already running, but is not recommended for complex or parallel workloads as it may fail when multiple jobs run at once.
80+
81+
Using the activity runtime will create a new cluster using the settings specified in each data flow activity's integration runtime. This allows each job to be isolated and should be used for complex workloads or performance testing.
82+
83+
![Running a pipeline with a dataflow](media/iterative-development-debugging/iterative-development-dataflow.png)
7184

7285
## Next steps
73-
[Continuous integration and deployment in Azure Data Factory](continuous-integration-deployment.md)
86+
87+
After testing your changes, promote them to higher environments using [continuous integration and deployment in Azure Data Factory](continuous-integration-deployment.md).
135 KB
Loading
302 KB
Loading
232 KB
Loading
37.7 KB
Loading
37.8 KB
Loading
79.6 KB
Loading

0 commit comments

Comments
 (0)