Commit b0bedb8

Merge pull request #219617 from dearandyxu/master
add SAP CDC template
2 parents 257056f + f1fabb5 commit b0bedb8

7 files changed

+67
-0
lines changed

articles/data-factory/TOC.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1108,6 +1108,8 @@ items:
11081108
href: solution-template-bulk-copy-from-files-to-database.md
11091109
- name: Delta copy from database
11101110
href: solution-template-delta-copy-with-control-table.md
1111+
- name: Replicate data from SAP CDC
1112+
href: solution-template-replicate-multiple-objects-sap-cdc.md
11111113
- name: Detect and mask PII data
11121114
href: solution-template-pii-detection-and-masking.md
11131115
- name: Extract data from PDF source

articles/data-factory/concepts-change-data-capture.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -86,6 +86,12 @@ The followings are the tutorials to start the change data capture in Azure Data
8686
- [SAP CDC tutorial in ADF](sap-change-data-capture-introduction-architecture.md#sap-cdc-capabilities)
8787
- [Incrementally copy data from a source data store to a destination data store tutorials](tutorial-incremental-copy-overview.md)
8888

89+
## Templates
90+
91+
Use the following templates for change data capture in Azure Data Factory and Azure Synapse Analytics.
92+
93+
- [Replicate multiple objects from SAP via SAP CDC](solution-template-replicate-multiple-objects-sap-cdc.md)
94+
8995

9096
## Next steps
9197

Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
---
2+
title: Replicate multiple objects from SAP via SAP CDC
3+
description: Learn how to use a solution template to replicate multiple objects from SAP via SAP CDC in Azure Data Factory.
4+
author: dearandyxu
5+
ms.author: yexu
6+
ms.service: data-factory
7+
ms.subservice: tutorials
8+
ms.topic: conceptual
9+
ms.date: 11/28/2022
10+
---
11+
12+
# Replicate multiple objects from SAP via SAP CDC
13+
14+
[!INCLUDE[appliesto-adf-asa-md](includes/appliesto-adf-asa-md.md)]
15+
16+
This article describes a solution template that you can use to replicate multiple ODP objects from SAP through the SAP CDC connector to Azure Data Lake Gen2 in Delta format with key partitioning.
17+
18+
## About this solution template
19+
20+
This template reads an external control file in CSV format from your storage account. The file lists the SAP ODP contexts, SAP ODP objects, and key columns from the SAP source system, along with the containers, folders, and partitions in the Azure Data Lake Gen2 destination store. The template then copies each SAP ODP object from the SAP system to Azure Data Lake Gen2 in Delta format.
21+
22+
The template contains three activities:
23+
- **Lookup** retrieves the SAP ODP objects list to be loaded and the destination store path from an external control file on your Azure Data Lake Gen2 store.
24+
- **ForEach** gets the SAP ODP objects list from the Lookup activity and passes each object to the mapping data flow activity.
25+
- **Mapping data flow** replicates each SAP ODP object from the SAP system to Azure Data Lake Gen2 in Delta format. It performs an initial full load on the first run and incremental loads on subsequent runs automatically, merging the changes into the Delta data on Azure Data Lake Gen2.
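
As a rough conceptual sketch only, the flow of the three activities can be pictured in plain Python. This is not how the template or the mapping data flow is implemented. The function names (`replicate`, `merge_changes`, `dedupe_by_keys`), the in-memory `store`, the `read_changes` callback, and the sample rows are all hypothetical. The sketch assumes that the last row seen for a key wins, and it ignores deletes.

```python
def key_of(row, keys):
    """Build a hashable key from the configured key columns."""
    return tuple(row[k] for k in keys)


def dedupe_by_keys(rows, keys):
    """Keep only the last row seen for each key (assumption: last one wins)."""
    latest = {}
    for row in rows:
        latest[key_of(row, keys)] = row
    return list(latest.values())


def merge_changes(target, changes, keys):
    """Upsert deduplicated change rows into the target, which is keyed by key tuple."""
    for row in dedupe_by_keys(changes, keys):
        target[key_of(row, keys)] = row
    return target


def replicate(objects, read_changes, store):
    """Loop over the looked-up object list and merge each object's rows."""
    for obj in objects:  # stands in for ForEach over the Lookup output
        target = store.setdefault(obj["object"], {})
        # read_changes is a hypothetical callback that returns the full data
        # on the first run and only the changed rows on later runs.
        merge_changes(target, read_changes(obj["object"]), obj["keys"])
    return store


if __name__ == "__main__":
    store = {}
    objects = [{"object": "EXAMPLE_OBJECT_1", "keys": ["ID"]}]
    full_load = [{"ID": 1, "v": "a"}, {"ID": 2, "v": "b"}]
    delta = [{"ID": 2, "v": "b2"}, {"ID": 3, "v": "c"}, {"ID": 3, "v": "c2"}]
    replicate(objects, lambda name: full_load, store)  # first run
    replicate(objects, lambda name: delta, store)      # subsequent run
    print(sorted(r["v"] for r in store["EXAMPLE_OBJECT_1"].values()))
```

The second call only supplies changed rows, so the unchanged row with `ID` 1 survives while the rows for `ID` 2 and `ID` 3 are updated or added.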
26+
27+
This template requires an external control file in CSV format. The control file has the following schema:
28+
- *context* is your SAP ODP context from the source SAP system. You can get more details [here](sap-change-data-capture-prepare-linked-service-source-dataset.md#set-up-the-source-dataset).
29+
- *object* is your SAP ODP object name to be loaded from the SAP system. You can get more details [here](sap-change-data-capture-prepare-linked-service-source-dataset.md#set-up-the-source-dataset).
30+
- *keys* are the key column names of the SAP ODP object, used for deduplication in the mapping data flow.
31+
- *container* is your container name in the Azure Data Lake Gen2 as the destination store.
32+
- *folder* is your folder name in the Azure Data Lake Gen2 as the destination store.
33+
- *partition* is the column whose unique values define the partitions used when the Spark cluster behind the mapping data flow writes data in Delta format to Azure Data Lake Gen2. You can get more details [here](concepts-data-flow-performance.md#key).
34+
35+
:::image type="content" source="media/solution-template-replicate-multiple-objects-sap-cdc/sap-cdc-template-control-file.png" alt-text="Screenshot of SAP CDC control file.":::
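
To check a control file before uploading it, a small script along the following lines may help. It is a minimal sketch, assuming the CSV has a header row whose column names match the schema fields listed above. The sample rows, the `read_control_file` helper, and all values in `SAMPLE_CONTROL_FILE` are illustrative placeholders, not taken from the template.

```python
import csv
import io

# Column names follow the control file schema described above.
# Assumption: the file starts with a header row using exactly these names.
REQUIRED_COLUMNS = ["context", "object", "keys", "container", "folder", "partition"]

# Placeholder rows for illustration only.
SAMPLE_CONTROL_FILE = """context,object,keys,container,folder,partition
SAPI,EXAMPLE_OBJECT_1,KEY_COL_A,demo,object1,REGION
SAPI,EXAMPLE_OBJECT_2,KEY_COL_B,demo,object2,YEAR
"""


def read_control_file(text):
    """Parse control file text and return one dict per SAP ODP object."""
    reader = csv.DictReader(io.StringIO(text))
    missing = [c for c in REQUIRED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"control file is missing columns: {missing}")
    # Keep only the schema columns and trim surrounding whitespace.
    return [{c: (row[c] or "").strip() for c in REQUIRED_COLUMNS} for row in reader]


if __name__ == "__main__":
    for entry in read_control_file(SAMPLE_CONTROL_FILE):
        destination = entry["container"] + "/" + entry["folder"]
        print(entry["context"], entry["object"], "->", destination)
```

The script only verifies that the expected columns are present. It doesn't confirm that the contexts, objects, or key columns exist in your SAP system.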
36+
37+
38+
## How to use this solution template
39+
40+
1. Create a control file in CSV format and upload it to the Azure Data Lake Gen2 account that serves as the destination store. The default container for the control file is **demo** and the default control file name is **SAP2DeltaLookup.csv**.
41+
42+
:::image type="content" source="media/solution-template-replicate-multiple-objects-sap-cdc/sap-cdc-template-control-file.png" alt-text="Screenshot of SAP CDC control file.":::
43+
44+
2. Go to the **Replicate multiple tables from SAP ODP to Azure Data Lake Storage Gen2 in Delta format** template and **click** it.
45+
46+
:::image type="content" source="media/solution-template-replicate-multiple-objects-sap-cdc/sap-cdc-template-search-template.png" alt-text="Screenshot of SAP CDC search template.":::
47+
48+
3. Click **Continue** and enter the linked service connected to the SAP system as the source, and the linked service connected to Azure Data Lake Gen2 as the destination. You can get more details about the SAP CDC linked service [here](sap-change-data-capture-prepare-linked-service-source-dataset.md#set-up-a-linked-service). Make sure that your external control file is uploaded to the same Azure Data Lake Gen2 account.
49+
50+
:::image type="content" source="media/solution-template-replicate-multiple-objects-sap-cdc/sap-cdc-template-use-template.png" alt-text="Screenshot of SAP CDC use template.":::
51+
52+
4. Click **Use this template**. The pipeline is then ready to use.
53+
54+
:::image type="content" source="media/solution-template-replicate-multiple-objects-sap-cdc/sap-cdc-template-pipeline.png" alt-text="Screenshot of SAP CDC pipeline.":::
55+
56+
## Next steps
57+
58+
- [Azure Data Factory SAP CDC](sap-change-data-capture-introduction-architecture.md)
59+
- [Azure Data Factory change data capture](concepts-change-data-capture.md)
