Skip to content

Commit e86c777

Browse files
committed
adding schema note
1 parent d347c11 commit e86c777

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

articles/data-factory/data-flow-aggregate.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.reviewer: daperlov
77
ms.service: data-factory
88
ms.topic: conceptual
99
ms.custom: seo-lt-2019
10-
ms.date: 10/15/2019
10+
ms.date: 03/24/2020
1111
---
1212

1313
# Aggregate transformation in mapping data flow
@@ -47,7 +47,7 @@ A common use of the aggregate transformation is removing or identifying duplicat
4747

4848
![Deduplication](media/data-flow/agg-dedupe.png "Deduplication")
4949

50-
In the above example, columns `ProductID` and `Name` are being use for grouping. If two rows have the same values for those two columns, they're considered duplicates. In this aggregate transformation, the values of the first row matched will be kept and all others will be dropped. Using column pattern syntax, all columns whose names aren't `ProductID` and `Name` are mapped to their existing column name and given the value of the first matched rows.
50+
In the above example, columns `ProductID` and `Name` are being use for grouping. If two rows have the same values for those two columns, they're considered duplicates. In this aggregate transformation, the values of the first row matched will be kept and all others will be dropped. Using column pattern syntax, all columns whose names aren't `ProductID` and `Name` are mapped to their existing column name and given the value of the first matched rows. The output schema is the same as the input schema.
5151

5252
For data validation scenarios, the `count()` function can be used to count how many duplicates there are.
5353

0 commit comments

Comments
 (0)