Skip to content

Commit c53843f

Browse files
authored
Merge pull request #78519 from kromerm/dataflow-1
Update concepts-data-flow-performance.md
2 parents df96a26 + 8187941 commit c53843f

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

articles/data-factory/concepts-data-flow-performance.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,11 +38,14 @@ Clicking that icon will display the execution plan and subsequent performance pr
3838

3939
![Source Part](media/data-flow/sourcepart2.png "Source Part")
4040

41-
### You can match Spark data partitioning to your source database partitioning based on a database table column key in the source transformation
41+
### Partition your source data
4242

4343
* Go to "Optimize" and select "Source". Set either a specific table column or a type in a query.
4444
* If you chose "column", then pick the partition column.
4545
* Also, set the maximum number of connections to your Azure SQL DB. You can try a higher setting to gain parallel connections to your database. However, some cases may result in faster performance with a limited number of connections.
46+
* Your source database tables do not need to be partitioned.
47+
* Setting a query in your Source transformation that matches the partitioning scheme of your database table will allow the source database engine to leverage partition elimination.
48+
* If your source is not already partitioned, ADF will still use data partitioning in the Spark transformation environment based on the key that you select in the Source transformation.
4649

4750
### Set batch size and query on source
4851

0 commit comments

Comments
 (0)