Skip to content

Commit 8968132

Browse files
Merge pull request #79605 from kromerm/dataflow-1
Dataflow 1 - Updated Source Transform
2 parents 943f26e + cd94090 commit 8968132

File tree

1 file changed

+30
-0
lines changed

1 file changed

+30
-0
lines changed

articles/data-factory/data-flow-source.md

Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -79,10 +79,40 @@ Choose settings to manage files in your source.
7979
![New source settings](media/data-flow/source2.png "New settings")
8080

8181
* **Wildcard path**: From your source folder, choose a series of files that match a pattern. This setting overrides any file in your dataset definition.
82+
83+
Wildcard examples:
84+
85+
* ```*``` Represents any set of characters
86+
* ```**``` Represents recursive directory nesting
87+
* ```?``` Replaces one character
88+
* ```[]``` Matches one of more characters in the brackets
89+
90+
* ```/data/sales/**/*.csv``` Gets all csv files under /data/sales
91+
* ```/data/sales/20??/**``` Gets all files in the 20th century
92+
* ```/data/sales/2004/*/12/[XY]1?.csv``` Gets all csv files in 2004 in December starting with X or Y prefixed by a 2-digit number
93+
94+
Container has to be specified in the dataset. Your wildcard path must therefore also include your folder path from the root folder.
95+
8296
* **List of files**: This is a file set. Create a text file that includes a list of relative path files to process. Point to this text file.
8397
* **Column to store file name**: Store the name of the source file in a column in your data. Enter a new name here to store the file name string.
8498
* **After completion**: Choose to do nothing with the source file after the data flow runs, delete the source file, or move the source file. The paths for the move are relative.
8599

100+
To move source files to another location post-processing, first select "Move" for file operation. Then, set the "from" directory. If you are not using any wildcards for your path, then the "from" setting will be the same folder as your source folder.
101+
102+
If you have a wildcarded source path, ex:
103+
104+
```/data/sales/20??/**/*.csv```
105+
106+
You can specify "from" as
107+
108+
```/data/sales```
109+
110+
And "to" as
111+
112+
```/backup/priorSales```
113+
114+
In this case, all subdirectories under /data/sales which were sourced are moved relative to /backup/priorSales.
115+
86116
### SQL datasets
87117

88118
If your source is in SQL Database or SQL Data Warehouse, you have additional options for source file management.

0 commit comments

Comments
 (0)