Commit 3714bce

fix: newline
Signed-off-by: Will Johnson <[email protected]>
1 parent 2cc74c1 commit 3714bce

1 file changed: +1, -0 lines changed

docs/advanced-data-preprocessing.md

Lines changed: 1 addition & 0 deletions
@@ -15,6 +15,7 @@ Data config is a YAML configuration file which users can provide to `sft_trainer
 The data config schema is designed to define datasets and their processing strategies in a structured way. It consists of the following top-level keys:
 - `datapreprocessor`: Defines global data processing parameters, such as the type (`default`), sampling stopping strategy (`all_exhausted` or `first_exhausted`), and sampling seed for reproducibility.
 - `datasets`: A list of dataset configurations, each describing the dataset name, paths, optional builders, sampling ratios, and data handlers.
+
 At the top level, the data config looks like this:
 ```yaml
 datapreprocessor:
