Skip to content

Commit 4441e4c

Browse files
committed
Fix bold
1 parent 26cb468 commit 4441e4c

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

docs/sft_data_preprocessing.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -94,9 +94,9 @@ Key Parameters
9494
| :--- | :--- | :--- | :--- |
9595
| **`--output_dir`** | `str` | Yes | The directory to save the final WebDataset files |
9696
| **`--json_file`** | `str` | Yes | The path to the main JSON file containing the dataset metadata. |
97-
| **`--image_dir`** | `str` | No | The directory path containing the image files. Required if `media` is `image` or `mix`. |
98-
| **`--video_dir`** | `str` | No | The directory path containing the video files. Required if `media` is `video` or `mix`. |
99-
| **`--media`** | `str` | No | The type of media to process: `image`, `video`, or `mix` (default: `mix`). |
97+
| **`--image_dir`** | `str` | No | The directory path containing the image files. Required if **media** is **image** or **mix**. |
98+
| **`--video_dir`** | `str` | No | The directory path containing the video files. Required if **media** is **image** or **mix**. |
99+
| **`--media`** | `str` | No | The type of media to process: **image**, **video**, or **mix** (default: **mix**). |
100100
| **`--maxcount`** | `int` | No | Maximum number of samples per WebDataset shard (default: $10000$). |
101101
| **`--maxsize`** | `int` | No | Maximum byte size of each shard (default: $3$ GB). |
102-
| **`--columns_messages`** | `str` | No | The key in the JSON entry that holds the conversational messages (default: `messages`). |
102+
| **`--columns_messages`** | `str` | No | The key in the JSON entry that holds the conversational messages (default: **messages**). |

0 commit comments

Comments
 (0)