|
47 | 47 | - download_mode: Dataset download mode, including `reuse_dataset_if_exists` and `force_redownload`, default is reuse_dataset_if_exists. |
48 | 48 | - columns: Used for column mapping of the dataset to ensure that the dataset conforms to the format that AutoPreprocessor can handle. For more details, see [here](../Customization/Custom-dataset.md). You can pass in a JSON string, for example: `'{"text1": "query", "text2": "response"}'`, with the default being None. |
49 | 49 | - strict: If set to True, any row with an issue in the dataset will throw an error immediately, otherwise, erroneous data samples will be discarded. Default is False. |
50 | | -- 🔥model_name: Used only for self-awareness tasks, pass in the Chinese and English names of the model, separated by a space, e.g., `--model_name Xiao Huang 'Xiao Huang'`. Default is None. |
51 | | -- 🔥model_author: Used only for self-awareness tasks, pass in the Chinese and English names of the model author, separated by a space, e.g., `--model_author '魔搭' 'ModelScope'`. Default is None. |
| 50 | +- 🔥model_name: Only applicable to the self-cognition task and effective only on the `swift/self-cognition` dataset. It replaces the `{{NAME}}` placeholder in the dataset. Input the model's name in both Chinese and English, separated by a space, for example: `--model_name 小黄 'Xiao Huang'`. Default is None. |
| 51 | +- 🔥model_author: Only applicable to the self-cognition task and effective only on the `swift/self-cognition` dataset. It replaces the `{{AUTHOR}}` placeholder in the dataset. Input the model author's name in both Chinese and English, separated by a space, for example: `--model_author '魔搭' 'ModelScope'`. Default is None. |
52 | 52 | - custom_dataset_info: The path to the JSON file for custom dataset registration. Refer to [Custom Dataset](../Customization/Custom-dataset.md). Default is `[]`. |
53 | 53 |
|
54 | 54 |
|
@@ -117,6 +117,7 @@ This parameter list inherits from transformers `Seq2SeqTrainingArguments`, with |
117 | 117 | - remove_unused_columns: Whether to remove unused columns in the dataset, defaults to False. |
118 | 118 | - logging_first_step: Whether to log the first step, defaults to True. |
119 | 119 | - logging_steps: Interval for logging, defaults to 5. |
| 120 | +- predict_with_generate: Whether to use generative method during validation, default is False. |
120 | 121 | - metric_for_best_model: Defaults to None, which sets it to 'loss' when `predict_with_generate` is False, otherwise sets it to 'rouge-l'. |
121 | 122 | - greater_is_better: Defaults to None, which sets it to False when `metric_for_best_model` contains 'loss', otherwise sets to True. |
122 | 123 |
|
@@ -339,6 +340,7 @@ RLHF arguments inherit from the [training arguments](#training-arguments). |
339 | 340 | - simpo_gamma: Reward margin term in the SimPO algorithm, with a paper-suggested setting of 0.5-1.5, default is `1.`. |
340 | 341 | - desirable_weight: Loss weight $\lambda_D$ for desirable response in the KTO algorithm, default is `1.`. |
341 | 342 | - undesirable_weight: Loss weight $\lambda_U$ for undesirable response in the KTO algorithm, default is `1.`. |
| 343 | +- loss_scale: Override template arguments, default is 'last_round'. |
342 | 344 |
|
343 | 345 | #### PPO Arguments |
344 | 346 |
|
|
0 commit comments