You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -292,6 +292,7 @@ swift sft \
292
292
```
293
293
294
294
#### Deepspeed Training
295
+
Deepspeed supports training of quantized GPTQ and AWQ models.
295
296
296
297
ZeRO2:
297
298
```shell
@@ -432,6 +433,7 @@ CUDA_VISIBLE_DEVICES=0 swift deploy \
432
433
```
433
434
434
435
### Supported Models
436
+
The complete list of supported models and datasets can be found at [Supported Models and Datasets List](https://idealab.alibaba-inc.com/docs/source/LLM/Supported-Models-and-Datasets.md).
435
437
436
438
#### LLMs
437
439
@@ -470,7 +472,8 @@ CUDA_VISIBLE_DEVICES=0 swift deploy \
470
472
| c4ai-command-r |[c4ai](https://cohere.com/command)| Multilingual | 35B-104B | chat model |
471
473
| WizardLM2 |[WizardLM2 series models](https://github.com/nlpxucan/WizardLM)| English | 7B-8x22B<br>including quantized versions | chat model<br>MoE model |
472
474
| Atom |[Atom](https://github.com/LlamaFamily/Llama-Chinese)| Chinese | 7B| base model<br>chat model|
473
-
| Chinese-LLaMA-Alpaca-2 |[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)| Chinese | 1.3B-13B| base model<br>chat model<br>long text model|
475
+
| Chinese-LLaMA-Alpaca-2 |[Chinese-LLaMA-Alpaca-2](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2)| Chinese | 1.3B-13B| base model<br>chat model<br>long text model |
476
+
| ModelScope-Agent |[ModelScope Agent series models](https://github.com/modelscope/modelscope-agent)| Chinese | 7B-14B| agent model |
Copy file name to clipboardExpand all lines: docs/source_en/LLM/Customization.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,7 +26,7 @@ The corresponding example sh script can be found [here](https://github.com/model
26
26
27
27
2.`--custom_val_dataset_path`: The default value is `[]`, indicating not to use a custom validation dataset. If you specify `custom_train_dataset_path`, then the validation set of the custom dataset will be split according to the command line argument `dataset_test_ratio`.
28
28
29
-
The script supports file formats including `csv`, `json`, and `jsonl`. You need to ensure the passed in files conform to the following dataset formats. csv files only support instruction tuning, i.e. the case without history. json and jsonl files support system and history.
29
+
The supported file formats for the script include `csv`, `json`, and `jsonl`. You need to ensure that the incoming files conform to the following dataset formats. Both `json` and `jsonl` formats support multi-turn dialogues (`csv` does not support this).
0 commit comments