@@ -96,10 +96,11 @@ SWIFT(Scalable lightWeight Infrastructure for Fine-Tuning)是一个可扩展
9696- 支持的SFT方法: [ lora] ( https://arxiv.org/abs/2106.09685 ) , [ qlora] ( https://arxiv.org/abs/2305.14314 ) , 全参数微调
9797- 支持的特性: 模型量化, DDP, 模型并行, gradient checkpointing, 支持推送ModelScope Hub, 自定义数据集, 多模态和Agent SFT, 多轮对话, ...
9898- 支持的模型
99- - 通用:
100- - qwen 系列: [qwen-1_8b-chat](https://modelscope.cn/models/qwen/Qwen-1_8B/summary), [qwen-1_8b-chat-int4](https://modelscope.cn/models/qwen/Qwen-1_8B-Chat-Int4/summary), [qwen-1_8b-chat-int8](https://modelscope.cn/models/qwen/Qwen-1_8B-Chat-Int8/summary), [qwen-7b](https://modelscope.cn/models/qwen/Qwen-7B/summary), [qwen-7b-chat](https://modelscope.cn/models/qwen/Qwen-7B-Chat/summary), [qwen-7b-chat-int4](https://modelscope.cn/models/qwen/Qwen-7B-Chat-Int4/summary), [qwen-7b-chat-int8](https://modelscope.cn/models/qwen/Qwen-7B-Chat-Int8/summary), [qwen-14b](https://modelscope.cn/models/qwen/Qwen-14B/summary), [qwen-14b-chat](https://modelscope.cn/models/qwen/Qwen-14B-Chat/summary), [qwen-14b-chat-int4](https://modelscope.cn/models/qwen/Qwen-14B-Chat-Int4/summary), [qwen-14b-chat-int8](https://modelscope.cn/models/qwen/Qwen-14B-Chat-Int8/summary), [qwen-72b](https://modelscope.cn/models/qwen/Qwen-72B/summary), [qwen-72b-chat](https://modelscope.cn/models/qwen/Qwen-72B-Chat/summary), [qwen-72b-chat-int4](https://modelscope.cn/models/qwen/Qwen-72B-Chat-Int4/summary), [qwen-72b-chat-int8](https://modelscope.cn/models/qwen/Qwen-72B-Chat-Int8/summary)
99+ - 多模态:
101100 - qwen-vl 系列: [ qwen-vl] ( https://modelscope.cn/models/qwen/Qwen-VL/summary ) , [ qwen-vl-chat] ( https://modelscope.cn/models/qwen/Qwen-VL-Chat/summary ) , [ qwen-vl-chat-int4] ( https://modelscope.cn/models/qwen/Qwen-VL-Chat-Int4/summary )
102101 - qwen-audio 系列: [ qwen-audio] ( https://modelscope.cn/models/qwen/Qwen-Audio/summary ) , [ qwen-audio-chat] ( https://modelscope.cn/models/qwen/Qwen-Audio-Chat/summary )
102+ - 通用:
103+ - qwen 系列: [qwen-1_8b-chat](https://modelscope.cn/models/qwen/Qwen-1_8B/summary), [qwen-1_8b-chat-int4](https://modelscope.cn/models/qwen/Qwen-1_8B-Chat-Int4/summary), [qwen-1_8b-chat-int8](https://modelscope.cn/models/qwen/Qwen-1_8B-Chat-Int8/summary), [qwen-7b](https://modelscope.cn/models/qwen/Qwen-7B/summary), [qwen-7b-chat](https://modelscope.cn/models/qwen/Qwen-7B-Chat/summary), [qwen-7b-chat-int4](https://modelscope.cn/models/qwen/Qwen-7B-Chat-Int4/summary), [qwen-7b-chat-int8](https://modelscope.cn/models/qwen/Qwen-7B-Chat-Int8/summary), [qwen-14b](https://modelscope.cn/models/qwen/Qwen-14B/summary), [qwen-14b-chat](https://modelscope.cn/models/qwen/Qwen-14B-Chat/summary), [qwen-14b-chat-int4](https://modelscope.cn/models/qwen/Qwen-14B-Chat-Int4/summary), [qwen-14b-chat-int8](https://modelscope.cn/models/qwen/Qwen-14B-Chat-Int8/summary), [qwen-72b](https://modelscope.cn/models/qwen/Qwen-72B/summary), [qwen-72b-chat](https://modelscope.cn/models/qwen/Qwen-72B-Chat/summary), [qwen-72b-chat-int4](https://modelscope.cn/models/qwen/Qwen-72B-Chat-Int4/summary), [qwen-72b-chat-int8](https://modelscope.cn/models/qwen/Qwen-72B-Chat-Int8/summary)
103104 - chatglm 系列: [ chatglm2-6b] ( https://modelscope.cn/models/ZhipuAI/chatglm2-6b/summary ) , [ chatglm2-6b-32k] ( https://modelscope.cn/models/ZhipuAI/chatglm2-6b-32k/summary ) , [ chatglm3-6b-base] ( https://modelscope.cn/models/ZhipuAI/chatglm3-6b-base/summary ) , [ chatglm3-6b] ( https://modelscope.cn/models/ZhipuAI/chatglm3-6b/summary ) , [ chatglm3-6b-32k] ( https://modelscope.cn/models/ZhipuAI/chatglm3-6b-32k/summary )
104105 - baichuan 系列: [ baichuan-7b] ( https://modelscope.cn/models/baichuan-inc/baichuan-7B/summary ) , [ baichuan-13b] ( https://modelscope.cn/models/baichuan-inc/Baichuan-13B-Base/summary ) , [ baichuan-13b-chat] ( https://modelscope.cn/models/baichuan-inc/Baichuan-13B-Chat/summary ) , [ baichuan2-7b] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-7B-Base/summary ) , [ baichuan2-7b-chat] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-7B-Chat/summary ) , [ baichuan2-13b] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-13B-Base/summary ) , [ baichuan2-13b-chat] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-13B-Chat/summary ) , [ baichuan2-7b-chat-int4] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-7B-Chat-4bits/summary ) , [ baichuan2-13b-chat-int4] ( https://modelscope.cn/models/baichuan-inc/Baichuan2-13B-Chat-4bits/summary )
105106 - llama 系列: [ llama2-7b] ( https://modelscope.cn/models/modelscope/Llama-2-7b-ms/summary ) , [ llama2-7b-chat] ( https://modelscope.cn/models/modelscope/Llama-2-7b-chat-ms/summary ) , [ llama2-13b] ( https://modelscope.cn/models/modelscope/Llama-2-13b-ms/summary ) , [ llama2-13b-chat] ( https://modelscope.cn/models/modelscope/Llama-2-13b-chat-ms/summary ) , [ llama2-70b] ( https://modelscope.cn/models/modelscope/Llama-2-70b-ms/summary ) , [ llama2-70b-chat] ( https://modelscope.cn/models/modelscope/Llama-2-70b-chat-ms/summary )
@@ -120,7 +121,7 @@ SWIFT(Scalable lightWeight Infrastructure for Fine-Tuning)是一个可扩展
120121 - NLP:
121122 - 通用: 🔥[ alpaca-en] ( https://modelscope.cn/datasets/AI-ModelScope/alpaca-gpt4-data-en/summary ) (gpt4), 🔥[ alpaca-zh] ( https://modelscope.cn/datasets/AI-ModelScope/alpaca-gpt4-data-zh/summary ) (gpt4), [ multi-alpaca-all] ( https://www.modelscope.cn/datasets/damo/nlp_polylm_multialpaca_sft/summary ) , [ instinwild-en] ( https://www.modelscope.cn/datasets/wyj123456/instinwild/summary ) , [ instinwild-zh] ( https://www.modelscope.cn/datasets/wyj123456/instinwild/summary ) , [ cot-en] ( https://www.modelscope.cn/datasets/YorickHe/CoT/summary ) , [ cot-zh] ( https://www.modelscope.cn/datasets/YorickHe/CoT/summary ) , [ firefly-all-zh] ( https://www.modelscope.cn/datasets/wyj123456/firefly/summary ) , [ instruct-en] ( https://www.modelscope.cn/datasets/wyj123456/instruct/summary ) , [ gpt4all-en] ( https://www.modelscope.cn/datasets/wyj123456/GPT4all/summary ) , [ sharegpt-en] ( https://www.modelscope.cn/datasets/huangjintao/sharegpt/summary ) , [ sharegpt-zh] ( https://www.modelscope.cn/datasets/huangjintao/sharegpt/summary )
122123 - Agent: [ damo-agent-zh] ( https://modelscope.cn/datasets/damo/MSAgent-Bench/summary ) , 🔥[ damo-agent-mini-zh] ( https://modelscope.cn/datasets/damo/MSAgent-Bench/summary ) , 🔥[ agent-instruct-all-en] ( https://modelscope.cn/datasets/ZhipuAI/AgentInstruct/summary )
123- - 代码: [ code-alpaca-en] ( https://www.modelscope.cn/datasets/wyj123456/code_alpaca_en/summary ) , 🔥[ leetcode-python-en] ( https://modelscope.cn/datasets/AI-ModelScope/leetcode-solutions-python/summary ) , 🔥[ codefuse-python-zh ] ( https://modelscope.cn/datasets/codefuse-ai/CodeExercise-Python-27k/summary ) , 🔥[ codefuse-evol-instruction] ( https://modelscope.cn/datasets/codefuse-ai/Evol-instruction-66k/summary )
124+ - 代码: [ code-alpaca-en] ( https://www.modelscope.cn/datasets/wyj123456/code_alpaca_en/summary ) , 🔥[ leetcode-python-en] ( https://modelscope.cn/datasets/AI-ModelScope/leetcode-solutions-python/summary ) , 🔥[ codefuse-python-en ] ( https://modelscope.cn/datasets/codefuse-ai/CodeExercise-Python-27k/summary ) , 🔥[ codefuse-evol-instruction-zh ] ( https://modelscope.cn/datasets/codefuse-ai/Evol-instruction-66k/summary )
124125 - 医疗: [ medical-en] ( https://www.modelscope.cn/datasets/huangjintao/medical_zh/summary ) , [ medical-zh] ( https://www.modelscope.cn/datasets/huangjintao/medical_zh/summary ) , [ medical-mini-zh] ( https://www.modelscope.cn/datasets/huangjintao/medical_zh/summary )
125126 - 法律: 🔥[ lawyer-llama-zh] ( https://modelscope.cn/datasets/AI-ModelScope/lawyer_llama_data/summary ) , [ tigerbot-law-zh] ( https://modelscope.cn/datasets/AI-ModelScope/tigerbot-law-plugin/summary )
126127 - 数学: 🔥[ blossom-math-zh] ( https://modelscope.cn/datasets/AI-ModelScope/blossom-math-v2/summary ) , [ school-math-zh] ( https://modelscope.cn/datasets/AI-ModelScope/school_math_0.25M/summary )
0 commit comments