Skip to content

v1.3.0 Release

Choose a tag to compare

@tastelikefeet tastelikefeet released this 09 Nov 03:55
· 2468 commits to main since this release

English Version

New Features:

  1. Serving supported: LoRA and full-parameter training models are supported in vllm/chatglm.cpp/xinference deployment, check the documentation for details by make docs or docs/source/GetStarted/Deployment.md file.
  2. Support training and inference with CLI and inference with Web-UI.

New Adapters:

  • QALoRA
  • Long-LoRA
  • ROME

New Models:

  1. xverse-65b
  2. yi-6b
  3. ziya2-13b
  4. ziya2-13b-chat
  5. mistral-7b
  6. openbuddy-mistral-7b-chat
  7. mistral-7b-chat
  8. chatglm3-6b-base
  9. chatglm3-6b
  10. chatglm3-6b-32k

New Quantized Models:

  1. qwen-7b-chat-int4
  2. qwen-14b-chat-int4
  3. qwen-vl-chat-int4
  4. baichuan2-7b-chat-int4
  5. baichuan2-13b-chat-int4
  6. qwen-7b-chat-int8
  7. qwen-14b-chat-int8

中文版
新功能:

  1. 支持部署:全参数训练及LoRA训练支持以vllm/chatglm.cpp/xinference方式进行部署,可以通过make docs生成官方文档或查看docs/source/GetStarted/Deployment.md文件
  2. 支持CLI方式运行训练和推理,同时支持以Web-UI方式运行推理

新的Adapters:

  • QALoRA
  • Long-LoRA
  • ROME

支持训练和推理的新模型:

  1. xverse-65b
  2. yi-6b
  3. ziya2-13b
  4. ziya2-13b-chat
  5. mistral-7b
  6. openbuddy-mistral-7b-chat
  7. mistral-7b-chat
  8. chatglm3-6b-base
  9. chatglm3-6b
  10. chatglm3-6b-32k

支持训练和推理的新量化模型:

  1. qwen-7b-chat-int4
  2. qwen-14b-chat-int4
  3. qwen-vl-chat-int4
  4. baichuan2-7b-chat-int4
  5. baichuan2-13b-chat-int4
  6. qwen-7b-chat-int8
  7. qwen-14b-chat-int8

Feature Commits

Bug Fix:

New Contributors

  • @zzclynn made their first contribution in #105

Full Changelog: v1.2.0...v1.3.0