Skip to content

v2.4.0

Choose a tag to compare

@Jintao-Huang Jintao-Huang released this 13 Sep 04:50
· 1599 commits to main since this release

English Version

New Features:

  1. Support for Liger, which accommodates models like LLaMA, Qwen, Mistral, etc., and reduces memory usage by 10% to 60%.
  2. Support for custom loss function training using a registration mechanism.
  3. Training now supports pushing models to ModelScope and HuggingFace.
  4. Support for the freeze_vit parameter to control the behavior of full parameter training for multimodal models.

New Models:

  1. Qwen2-VL series includes GPTQ/AWQ quantized models. For best practices, see here.
  2. InternVL2 AWQ quantized models.

New Datasets:

  1. qwen2-pro series

中文版

新特性:

  1. 支持 Liger训练LLaMA、Qwen、Mistral 等模型,内存使用降低 10% 至 60%。
  2. 支持使用注册机制进行自定义损失函数的训练。
  3. 训练支持将模型推送至 ModelScope 和 HuggingFace。
  4. 支持 freeze_vit 参数,以控制多模态模型全参数训练的行为。

新模型:

  1. Qwen2-VL 系列包括 GPTQ/AWQ 量化模型,最佳实践可以查看这里
  2. InternVL2 AWQ 量化模型。

新数据集:

  1. qwen2-pro 系列

What's Changed

Full Changelog: v2.3.2...v2.4.0