Release v0.2.4

@yanghaojin yanghaojin released this 22 May 22:26
· 4 commits to main since this release
8addc3e

Added

  • Tuned the DiodeMix optimizer's hyperparameters for supervised fine-tuning (SFT).
  • Added SFT support for classical GPTQ-style models.
  • Implemented qzeros updates during the fine-tuning process.

Updated

  • Extended the pack_fp_weight function.
  • Improved the performance of the MPQLinearCUDA layer.

Fixed

  • Fixed various errors in the DiodeMix update function.