0.0.9

@github-actions released this 13 Oct 21:42
· 68 commits to master since this release
  • Lock MCG and MUL1 multipliers; they are no longer flagged as experimental
  • Switch to MCG codebook by default for new models (use --codebook 3inst for the previous default)
  • Add more calibration data
  • Increase default calibration size to 250 rows (use --cal_rows 100 for the previous default)
  • Fix quantized cache for bsz > 1
  • Fix kernel selection on A100
  • A few more TP-related fixes

Full Changelog: v0.0.8...v0.0.9