Skip to content

v0.6.0

Choose a tag to compare

@tharapalanivel tharapalanivel released this 07 Aug 15:40
· 68 commits to main since this release
207eb06

What's Changed

  • fix: enabling block-by-block evaluation for granite-3.x-models by @bayo-ibm in #165
  • fix: pylint false alarm on libdevice functions by @chichun-charlie-liu in #166
  • fix: Add version limits for torchao, ensure compat with 0.12 + AIU by @ani300 in #168
  • feat: Change paged FP8 prefill back to regular attention by @ani300 in #171
  • feat: FP8 requested changes by @ani300 in #173
  • chore(deps): Update triton requirement from <3.4,>=3.0 to >=3.0,<3.5 by @dependabot[bot] in #170
  • chore(deps): Update transformers requirement from <4.54,>=4.45 to >=4.45,<4.56 by @dependabot[bot] in #172
  • fix: FP8 TP fixes by @ani300 in #176

Full Changelog: v0.5.0...v0.6.0