v0.6.0
What's Changed
- fix: enabling block-by-block evaluation for granite-3.x-models by @bayo-ibm in #165
- fix: pylint false alarm on libdevice functions by @chichun-charlie-liu in #166
- fix: Add version limits for torchao, ensure compat with 0.12 + AIU by @ani300 in #168
- feat: Change paged FP8 prefill back to regular attention by @ani300 in #171
- feat: FP8 requested changes by @ani300 in #173
- chore(deps): Update triton requirement from <3.4,>=3.0 to >=3.0,<3.5 by @dependabot[bot] in #170
- chore(deps): Update transformers requirement from <4.54,>=4.45 to >=4.45,<4.56 by @dependabot[bot] in #172
- fix: FP8 TP fixes by @ani300 in #176
Full Changelog: v0.5.0...v0.6.0