v0.2.1

github-actions released this 16 Oct 20:01
· 9790 commits to main since this release
651c614

Major Changes

  • PagedAttention V2 kernel: Up to 20% end-to-end latency reduction
  • Support log probabilities for prompt tokens
  • AWQ support for Mistral 7B
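
To illustrate what "log probabilities for prompt tokens" refers to, here is a minimal sketch in plain Python. It does not use this project's API: the function names (`log_softmax`, `prompt_token_logprobs`) and the toy logits are hypothetical and exist only for illustration. It assumes a causal language model where the logits produced at position i-1 score the token at position i, so the first prompt token has no preceding context and gets no log probability.

```python
import math
from typing import List, Optional


def log_softmax(logits: List[float]) -> List[float]:
    """Numerically stable log-softmax over one vocabulary-sized logit vector."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def prompt_token_logprobs(
    prompt_ids: List[int],
    logits_per_position: List[List[float]],
) -> List[Optional[float]]:
    """Return log p(token_i | tokens_<i) for every prompt token.

    logits_per_position[i] is assumed to hold the next-token logits the
    model produced after reading prompt_ids[: i + 1] (hypothetical input).
    """
    # The first token has no context to condition on, so it has no logprob.
    result: List[Optional[float]] = [None]
    for i in range(1, len(prompt_ids)):
        logprobs = log_softmax(logits_per_position[i - 1])
        result.append(logprobs[prompt_ids[i]])
    return result


# Toy example: vocabulary of 4 tokens and uniform logits at every position,
# so each scored prompt token has probability 1/4.
toy_prompt = [2, 0, 3]
toy_logits = [[0.0, 0.0, 0.0, 0.0] for _ in toy_prompt]
toy_result = prompt_token_logprobs(toy_prompt, toy_logits)
```

In this toy case every scored token receives `math.log(0.25)`. Summing the non-`None` entries gives the log-likelihood of the prompt, which is the usual reason to request these values (for example, when scoring or ranking candidate texts).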

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1