Releases: xlite-dev/Awesome-LLM-Inference
v1.6
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.5...v1.6
v1.5
What's Changed
- add MInference 1.0 from microsoft by @liyucheng09 in https://github.com/DefTruth/Awesome-LLM-Inference/pull/20
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.3...v1.5
v1.3
What's Changed
- [MoA] MoA: Mixture of Sparse Attention for Automatic LLM Compression by @liyucheng09 in https://github.com/DefTruth/Awesome-LLM-Inference/pull/19
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.2...v1.3
v1.2
What's Changed
- Update README.md by @Kthyeon in https://github.com/DefTruth/Awesome-LLM-Inference/pull/18
New Contributors
- @Kthyeon made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/18
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.1...v1.2
v1.1
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.0...v1.1
v1.0
Full Changelog: DefTruth/Awesome-LLM-Inference@v0.9...v1.0
v0.9
What's Changed
- update [Decoding Speculative Decoding] github repo by @KylinC in https://github.com/DefTruth/Awesome-LLM-Inference/pull/16
New Contributors
- @KylinC made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/16
Full Changelog: DefTruth/Awesome-LLM-Inference@v0.8...v0.9
v0.8
Full Changelog: DefTruth/Awesome-LLM-Inference@v0.7...v0.8
v0.7
What's Changed
- LLMLingua-2 by @liyucheng09 in https://github.com/DefTruth/Awesome-LLM-Inference/pull/11
- add SnapKV by @liyucheng09 in https://github.com/DefTruth/Awesome-LLM-Inference/pull/12
- Add Microbenchmark by @Miroier in https://github.com/DefTruth/Awesome-LLM-Inference/pull/14
- [KVcache] add "Gear" paper and code of "Keyformer" by @HarryWu-CHN in https://github.com/DefTruth/Awesome-LLM-Inference/pull/13
- Update README.md by @preminstrel in https://github.com/DefTruth/Awesome-LLM-Inference/pull/15
New Contributors
- @Miroier made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/14
- @HarryWu-CHN made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/13
- @preminstrel made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/15
Full Changelog: DefTruth/Awesome-LLM-Inference@v0.6...v0.7
Awesome-LLM-Inference v0.6
What's Changed
- Add an ICLR paper for KV cache compression by @Janghyun1230 in https://github.com/DefTruth/Awesome-LLM-Inference/pull/8
- Add github link for paper FP8-Quantization[2208.09225] by @Mr-Philo in https://github.com/DefTruth/Awesome-LLM-Inference/pull/9
New Contributors
- @Janghyun1230 made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/8
- @Mr-Philo made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/9
Full Changelog: DefTruth/Awesome-LLM-Inference@v0.5...v0.6