v1.8
What's Changed
- 🔥[flashinfer] FlashInfer: Kernel Library for LLM Serving(@flashinfer-ai) by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/24
- 🔥[Palu] Palu: Compressing KV-Cache with Low-Rank Projection(@nycu.edu… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/25
- 🔥[SentenceVAE] SentenceVAE: Faster, Longer and More Accurate Inferenc… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/26
- Bump up to v1.8 by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/27
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.7...v1.8