v2.6.12
What's Changed
- Add Multi-head Latent Attention(MLA) topic by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/118
Full Changelog: DefTruth/Awesome-LLM-Inference@v2.6.11...v2.6.12
Full Changelog: DefTruth/Awesome-LLM-Inference@v2.6.11...v2.6.12