v1.9
What's Changed
- 🔥[DynamoLLM] DynamoLLM: Designing LLM Inference Clusters for Performa… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/28
- 🔥[Zero-Delay QKV Compression] Zero-Delay QKV Compression for Mitigati… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/29
- 🔥[Automatic Inference Engine Tuning] Towards SLO-Optimized LLM Servin… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/30
- 🔥🔥[500xCompressor] 500xCompressor: Generalized Prompt Compression for… by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/31
- Bump up to v1.9 by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/32
Full Changelog: DefTruth/Awesome-LLM-Inference@v1.8...v1.9