v2.6.11
What's Changed
- add
MiniMax-01in Trending LLM/VLM Topics and Long Context Attention by @shaoyuyoung in https://github.com/DefTruth/Awesome-LLM-Inference/pull/112 - [feat] add deepseek-r1 by @shaoyuyoung in https://github.com/DefTruth/Awesome-LLM-Inference/pull/113
- 🔥🔥[DistServe] DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/114
- 🔥🔥[KVDirect] KVDirect: Distributed Disaggregated LLM Inference by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/115
- 🔥🔥[DeServe] DESERVE: TOWARDS AFFORDABLE OFFLINE LLM INFERENCE VIA DECENTRALIZATION by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/116
- 🔥🔥[Mooncake] Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/117
New Contributors
- @shaoyuyoung made their first contribution in https://github.com/DefTruth/Awesome-LLM-Inference/pull/112
Full Changelog: DefTruth/Awesome-LLM-Inference@v2.6.10...v2.6.11