v2.6.5
What's Changed
- Add DP/TP/SP/CP papers with codes by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/92
- 🔥🔥[SP: BPT] Blockwise Parallel Transformer for Large Context Models by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/93
- 🔥🔥[TP: Comm Compression] Communication Compression for Tensor Parallel LLM Inference by @DefTruth in https://github.com/DefTruth/Awesome-LLM-Inference/pull/94
Full Changelog: DefTruth/Awesome-LLM-Inference@v2.6.4...v2.6.5