
Commit 01b52f9

update news and citation (#3889)
* update news and citation
* update
1 parent d46103a commit 01b52f9


2 files changed: 29 additions & 0 deletions


README.md

Lines changed: 15 additions & 0 deletions
@@ -25,6 +25,12 @@ ______________________________________________________________________

<details open>
<summary><b>2025</b></summary>
+
+- \[2025/06\] Comprehensive inference optimization for FP8 MoE models
+- \[2025/06\] DeepSeek PD disaggregation deployment is now supported through integration with [DLSlime](https://github.com/DeepLink-org/DLSlime) and [Mooncake](https://github.com/kvcache-ai/Mooncake). Huge thanks to both teams!
+- \[2025/04\] Enhanced DeepSeek inference performance by integrating deepseek-ai techniques: FlashMLA, DeepGemm, DeepEP, MicroBatch, and eplb
+- \[2025/01\] Support for DeepSeek V3 and R1
+
</details>

<details close>
@@ -270,6 +276,15 @@ We appreciate all contributions to LMDeploy. Please refer to [CONTRIBUTING.md](.
}
```

+```bibtex
+@article{zhang2025efficient,
+  title={Efficient Mixed-Precision Large Language Model Inference with TurboMind},
+  author={Zhang, Li and Jiang, Youhe and He, Guoliang and Chen, Xin and Lv, Han and Yao, Qian and Fu, Fangcheng and Chen, Kai},
+  journal={arXiv preprint arXiv:2508.15601},
+  year={2025}
+}
+```
+
# License

This project is released under the [Apache 2.0 license](LICENSE).
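As a side note on the \[2025/01\] DeepSeek entry added above: the `lmdeploy.pipeline` API that appears in a context line of the README_zh-CN.md hunk below can load a DeepSeek checkpoint directly. The sketch is illustrative only and is not part of this commit; the model ID is an assumption, so swap in whichever DeepSeek checkpoint you actually intend to serve.

```python
# Illustrative sketch only (not part of this commit): running a DeepSeek model
# through LMDeploy's pipeline API after the [2025/01] support landed.
# The model ID below is an assumption; replace it with the checkpoint you use.
from lmdeploy import pipeline

with pipeline("deepseek-ai/DeepSeek-R1-Distill-Qwen-7B") as pipe:
    # Batched prompts return one response object per prompt.
    responses = pipe(["Introduce yourself briefly.", "What is PD disaggregation?"])
    print(responses)
```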

README_zh-CN.md

Lines changed: 14 additions & 0 deletions
@@ -27,6 +27,11 @@ ______________________________________________________________________
<summary><b>2025</b></summary>
</details>

+- 【2025/06】 Comprehensive inference optimization for FP8 MoE models
+- 【2025/06】 Integrated [DLSlime](https://github.com/DeepLink-org/DLSlime) and [Mooncake](https://github.com/kvcache-ai/Mooncake) to enable DeepSeek PD disaggregated deployment. Sincere thanks to both teams!
+- 【2025/04】 Integrated deepseek-ai components FlashMLA, DeepGemm, DeepEP, MicroBatch, eplb, and more to improve DeepSeek inference performance
+- 【2025/01】 Added support for DeepSeek V3 and R1
+
<details close>
<summary><b>2024</b></summary>

@@ -270,6 +275,15 @@ with lmdeploy.pipeline("internlm/internlm3-8b-instruct") as pipe:
}
```

+```bibtex
+@article{zhang2025efficient,
+  title={Efficient Mixed-Precision Large Language Model Inference with TurboMind},
+  author={Zhang, Li and Jiang, Youhe and He, Guoliang and Chen, Xin and Lv, Han and Yao, Qian and Fu, Fangcheng and Chen, Kai},
+  journal={arXiv preprint arXiv:2508.15601},
+  year={2025}
+}
+```
+
# Open-Source License

This project is released under the [Apache 2.0 open-source license](LICENSE)
