Commit 804bfca

Update README with xLLM news entry
Added news entry about xLLM high-performance inference engine.
1 parent f29965e commit 804bfca

1 file changed: +1 -0 lines changed

README.md

Lines changed: 1 addition & 0 deletions
@@ -37,6 +37,7 @@ ScaleLLM
 ScaleLLM is currently undergoing active development. We are fully committed to consistently enhancing its efficiency while also incorporating additional features. Feel free to explore our [**_Roadmap_**](https://github.com/vectorch-ai/ScaleLLM/issues/84) for more details.
 
 ## News:
+* [08/2025] - [xLLM](https://github.com/jd-opensource/xllm?tab=readme-ov-file#7-acknowledgment) high-performance inference engine builds runtime execution based on ScaleLLM.
 * [01/2025] - Optimized inhouse Attention kernels
 * [06/2024] - ScaleLLM is now available on [PyPI](https://pypi.org/project/scalellm/). You can install it using `pip install scalellm`.
 * [03/2024] - [Advanced features](#advanced-features) support for ✅ [CUDA graph](#cuda-graph), ✅ [prefix cache](#prefix-cache), ✅ [chunked prefill](#chunked-prefill) and ✅ [speculative decoding](#speculative-decoding).
