-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
用户体验
- 文档优化补充
- 刷新特性支持情况
- 增加部署指导
软件分发
- x86 openeuler镜像支持
- 国内镜像源同步
社区量化支持
- sglang社区量化roadmap [Roadmap] Quantization Support sgl-project/sglang#8180
- GPTQ
- Compressed Tensors
Agent
- function call能力增强
新模型支持
- NA
vllm vs sglang
- sglang triton 算子分发机制分析
- sglang框架npu 适配分析 (接入昇腾的方式比vLLM代价小)
参考资料
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels