Skip to content

Pull requests: alibaba/rtp-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

test: trigger rebuild for AMD CI testing
#658 opened Jan 30, 2026 by Zoe923 Loading…
fix - w13 fused
#657 opened Jan 30, 2026 by jianglan89 Loading…
add comment optimize
#654 opened Jan 29, 2026 by guoj14 Loading…
fix: fix deepep base setting
#648 opened Jan 28, 2026 by Vinkle-hzt Loading…
[Feature] Support Per-Expert-Overlap(PEO) capability
#643 opened Jan 27, 2026 by yykzjh Loading…
Revert "chore: rm useless device sync"
#638 opened Jan 27, 2026 by Vinkle-hzt Loading…
fix: refactor symm memory to avoid capture oom
#634 opened Jan 26, 2026 by JackTan25 Loading…
feat: support headwise attention 260126
#631 opened Jan 26, 2026 by qqbbiu Loading…
Feature/sm100 attention
#630 opened Jan 26, 2026 by zerozw Loading…
feat: add HybridKVCacheAllocator
#629 opened Jan 25, 2026 by SJTUGavinLiu Loading…
Feature/p2p connector
#627 opened Jan 23, 2026 by zhangchicc Loading…
feat: return raw output in streaming mode
#626 opened Jan 23, 2026 by soaringk Loading…
fix: fix truncate token type err
#625 opened Jan 23, 2026 by wanglining97 Loading…
fix: fix health check error
#622 opened Jan 22, 2026 by ABNER-1 Loading…
fix: fix master role type err
#619 opened Jan 22, 2026 by wanglining97 Loading…
chore: skip check for torch profiler get info
#614 opened Jan 21, 2026 by Vinkle-hzt Loading…
add model support for deepseek_vl2
#612 opened Jan 21, 2026 by wuchen-shendaxia Loading…
feat: support headwise attention
#609 opened Jan 21, 2026 by Echo-2334 Loading…
optimize AMD layernorm
#606 opened Jan 20, 2026 by yanglf1121 Loading…
optimize kv_block_array
#604 opened Jan 20, 2026 by Xu-Sheng-lin Loading…
ProTip! Adding no:label will show everything without a label.