Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix qwen35 dp
#4535 opened Apr 18, 2026 by grimoire Collaborator Loading…
fix: prevent prefill starvation under high decode load
#4532 opened Apr 16, 2026 by grimoire Collaborator Loading…
Mixed modality
#4531 opened Apr 16, 2026 by CUHKSZzxy Collaborator Loading…
optimize get_sorted_idx in moe
#4529 opened Apr 15, 2026 by grimoire Collaborator Loading…
Test: update video sleep/wakeup and abort scenarios
#4528 opened Apr 15, 2026 by littlegy Contributor Loading…
style: add autopep8 pre-commit hook and apply PEP 8 formatting fixes
#4524 opened Apr 14, 2026 by windreamer Collaborator Loading…
[WIP]: Fix mtp experts
#4520 opened Apr 13, 2026 by RunningLeon Collaborator Loading…
fix qwen3.5 shared_expert_all_reduce
#4515 opened Apr 10, 2026 by yao-fengchen Collaborator Draft
make fp8 model quantized by llm-compressor can be inferenced in turbomind enhancement New feature or request
#4509 opened Apr 8, 2026 by 43758726 Collaborator Loading…
support more message item types
#4501 opened Apr 7, 2026 by CUHKSZzxy Collaborator Draft
fix: handle missing KV cache without crashing engine Bug:P0
#4497 opened Apr 4, 2026 by lvhan028 Collaborator Loading…
Integrate deep-ep nccl backend enhancement New feature or request
#4477 opened Mar 27, 2026 by irexyc Collaborator Loading…
feat: Turbomind linear gdn prefix caching enhancement New feature or request
#4465 opened Mar 25, 2026 by lapy Contributor Loading…
refactor get_ppl improvement
#4461 opened Mar 25, 2026 by lvhan028 Collaborator Loading…
feat: implement Turbomind vision encoder support for Qwen3VL/3.5 families enhancement New feature or request
#4460 opened Mar 24, 2026 by lapy Contributor Loading…
Support multi stop words improvement
#4454 opened Mar 24, 2026 by lvhan028 Collaborator Loading…
[WIP] Support qwen3-omni
#4411 opened Mar 13, 2026 by CUHKSZzxy Collaborator Draft
2 of 4 tasks
Fix Structured Output for GPT-OSS Models
#4386 opened Mar 2, 2026 by windreamer Collaborator Loading…
ProTip! Updated in the last three days: updated:>2026-04-15.