Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Attention
#31851 opened Jan 7, 2026 by LucasWilkinson Loading…
5 tasks
[CI] Fix weight mapping test for transformers v5 tied weights multi-modality Related to multi-modality (#4194)
#31849 opened Jan 7, 2026 by AndreasKaratzas Loading…
[Model] Add Grok-2 documentation Improvements or additions to documentation
#31847 opened Jan 7, 2026 by dangoldbj Loading…
5 tasks
Optimize graph capture size
#31846 opened Jan 7, 2026 by jiahanc Draft
5 tasks
[responsesAPI] get reasoning token metrics for simpleContext frontend gpt-oss Related to GPT-OSS models
#31839 opened Jan 6, 2026 by qandrew Draft
5 tasks
[1/2][lmcache connector] clean up lmcache multi-process adapter kv-connector ready ONLY add when PR is ready to merge/full CI is needed
#31838 opened Jan 6, 2026 by ApostaC Loading…
5 tasks
[Perf] Fuse stride preparation for NVFP4 cutlass_moe nvidia performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#31837 opened Jan 6, 2026 by mgoin Loading…
5 tasks
[CI/Build] Enable test_kv_cache_events_dp for AMD rocm Related to AMD ROCm v1
#31834 opened Jan 6, 2026 by rjrock Loading…
3 tasks done
[ROCm][CI] v1 cpu offloading attention backend fix rocm Related to AMD ROCm v1
#31833 opened Jan 6, 2026 by AndreasKaratzas Loading…
[Perf][Kernel] Fused SiLU+Mul+Quant kernel for NVFP4 cutlass_moe nvidia performance Performance-related issues
#31832 opened Jan 6, 2026 by mgoin Loading…
5 tasks
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement nvidia ready ONLY add when PR is ready to merge/full CI is needed
#31830 opened Jan 6, 2026 by yewentao256 Loading…
[Perf] Add opt-in SM100 Oink RMSNorm custom-op path
#31828 opened Jan 6, 2026 by Laurawly Loading…
[MoE Refactor][17/N] Apply Refactor to Bf16
#31827 opened Jan 6, 2026 by zyongye Loading…
5 tasks
[Perf] Slight improvement of ITL with multiple GPUs
#31826 opened Jan 6, 2026 by access2rohit Loading…
2 of 5 tasks
[Model] Enable LoRA support for tower and connector in DotsOCR documentation Improvements or additions to documentation
#31825 opened Jan 6, 2026 by ShaanveerS Loading…
[CI] Add CUDA 13 nightly containers ci/build nvidia
#31822 opened Jan 6, 2026 by csahithi Loading…
5 tasks
[ROCm][AITER] bugfix accuracy regression in ROCM_AITER_TRITON_MLA backend rocm Related to AMD ROCm v1
#31816 opened Jan 6, 2026 by vllmellm Loading…
5 tasks
Enable LoRA support for tower and connector in Mistral and Voxtral deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend qwen Related to Qwen models
#31812 opened Jan 6, 2026 by Anexdeus Loading…
ProTip! Exclude everything labeled bug with -label:bug.