Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Perf] Early return in KVCacheManager.allocate_slots v1
#29206 opened Nov 21, 2025 by Jialin Draft
3 of 5 tasks
[CI/Build] Add terratorch for AMD ci/build rocm Related to AMD ROCm
#29205 opened Nov 21, 2025 by rjrock Loading…
3 of 5 tasks
[CI/Build] Disable test_gptoss_tp.py in 'LoRA TP Test' group for ROCm platform ci/build gpt-oss Related to GPT-OSS models rocm Related to AMD ROCm
#29204 opened Nov 21, 2025 by qli88 Loading…
[CI] Bug: Fix triton import issue force-merge ready ONLY add when PR is ready to merge/full CI is needed v1
#29202 opened Nov 21, 2025 by yewentao256 Loading…
[amd][ci] update mteb version for ROCM ci/build rocm Related to AMD ROCm
#29199 opened Nov 21, 2025 by bradleyhd Draft
5 tasks
[Frontend] Implement robust video frame recovery for corrupted videos documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) performance Performance-related issues
#29197 opened Nov 21, 2025 by vSeamar Draft
5 tasks
[perf][cpu] Accelerate attention GEMMs (QK, PV) on Arm CPUs with NEON aarch64-cpu performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed v1
#29193 opened Nov 21, 2025 by fadara01 Loading…
2 tasks
[Models] Lfm2-VL Architecture documentation Improvements or additions to documentation new-model Requests to new models
#29191 opened Nov 21, 2025 by paulpak58 Loading…
5 tasks
[Doc] Update more docs with respect to V1 documentation Improvements or additions to documentation
#29188 opened Nov 21, 2025 by DarkLight1337 Loading…
5 tasks
[LoRA] Cleanup FusedMoEWithLoRA
#29187 opened Nov 21, 2025 by jeejeelee Loading…
5 tasks
[Misc] Further clean up chunked prefill and prefix caching init ready ONLY add when PR is ready to merge/full CI is needed v1
#29186 opened Nov 21, 2025 by DarkLight1337 Loading…
5 tasks
Add fused MoE config for H200 E160 N192 fp8
#29182 opened Nov 21, 2025 by FlintyLemming Loading…
3 of 5 tasks
docs: fixes distributed executor backend config for multi-node vllm documentation Improvements or additions to documentation
#29173 opened Nov 21, 2025 by michaelact Loading…
5 tasks
[docs] Fix cudagraph mode config documentation Improvements or additions to documentation nvidia ready ONLY add when PR is ready to merge/full CI is needed
#29170 opened Nov 21, 2025 by angelayi Loading…
[Core] Add xxHash as a high-performance hash option for accelerating prefix caching ci/build deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend gpt-oss Related to GPT-OSS models kv-connector llama Related to Llama models multi-modality Related to multi-modality (#4194) new-model Requests to new models nvidia performance Performance-related issues qwen Related to Qwen models rocm Related to AMD ROCm speculative-decoding structured-output v1
#29163 opened Nov 21, 2025 by LuminolT Loading…
4 of 5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.