Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Refactor] Simplify dummy data generation deepseek Related to DeepSeek models documentation Improvements or additions to documentation llama Related to Llama models multi-modality Related to multi-modality (#4194) qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding
#35025 opened Feb 21, 2026 by DarkLight1337 Loading…
5 tasks
[Deprecation] Remove old locations of get_tokenizer and resolve_hf_chat_template frontend ready ONLY add when PR is ready to merge/full CI is needed
#35024 opened Feb 21, 2026 by DarkLight1337 Draft
5 tasks
[GGUF] Fix loading of fused/shard-less quantized weights
#35019 opened Feb 21, 2026 by laudney Loading…
2 of 3 tasks
[Feature] Add per-request attention capture to the OpenAI-compatible API documentation Improvements or additions to documentation frontend v1
#35014 opened Feb 21, 2026 by Parkprogrammer Loading…
[CI/Build] Fix gRPC version mismatch ci/build ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#35013 opened Feb 21, 2026 by DarkLight1337 Loading…
5 tasks
[XPU] Avoid use top_k_top_p_triton kernel for xpu ready ONLY add when PR is ready to merge/full CI is needed v1
#35011 opened Feb 21, 2026 by jikunshang Loading…
5 tasks
[XPU] allow TORCH_SDPA/TRITON_ATTN as XPU vit Backend
#35010 opened Feb 21, 2026 by yma11 Loading…
[CI] Bumping grpcio version ci/build needs-rebase rocm Related to AMD ROCm
#35008 opened Feb 21, 2026 by AndreasKaratzas Loading…
[Bugfix] Register VLLM_BATCH_INVARIANT in envs.py to fix spurious unknown env var warning bug Something isn't working
#35007 opened Feb 21, 2026 by WindChimeRan Loading…
3 of 5 tasks
[CI] Fix tests/evals/gsm8k/test_gsm8k_correctness.py for Qwen3-Next-80B-A3B-NVFP4-EP2 qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed v1
#34999 opened Feb 20, 2026 by LucasWilkinson Loading…
[ROCm] Check that AITER MHA is not selected with sinks rocm Related to AMD ROCm
#34998 opened Feb 20, 2026 by gshtras Loading…
[RL] Validation for pause_mode='keep' documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed
#34992 opened Feb 20, 2026 by hao-aaron Loading…
5 tasks
[Bugfix] ep_scatter kernel store-load race condition bug Something isn't working
#34991 opened Feb 20, 2026 by ivanium Loading…
5 tasks
[CI] Skip Responses API ci-failure Issue about an unexpected test failure in CI ready ONLY add when PR is ready to merge/full CI is needed
#34990 opened Feb 20, 2026 by robertgshaw2-redhat Loading…
[CI Bugfix] Add pytest.mark.flaky to tests/v1/e2e/test_spec_decode.py bug Something isn't working ci-failure Issue about an unexpected test failure in CI ready ONLY add when PR is ready to merge/full CI is needed v1
#34987 opened Feb 20, 2026 by mgoin Loading…
5 tasks
ProTip! Add no:assignee to see everything that’s not assigned.