-
-
Notifications
You must be signed in to change notification settings - Fork 10.5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Attention][UX] Add AttentionConfig and change attention backend to CLI argument
v1
#26315
opened Oct 6, 2025 by
MatthewBonanni
Loading…
2 of 5 tasks
[Docs] Fix broken table in moe_kernel_features doc
documentation
Improvements or additions to documentation
#26314
opened Oct 6, 2025 by
varun-sundar-rabindranath
Loading…
[Perf] Add decode full-graph support to FlashInfer-MLA backend
v1
#26313
opened Oct 6, 2025 by
benchislett
Loading…
docs: clarify remaining v0 references
codex
documentation
Improvements or additions to documentation
#26311
opened Oct 6, 2025 by
simon-mo
Loading…
[Benchmark] Enable MM Embedding benchmarks
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
#26310
opened Oct 6, 2025 by
DarkLight1337
Loading…
5 tasks
Add plugin registry and callbacks for AI model validation
frontend
v1
#26309
opened Oct 6, 2025 by
stefanberger
Loading…
2 of 5 tasks
[Model] Use Improvements or additions to documentation
merge_by_field_config
for Ovis families
documentation
[Model] Add ApertusToolParser
frontend
tool-calling
#26307
opened Oct 6, 2025 by
blancsw
Loading…
3 of 5 tasks
Bump flashinfer to 0.4.0rc2 to support determinism
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#26306
opened Oct 6, 2025 by
bwasti
Loading…
3 of 5 tasks
[Misc] Redact ray runtime env before logging
ready
ONLY add when PR is ready to merge/full CI is needed
#26302
opened Oct 6, 2025 by
ruisearch42
Loading…
5 tasks
[bugfix][DCP] fix block_size of hash in DCP prefix caching
v1
#26296
opened Oct 6, 2025 by
heheda12345
Loading…
5 tasks
[Bugfix] Fix ovis2.5 pre-quant fp8 checkpoint loading
#26294
opened Oct 6, 2025 by
Isotr0py
Loading…
5 tasks
[MODEL] Fix handling of multiple channels for gpt-oss with speculative decoding
frontend
gpt-oss
Related to GPT-OSS models
#26291
opened Oct 6, 2025 by
astralord
Loading…
[Kernel]
try_import_moe_kernels
->import_moe_kernels
#26286
opened Oct 6, 2025 by
NickLucche
Loading…
[Metrics] Log multi-modal cache stats
multi-modality
Related to multi-modality (#4194)
v1
#26285
opened Oct 6, 2025 by
DarkLight1337
Loading…
5 tasks
[TPU] Rename tpu_commons to tpu_inference
tpu
Related to Google TPUs
v1
#26279
opened Oct 6, 2025 by
utkarshsharma1
Loading…
5 tasks
[Kernel][Model] Tune fused_moe Triton configs for Qwen3-30B A3/A3B on H100 (FP8/BF16)
qwen
Related to Qwen models
#26268
opened Oct 6, 2025 by
shivampr
Loading…
[BugFix] Update KV block hash type from BlockHash to ExternalBlockHash in kv_events_subscriber - #26264
documentation
Improvements or additions to documentation
#26265
opened Oct 5, 2025 by
atalhens
Loading…
3 of 5 tasks
[Bugfix] Padded Eagle Specdec with Chunked Prefill
speculative-decoding
v1
#26263
opened Oct 5, 2025 by
Flechman
Loading…
5 tasks
[Model] Define merge_by_field_config MM interface (U-Z)
#26261
opened Oct 5, 2025 by
ayushsatyam146
Loading…
[Model] Define merge_by_field_config MM interface (R-T)
#26260
opened Oct 5, 2025 by
ayushsatyam146
Loading…
[Feature][torch.compile] Add pass to rearrange AllGather for FP8 models in sequence parallel for better Async TP fusion
ci/build
#26257
opened Oct 5, 2025 by
jasonlizhengjian
Loading…
3 of 5 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-09-06.