Skip to content

Pull requests: vllm-project/vllm-gaudi

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add out-of-tree HPU schedulers
#119 opened Sep 1, 2025 by kzawora-intel Loading…
[WARMUP] fix update bucket
#118 opened Aug 29, 2025 by xuechendi Loading…
[Bucketing] WA for warmup big values - crash
#116 opened Aug 29, 2025 by adobrzyn Loading…
Re-quantize FP8 model with INC
#114 opened Aug 29, 2025 by yiliu30 Draft
Add tests for custom op registration
#109 opened Aug 28, 2025 by Kacper-Pietkun Loading…
[Merged Prefill] Warmup for merged prefill
#104 opened Aug 26, 2025 by adobrzyn Loading…
[Bucketing] Read buckets from file
#101 opened Aug 23, 2025 by adobrzyn Draft
initial port
#100 opened Aug 22, 2025 by hsubramony Draft
Add data parallel support
#80 opened Aug 14, 2025 by wuxun-zhang Loading…
3 tasks done
Add attention unit tests
#74 opened Aug 12, 2025 by tthaddey Loading…
Lookahead decoding
#72 opened Aug 11, 2025 by jkaniecki Loading…
Enable embedding feature
#71 opened Aug 11, 2025 by slokesha Draft
Fixed Plugin Test
#70 opened Aug 8, 2025 by slokesha Loading…
CI tests: do not merge
#59 opened Aug 6, 2025 by adobrzyn Draft
[test] Add yaml files for fp8 tests
#53 opened Jul 29, 2025 by ulivne Loading…
Add support for LoRA
#51 opened Jul 29, 2025 by vivekgoe Loading…
Add sampler unit tests
#46 opened Jul 28, 2025 by kzawora-intel Loading…
Proper chunked prefill bucketing/warmup
#32 opened Jul 16, 2025 by kzawora-intel Loading…
ProTip! Filter pull requests by the default branch with base:main.