Signed-off-by: Chengji Yao <chengjiyao@google.com>
…d quantization (vllm-project#20766) Signed-off-by: Alex Kogan <alex.kogan@oracle.com>
…17818) Signed-off-by: Farzad Abdolhosseini <farzad@fixie.ai> Signed-off-by: Patrick Li <patrick8289@gmail.com> Co-authored-by: Patrick Li <patrick8289@gmail.com>
…vllm-project#21530) Signed-off-by: David Chen <530634352@qq.com>
…m-project#21641) Signed-off-by: Huy Do <huydhn@gmail.com>
… to use dtype comparison (vllm-project#21612) Signed-off-by: Alexandre Juan <a.juan@netheos.net>
Signed-off-by: Roger Wang <hey@rogerw.me> Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by: Your Name <you@example.com> Co-authored-by: Roger Wang <hey@rogerw.me> Co-authored-by: Isotr0py <2037008807@qq.com> Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
…#21618) Signed-off-by: reidliu41 <reid201711@gmail.com>
…#21646) Signed-off-by: Huy Do <huydhn@gmail.com>
…lm-project#21620) Signed-off-by: Benji Beck <benjibeck@meta.com>
vllm-project#21622) Signed-off-by: Benji Beck <benjibeck@meta.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: Max de Bayser <maxdebayser@gmail.com> Signed-off-by: Max de Bayser <mbayser@br.ibm.com> Co-authored-by: Russell Bryant <rbryant@redhat.com>
…llm-project#21634) Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
…er (vllm-project#21428) Signed-off-by: David Chen <530634352@qq.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
… Mac with Apple Silicon (vllm-project#21380) Signed-off-by: Yeju Zhou <yejuzhou@outlook.com>
…e to vllm bench CLI (vllm-project#21355) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
…e moe fp8 kernels (vllm-project#21411) Signed-off-by: kaixih <kaixih@nvidia.com>
…llm-project#21154) Signed-off-by: Wenchen Lo <charles761013@gmail.com>
…Schema (vllm-project#21656) Signed-off-by: Benji Beck <benjibeck@meta.com>
Signed-off-by: Benji Beck <benjibeck@meta.com>
Signed-off-by: Benji Beck <benjibeck@meta.com>
Signed-off-by: Benji Beck <benjibeck@meta.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
…llm-project#21667) Signed-off-by: Ye (Charlotte) Qi <yeq@meta.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Nick Hill <nhill@redhat.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
…2036) Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: kf <kuanfu.liu@embeddedllm.com> Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by: kf <kuanfu.liu@embeddedllm.com>
Signed-off-by: NickLucche <nlucches@redhat.com>
…roject#21955) Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
…ject#21835) Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
Signed-off-by: Nick Hill <nhill@redhat.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
…ata (vllm-project#21153) Signed-off-by: Sage Moore <sage@neuralmagic.com>
…wen-VL models on ROCm platform. (vllm-project#22069) Signed-off-by: tjtanaavllm <tunjian.tan@amd.com> Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by: tjtanaavllm <tunjian.tan@amd.com>
…2040) Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
…llm-project#22034) Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
Signed-off-by: Roger Wang <hey@rogerw.me>
…ist conversion (vllm-project#20000) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@centml.ai>
Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Co-authored-by: Isotr0py <2037008807@qq.com>
…ad (vllm-project#21075) Signed-off-by: Chih-Chieh Yang <7364402+cyang49@users.noreply.github.com> Signed-off-by: Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
…22114) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…ering at init time (vllm-project#21557) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a reduced subset runs. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. 🚀
…ation
Introduce the VLLM_COMPILE_DEPYF environment variable so that users can toggle depyf decompilation during compilation. By default, only a placeholder file is written; decompilation runs when VLLM_COMPILE_DEPYF=1 is set. This gives users control over when the expensive decompilation step is performed. Decompilation still always runs for cudagraph error checking, regardless of the environment variable, to prevent silent errors.
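The gating described in this commit message can be sketched roughly as follows. This is a minimal illustration and not the code from the PR: the helper names `depyf_enabled` and `write_decompiled_source`, the `force` parameter, and the placeholder text are all hypothetical, and the decompiler is passed in as a plain callable so the sketch does not depend on depyf itself. Only the variable name `VLLM_COMPILE_DEPYF` and the "placeholder unless set to 1, always decompile for the error-checking path" behavior come from the description above.

```python
import os
from pathlib import Path
from typing import Callable

# Hypothetical placeholder text; the real placeholder contents are not given in the PR.
PLACEHOLDER = "# decompilation skipped; set VLLM_COMPILE_DEPYF=1 to enable\n"


def depyf_enabled() -> bool:
    """Return True only when VLLM_COMPILE_DEPYF is set to exactly '1'."""
    return os.environ.get("VLLM_COMPILE_DEPYF", "0") == "1"


def write_decompiled_source(
    path: Path,
    decompile: Callable[[], str],
    force: bool = False,
) -> bool:
    """Write decompiled source to `path`, or a placeholder if disabled.

    `decompile` stands in for the expensive decompilation call and is
    only invoked when needed. `force=True` models the path that must
    always decompile (for example, the cudagraph error check), whatever
    the environment variable says. Returns True if decompilation ran.
    """
    if force or depyf_enabled():
        path.write_text(decompile())
        return True
    path.write_text(PLACEHOLDER)
    return False
```

Passing the decompiler as a callable keeps the expensive work lazy: when the variable is unset and `force` is false, the callable is never invoked and only the placeholder is written.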
Force-pushed from 83272a1 to fa7cef7.
This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!
This pull request has been automatically closed due to inactivity. Please feel free to reopen if you intend to continue working on it. Thank you!
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.
Purpose
Test Plan
Test Result
(Optional) Documentation Update