Closed
Labels
bug: Something isn't working
Description
Your current environment
Starting with version 0.7.0, deploying a Qwen2_vl service with vllm leaks memory (host RAM, not GPU memory).
🐛 Describe the bug
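When serving the Qwen2_vl model with vllm 0.7.0, the memory held by the service's processes is not released after requests are handled. Usage keeps growing until it exhausts the server's RAM and the service stops. In testing, version 0.6.6 does not have this problem; 0.7.0 and every later version does.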
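
One way to put numbers on the growth described above is to sample the resident host memory of the server process between batches of requests. The following is only a minimal sketch, not part of vllm: the helper names (`parse_vm_rss_kib`, `read_rss_kib`, `rss_growth_kib`) are hypothetical, it assumes a Linux host where `/proc/<pid>/status` is available, and the PID of the serving process has to be looked up separately.

```python
import re
from pathlib import Path


def parse_vm_rss_kib(status_text: str) -> int:
    """Extract VmRSS (resident host memory, in KiB) from /proc/<pid>/status text."""
    match = re.search(r"^VmRSS:\s+(\d+)\s+kB", status_text, flags=re.MULTILINE)
    if match is None:
        # Kernel threads and exited processes have no VmRSS line.
        raise ValueError("VmRSS not found in status text")
    return int(match.group(1))


def read_rss_kib(pid: int) -> int:
    """Read the current resident set size of a running process (Linux only)."""
    return parse_vm_rss_kib(Path(f"/proc/{pid}/status").read_text())


def rss_growth_kib(samples: list[int]) -> int:
    """Difference between the last and first sample, in KiB."""
    return samples[-1] - samples[0]
```

Calling `read_rss_kib(pid)` after each batch of requests and passing the collected values to `rss_growth_kib` gives a figure that can be compared between 0.6.6 and 0.7.0. Growth that never levels off under a steady request load would be consistent with the leak reported here, although it does not by itself identify the cause.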

Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.