[Bug]: Host memory (not GPU memory) leak when serving Qwen2_vl with vLLM 0.7.0 and later #15597

@Syx93-c

Your current environment

Starting with version 0.7.0, vLLM leaks host memory (not GPU memory) when serving Qwen2_vl.


🐛 Describe the bug

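When a Qwen2_vl model is served with vLLM 0.7.0, the service's processes do not release host memory after handling requests. Memory usage keeps growing until it exhausts the server's RAM and the service stops. In testing, version 0.6.6 does not have this problem, while 0.7.0 and all later versions do.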
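
To make the growth described above easier to quantify, here is a minimal sketch that samples the resident set size (RSS) of a server process while requests are sent to it. It is not part of the original report. It assumes a Linux host (it reads `/proc/<pid>/status`), and the names `parse_vm_rss_kib`, `read_rss_kib`, and `monitor` are hypothetical helpers introduced only for illustration. vLLM may run more than one process, so each relevant PID may need to be sampled separately.

```python
import time
from pathlib import Path


def parse_vm_rss_kib(status_text: str) -> int:
    """Extract the VmRSS value (in KiB) from the text of /proc/<pid>/status."""
    for line in status_text.splitlines():
        if line.startswith("VmRSS:"):
            # The line looks like: "VmRSS:     123456 kB"
            return int(line.split()[1])
    raise ValueError("VmRSS not found (kernel thread or non-Linux system?)")


def read_rss_kib(pid: int) -> int:
    """Read the current resident set size of a process (Linux only)."""
    return parse_vm_rss_kib(Path(f"/proc/{pid}/status").read_text())


def monitor(pid: int, interval_s: float = 5.0, samples: int = 60) -> list[int]:
    """Sample RSS periodically and print growth relative to the first sample."""
    history: list[int] = []
    for _ in range(samples):
        rss = read_rss_kib(pid)
        history.append(rss)
        delta_mib = (rss - history[0]) / 1024
        stamp = time.strftime("%H:%M:%S")
        print(f"{stamp} rss={rss / 1024:.1f} MiB delta={delta_mib:+.1f} MiB", flush=True)
        time.sleep(interval_s)
    return history
```

Calling `monitor(pid)` with the PID of the serving process while a client sends image requests gives a simple time series. Running the same load against 0.6.6 and 0.7.0 would show whether RSS plateaus or keeps climbing.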

Image

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
