Replies: 1 comment
-
|
2.7.6 版本显存降低但速度变慢是一个设计权衡。这个版本默认禁用了
恢复速度的方法:可以尝试重新启用这些优化(会增加显存使用): magic-vlm --engine vllm --enable-chunked-prefill --enable-prefix-caching --gpu-memory-utilization 0.8也可以调整 另外,如果你需要更好的性能,推荐使用双进程模式:部署 To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
2.7.6 Release我使用了vlm-auto-engine 显存只用了6G,比以前16G少了很多。 但是速度变慢了~ 是什么原因,可以调参数吗?
Beta Was this translation helpful? Give feedback.
All reactions