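The VRAM spike you are seeing (10 GB) happens because the vllm inference engine preallocates a large amount of GPU memory at startup. This is not the same as the "minimum VRAM requirement" mentioned in the README: the actual requirement is well above 3 GB, and the official recommendation is at least 8 GB of VRAM (reference). Solution:
In MinerU 2.6.1 and later, VRAM management has been optimized, so upgrading before applying the configuration above is recommended. For more detailed parameter descriptions, see the official vllm documentation.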
The client does not need much VRAM, but what you started here is the server, and the server itself needs a large amount of VRAM.
Launch command:
docker compose -f compose.yaml --profile openai-server up -d --force-recreate
yaml:
mineru-openai-server:
  image: mineru:latest
  container_name: mineru-openai-server
  restart: always
  profiles: ["openai-server"]
  ports:
    - 30000:30000
  environment:
    MINERU_MODEL_SOURCE: local
  entrypoint: mineru-openai-server
  command:
    --engine vllm
    --host 0.0.0.0
    --port 30000
  ulimits:
    memlock: -1
    stack: 67108864
  ipc: host
  healthcheck:
    test: ["CMD-SHELL", "curl -f http://localhost:30000/health || exit 1"]
  volumes:
    - /mnt/data/mineru/openai-server:/app/cache  # mapped to HDD
  deploy:
    resources:
      reservations:
        devices:
          - driver: nvidia
            device_ids: ["0"]
            capabilities: [gpu]
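One way to cap how much VRAM the vllm engine claims at startup, shown as a rough sketch rather than a confirmed fix: vllm has a `--gpu-memory-utilization` option, the fraction of GPU memory the engine may reserve (default 0.9). The fragment below assumes that `mineru-openai-server` forwards extra arguments to vllm, which is worth checking against the MinerU documentation for your version. The value `0.5` is only illustrative and may need to be raised or lowered for your GPU.

```yaml
# Sketch only: assumes mineru-openai-server passes this flag through to vllm.
# 0.5 is an illustrative value (vllm's default is 0.9).
mineru-openai-server:
  command:
    --engine vllm
    --host 0.0.0.0
    --port 30000
    --gpu-memory-utilization 0.5
```

Only the `command` section changes; the rest of the service definition stays as posted. After editing, recreating the container with the same `docker compose ... up -d --force-recreate` command should pick up the new arguments.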