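Hi @zixing15! I'm Dosu and I'm helping the MinerU team.

The root cause of your problem is that the Docker deployment uses the pipeline backend by default, whereas your uv deployment on Windows may have used the VLM backend. That difference explains the different table recognition results.

The most direct fix: switch to the VLM backend

In the Docker environment, explicitly specify the VLM backend:

    # Use the VLM backend
    docker run --rm -v /path/to/input:/input -v /path/to/output:/output \
      opendatalab/mineru:latest \
      magic-pdf -p /input/your_bill.pdf -o /output --backend vlm-vllm-engine
    # Or use the hybrid backend
    magic-pdf -p /input/your_bill.pdf -o /output --backend hybrid-auto-engine

If you call it through the Python API, set the environment variable before importing MinerU:

    import os
    os.environ['BACKEND'] = 'vlm-vllm-engine'  # must be set before the import
    from magic_pdf import parse_pdf
    # then call the API

Why did the parameter you changed have no effect?

The parameter you modified … In addition, the VLM backend recognizes complex tables (bills, invoices, and so on) significantly more accurately than the pipeline backend, and tuning the pipeline's parameters cannot fundamentally overcome its architectural limitations.

Other suggestions: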
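On the point above about setting the environment variable before the import: the sketch below illustrates the general Python behavior behind that advice. It assumes a library that reads its configuration once, at import time. The module `fake_backend_config` is a hypothetical stand-in written to a temporary directory, not MinerU code, and whether MinerU actually reads a `BACKEND` variable this way is not confirmed here.

```python
import importlib
import os
import sys
import tempfile
from pathlib import Path

# Hypothetical stand-in module (NOT MinerU code): reads BACKEND once, at import time.
MODULE_SOURCE = 'import os\nBACKEND = os.environ.get("BACKEND", "pipeline")\n'


def backend_seen_by_module(set_before_import: bool) -> str:
    """Import the stand-in module and return the backend value it captured."""
    os.environ.pop("BACKEND", None)
    sys.modules.pop("fake_backend_config", None)
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "fake_backend_config.py").write_text(MODULE_SOURCE)
        sys.path.insert(0, tmp)
        try:
            importlib.invalidate_caches()
            if set_before_import:
                os.environ["BACKEND"] = "vlm-vllm-engine"
            module = importlib.import_module("fake_backend_config")
            if not set_before_import:
                # Too late: the module already captured the default value.
                os.environ["BACKEND"] = "vlm-vllm-engine"
            return module.BACKEND
        finally:
            # Restore interpreter state so repeated calls start clean.
            sys.path.remove(tmp)
            sys.modules.pop("fake_backend_config", None)
            os.environ.pop("BACKEND", None)


print(backend_seen_by_module(set_before_import=True))
print(backend_seen_by_module(set_before_import=False))
```

If a library behaves like this stand-in, a value assigned after the import is silently ignored, which would look exactly like a setting that "does not take effect".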
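".join(ocr_rec_text), but it still failed and the parsing did not succeed.
However, when I first deployed directly on Windows via uv, the bill in the image was parsed successfully, with the order number and date separated correctly. The only problem is that running on Windows is too slow.
Could someone help me figure out how to solve this? Thanks.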