Replies: 1 comment
-
|
这是一个已知问题。MinerU 会将识别为页眉的内容归入 解决标题被误识别为页眉的问题: Pipeline 后端容易将位置靠上的标题误识别为页眉,建议切换到 hybrid 或 VLM 后端: magic-pdf -p /path/to/file.pdf -o /path/to/output -m hybrid
# 或
magic-pdf -p /path/to/file.pdf -o /path/to/output -m vlm获取页眉内容: 目前 MinerU 没有配置选项直接控制页眉是否输出到 Markdown。但可以通过以下变通方式获取:
也可以查看 To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
目前解析中错把标题识别为了页眉,同时生成markdown文件中不会显示页眉,如何通过设置将页眉内容加到生成的markdown文件中?
Beta Was this translation helpful? Give feedback.
All reactions