Replies: 1 comment 2 replies
-
|
Hi @chengxy1! I'm Dosu and I’m helping the MinerU team. 要让MinerU自动提取PDF公式编号并在Markdown公式后追加编号(如用
核心改动点就是:公式编号的检测与配对(MagicModel),数据结构的传递,Markdown渲染时追加\tag{}(merge_para_with_text)。这样就能实现你要的自动追加公式编号功能。 To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
当前MinerU只能实现对PDF中的公式进行提取,但公式对应在文中的编号无法提取到Markdown文件中,这对使用者回到论文原文中验证公式检测是否正确增添了麻烦,若实现对公式编号的提取就会方便很多。

另外,我想知道怎么样去修改代码实现对生成的Markdown文件的修改。之前有个讨论说可以在Markdown中公式的LaTex代码后面加入\tag{}来实现追加编号,我希望修改代码可以实现在生成md文件后自动追加公式编号,请问我可以修改哪些代码实现这个操作?
At present, MinerU can only extract the formula in the PDF, but the number of the formula corresponding to the text cannot be extracted into the Markdown file, which adds trouble for the user to go back to the original text of the paper to verify whether the formula detection is correct, and it will be much more convenient to extract the formula number if it is realized.
Also, I'd like to know how to modify the code to make changes to the generated Markdown file. There was a discussion before that you can add "\tag{}" to the end of the LaTex code of the formula in Markdown to achieve append numbering, I hope that the modified code can automatically append the formula number after generating the md file, what code can I modify to achieve this operation?
Beta Was this translation helpful? Give feedback.
All reactions