Skip to content
Discussion options

You must be logged in to vote

Hi @GnosisLabsG! I'm Dosu and I’m helping the MinerU team.

MinerU only supports PaddleOCR as its OCR engine—there’s no built-in way to switch to a different OCR engine like LightOnOCR-2-1B or to use multiple OCR engines at once. All CLI/API options (like --lang or --source) only affect which PaddleOCR model is used, not the underlying engine itself.

If you want to use a different OCR engine, you’d need to modify the codebase, specifically the OCR model initialization logic (e.g., ocr_model_init and related classes), to instantiate your own OCR implementation that matches the expected interface and output format. There’s no abstraction layer or plugin system for swapping OCR engines, so t…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by GnosisLabsG
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant