Change OCR model ? #4429
-
|
MinerU using PaddleOCR, but am i able to change the OCR ? also why dont they add multi OCR support so we can use other OCRs like LightOnOCR-2-1B |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
Hi @GnosisLabsG! I'm Dosu and I’m helping the MinerU team. MinerU only supports PaddleOCR as its OCR engine—there’s no built-in way to switch to a different OCR engine like LightOnOCR-2-1B or to use multiple OCR engines at once. All CLI/API options (like If you want to use a different OCR engine, you’d need to modify the codebase, specifically the OCR model initialization logic (e.g., There is a limited extensibility point: advanced users can process the There are no current plans or roadmap for multi-OCR support or for supporting engines like LightOnOCR-2-1B. Adding this would require significant architectural changes, such as introducing an abstraction/interface for OCR engines and configuration options to select between them. To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
Hi @GnosisLabsG! I'm Dosu and I’m helping the MinerU team.
MinerU only supports PaddleOCR as its OCR engine—there’s no built-in way to switch to a different OCR engine like LightOnOCR-2-1B or to use multiple OCR engines at once. All CLI/API options (like
--langor--source) only affect which PaddleOCR model is used, not the underlying engine itself.If you want to use a different OCR engine, you’d need to modify the codebase, specifically the OCR model initialization logic (e.g.,
ocr_model_initand related classes), to instantiate your own OCR implementation that matches the expected interface and output format. There’s no abstraction layer or plugin system for swapping OCR engines, so t…