-
I'm currently working with PaddleOCR to process screenshots of various sizes. I've encountered an issue where PaddleOCR does not correctly recognize all text elements in larger screenshots, specifically those with resolutions of 3858x2118 and 2576x1408. However, it works perfectly for screenshots with resolutions of 1936x1048 and 2578x1398. I attempted to split larger images into smaller fragments, process them individually with OCR, and then merge the results. While this approach works, I'm wondering if there is a better solution. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
It looks like you're facing challenges with PaddleOCR when processing large screenshots (3858x2118, 2576x1408) while it works fine on smaller ones. Based on the provided discussion, here are some suggestions to improve OCR performance on large images: Possible Issues:
Potential Solutions:
Additional Considerations:
Would you like specific code examples to implement image splitting or model tuning? Let me know how I can assist further! 🚀 Response generated by feifei-bot | chatgpt-4o-latest |
Beta Was this translation helpful? Give feedback.
-
To process large images or documents with PaddleOCR for detection and recognition, use the slice operation like this: ocr_inst = PaddleOCR(**ocr_settings)
results = ocr_inst.ocr(img, det=True, rec=True, slice=slice, cls=False, bin=False, inv=False, alpha_color=False) For more details, visit: https://paddlepaddle.github.io/PaddleOCR/latest/en/ppocr/blog/slice.html |
Beta Was this translation helpful? Give feedback.
To process large images or documents with PaddleOCR for detection and recognition, use the slice operation like this:
For more details, visit: https://paddlepaddle.github.io/PaddleOCR/latest/en/ppocr/blog/slice.html