PaddleOCR/latest/en/index #14946
Replies: 6 comments 7 replies
-
I am using PaddleOCR with the Arabic language model (lang='fa') to perform OCR on Arabic images. While PaddleOCR correctly recognizes the Arabic characters, it processes the text in a Left-to-Right (LTR) order, which is incorrect for Arabic, a Right-to-Left (RTL) language. This results in the words and sentences being in reverse order. I have reviewed the paddleocr --help output to see if there are any options to explicitly set the text direction or handle RTL languages like Arabic. My question is: Is there a specific option in PaddleOCR, possibly using ocr_order_method or another parameter, to correctly handle Right-to-Left languages like Arabic and ensure the output text is in the correct RTL order? If there isn't a built-in option, what are the recommended workarounds to post-process the OCR output to reorder the text correctly for RTL languages in Python? Any guidance or solutions on how to get PaddleOCR to output Arabic text in the correct Right-to-Left order would be greatly appreciated. I tried to use the following code: from paddleocr import PaddleOCR, draw_ocr ocr = PaddleOCR(use_angle_cls=True, lang='Ar') draw resultfrom PIL import Image |
Beta Was this translation helpful? Give feedback.
-
.Warning: tiling offset out of range, index: 32 |
Beta Was this translation helpful? Give feedback.
-
great |
Beta Was this translation helpful? Give feedback.
-
/Users/home/Downloads/汤用彤 - 汉魏两晋南北朝佛教史 2.pdf |
Beta Was this translation helpful? Give feedback.
-
Hello, I have already trained the model using my own dataset and custom dictionary file, specifically It seems that PP-OCRv5 no longer supports this parameter, or at least I couldn’t find any documentation about it. How can I load and use my custom dictionary file during prediction in PP-OCRv5? Thank you very much! |
Beta Was this translation helpful? Give feedback.
-
hi I am running PPStructureV3 on my mac local All the weights of PPChart2TableInference were initialized from the model checkpoint at /Users/SamJia/.paddlex/official_models/PP-Chart2Table. Can anyone help? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
PaddleOCR/latest/en/index
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/latest/en/index.html
Beta Was this translation helpful? Give feedback.
All reactions