PaddleOCR OCR analyzes Left-to-Right instead of Right-to-Left for Arabic- How to process RTL languages correctly? #14947

roostaeecs · 2025-03-27T22:12:18Z

roostaeecs
Mar 27, 2025

I am using PaddleOCR with the Arabic language model (lang='ar') to perform OCR on Arabic images. While PaddleOCR correctly recognizes the Arabic characters, it processes the text in a Left-to-Right (LTR) order, which is incorrect for Arabic, a Right-to-Left (RTL) language. This results in the words and sentences being in reverse order.

I have reviewed the paddleocr --help output to see if there are any options to explicitly set the text direction or handle RTL languages like Arabic.

My question is:

Is there a specific option in PaddleOCR, possibly using ocr_order_method or another parameter, to correctly handle Right-to-Left languages like Arabic and ensure the output text is in the correct RTL order?

If there isn't a built-in option, what are the recommended workarounds to post-process the OCR output to reorder the text correctly for RTL languages in Python?

Any guidance or solutions on how to get PaddleOCR to output Arabic text in the correct Right-to-Left order would be greatly appreciated.

I tried to use the following code:

from paddleocr import PaddleOCR, draw_ocr

ocr = PaddleOCR(use_angle_cls=True, lang='Ar')
img_path = 'image5.jpg'
result = ocr.ocr(img_path, cls=True)
for idx in range(len(result)):
res = result[idx]
for line in res:
print(line)

draw result

from PIL import Image
result = result[0]
image = Image.open(img_path).convert('RGB')
boxes = [line[0] for line in result]
txts = [line[1][0] for line in result]
scores = [line[1][1] for line in result]
im_show = draw_ocr(image, boxes, txts, scores, font_path='./doc/fonts/arabic.ttf')
im_show = Image.fromarray(im_show)
im_show.save('result.jpg')

GreatV · 2025-03-27T22:17:32Z

GreatV
Mar 27, 2025
Maintainer

Currently, PaddleOCR does not offer a built-in option such as ocr_order_method to explicitly specify or automatically detect Right-to-Left (RTL) text direction for languages like Arabic. This means that even if the Arabic characters are recognized correctly (as you’ve observed), the final output is often ordered in a Left-to-Right (LTR) fashion, which is unsuitable for correct Arabic text rendering.

Recommended Workaround: Post-Processing the OCR Output

To produce properly ordered Arabic text, you should handle the reordering in a post-processing step using Python. Here’s how to do so:

Use arabic_reshaper and python-bidi to reorder text:
The paddled text returned from PaddleOCR is usually segmented by bounding boxes. Each line of text returned is in LTR order by default, even for RTL languages. To render full Arabic sentences in proper RTL orientation, you can use:
- arabic_reshaper: handles character shaping (contextual forms),
- python-bidi: reorders the shaped text to conform to RTL display rules.
Installation:
```
pip install arabic-reshaper python-bidi
```

Example to convert PaddleOCR outputs to correct RTL Arabic:
Add the following to your script after getting txts list:

import arabic_reshaper
from bidi.algorithm import get_display

reshaped_txts = [
    get_display(arabic_reshaper.reshape(text))
    for text in txts
]

Then pass reshaped_txts instead of txts to draw_ocr:

im_show = draw_ocr(image, boxes, reshaped_txts, scores, font_path='./doc/fonts/arabic.ttf')

This ensures both the displayed and saved results appear in correct Arabic RTL order.

For console or log output:
When printing to console, also apply reshaping and bidi rendering:

for idx in range(len(result)):
    res = result[idx]
    for line in res:
        original_text = line[1][0]
        reshaped_text = arabic_reshaper.reshape(original_text)
        bidi_text = get_display(reshaped_text)
        print(bidi_text)

Note: The detection box order returned by PaddleOCR is influenced by the location of boxes on the image. You may need to sort boxes manually (e.g. top-to-bottom, then right-to-left within lines) for more accurate sentence ordering in complex layouts.

Conclusion:

There is no native support in PaddleOCR yet for RTL output order.
Apply post-processing using arabic_reshaper and python-bidi to correct the text order and shaping.
Ensure proper font is used during rendering to support Arabic glyphs.

These steps should help you present OCR results in the correct Right-to-Left order for Arabic and other RTL languages.

Response generated by 🤖 feifei-bot | chatgpt-4o-latest

⚠️ This is an AI-generated response. Please verify any critical information.

0 replies

timminator · 2025-08-10T00:39:04Z

timminator
Aug 10, 2025

Stumbled across this today aswell. This is kinda tricky. Arabic reshaper and the bidi algorithm did not work for me because the results from paddleocr are in a visual order and instead need to be converted to a correct logical order. Bidi already expects a logical order so this doesn't help us.
I found a solution now, that works for everything I've tried so far. We manually convert the order of the ocr result from PaddleOCR for right to left languages. Here is an example script:

import json
import re
from paddleocr import PaddleOCR


# Converts text from the OCR's non-standard 'reversed visual' order to the correct 'logical' order.
def convert_visual_to_logical(text: str) -> str:

    ARABIC_CHARS = re.compile(r'[\u0600-\u06FF\u0750-\u077F\u08A0-\u08FF\uFB50-\uFDFF\uFE70-\uFEFF]+')
    ARABIC_TRAILING_PUNCT = re.compile(r'([،؟؛!,.:?()\'"]+)$')

    words = text.split()
    fixed_words = []
    arabic_words = []

    for w in words:
        if ARABIC_CHARS.search(w):
            m = ARABIC_TRAILING_PUNCT.search(w)
            if m:
                punct = m.group(1)
                core_word = w[:-len(punct)]
            else:
                punct = ''
                core_word = w

            reversed_core = core_word[::-1]

            arabic_words.append(reversed_core + punct)
        else:
            if arabic_words:
                fixed_words.extend(arabic_words[::-1])
                arabic_words = []
            fixed_words.append(w)

    if arabic_words:
        fixed_words.extend(arabic_words[::-1])

    return ' '.join(fixed_words)


ocr = PaddleOCR(
    lang='ar',
    use_doc_orientation_classify=False,
    use_doc_unwarping=False,
    use_textline_orientation=False
)

img_path = r"path\to\your\image"

results = ocr.predict(img_path)

all_data = []

if results and results[0]:
    ocr_data = results[0]
    
    texts = ocr_data.get('rec_texts', [])
    scores = ocr_data.get('rec_scores', [])
    polygons = ocr_data.get('rec_polys', [])
    
    for text, score, poly in zip(texts, scores, polygons):
        all_data.append({
            # Convert text
            "text": convert_visual_to_logical(text),
            "confidence": score,
            "points": poly.tolist() 
        })

with open("output.json", "w", encoding="utf-8") as f:
    json.dump(all_data, f, ensure_ascii=False, indent=2)

print("Saved converted OCR results to output.json")

I hope it helps someone else.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PaddleOCR OCR analyzes Left-to-Right instead of Right-to-Left for Arabic- How to process RTL languages correctly? #14947

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

PaddleOCR OCR analyzes Left-to-Right instead of Right-to-Left for Arabic- How to process RTL languages correctly? #14947

Uh oh!

roostaeecs Mar 27, 2025

draw result

Replies: 2 comments

Uh oh!

GreatV Mar 27, 2025 Maintainer

Uh oh!

Uh oh!

timminator Aug 10, 2025

roostaeecs
Mar 27, 2025

GreatV
Mar 27, 2025
Maintainer

timminator
Aug 10, 2025