Missing first line of paragraph #14972
This appears to be a known issue where PaddleOCR may miss lines — often the first line of a paragraph — particularly in structured or dense documents such as reimbursement policy files. Based on your description and images, the red-marked areas clearly show missing lines that should have been detected and recognized. The potential causes for this behavior include:
🛠 Recommended Steps to Mitigate:
📌 Related Reference: Though that example is in Chinese and not specific to missing first lines, the underlying cause (missed detection) is relevant. Let me know if you'd like help adjusting detection thresholds or running a debug visualization to confirm whether the detector is missing the lines entirely.
Response generated by 🤖 feifei-bot | chatgpt-4o-latest
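One way to check whether the detector is skipping lines entirely, before touching any thresholds, is to look for unusually tall vertical gaps between the detected text boxes. The snippet below is a minimal sketch of that idea, not part of PaddleOCR: `find_suspicious_gaps` and `gap_factor` are hypothetical names, and it assumes the detection boxes have already been extracted as 4-point polygons (`[[x, y], ...]`) for a single-column page.

```python
from statistics import median


def find_suspicious_gaps(boxes, gap_factor=1.0):
    """Return (gap_top, gap_bottom) y-ranges where a text line may be missing.

    boxes: list of 4-point polygons [[x, y], ...], one per detected line.
    gap_factor: a gap is flagged when it is taller than
                gap_factor * median detected line height (assumed heuristic).
    """
    if len(boxes) < 2:
        return []

    # Reduce each polygon to its vertical extent (top, bottom), sorted by top.
    spans = sorted(
        (min(p[1] for p in box), max(p[1] for p in box)) for box in boxes
    )
    line_height = median(bottom - top for top, bottom in spans)

    gaps = []
    covered_until = spans[0][1]  # lowest y already covered by a detected box
    for top, bottom in spans[1:]:
        if top - covered_until > gap_factor * line_height:
            gaps.append((covered_until, top))
        covered_until = max(covered_until, bottom)
    return gaps


# Illustrative data only: four 20 px lines, with one line missing around y=60..80.
example_boxes = [
    [[10, 10], [300, 10], [300, 30], [10, 30]],
    [[10, 35], [300, 35], [300, 55], [10, 55]],
    [[10, 90], [300, 90], [300, 110], [10, 110]],
    [[10, 115], [300, 115], [300, 135], [10, 135]],
]
print(find_suspicious_gaps(example_boxes))
```

If the flagged ranges line up with the red-marked areas, the lines are being lost at the detection stage rather than at recognition. Ordinary paragraph spacing can also trip this heuristic, so `gap_factor` may need raising for loosely spaced documents. In PaddleOCR 2.x the detector thresholds are exposed as `det_db_thresh` and `det_db_box_thresh`; the names differ between releases, so check them against the installed version.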
🔎 Search before asking
🐛 Bug (problem description)
I am using this OCR on an insurance reimbursement policy document. I found that the OCR sometimes misses an entire line. When it does, the missed line is usually the first line of a paragraph.
In the attached file, the missed lines are marked with red boxes:
File with missing text highlighted

Original file (same as above but without highlight)

🏃♂️ Environment
🌰 Minimal Reproducible Example