[PPOCR rec] Image size of Fine-tuning & Inference for custom dataset #14835
Unanswered
PhuongNDVN asked this question in Q&A
Replies: 1 comment
For your questions regarding image size during fine-tuning and inference with PPOCR recognition:
1. What should I set as the image size for PPOCR recognition during training?
2. Should I resize cropped text-region images while keeping the aspect ratio before passing them to OCR for inference?
3. Low minimum prediction score for individual characters
To diagnose this issue further, consider visualizing the low-confidence predictions and tuning the threshold applied to the confidence scores. For more details, refer to the related discussion on fine-tuning PPOCRv4 recognition.
Response generated by 🤖 feifei-bot | chatgpt-4o-latest
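As a rough illustration of the low-confidence check suggested above, here is a minimal sketch in plain Python. It assumes you have already extracted one score per recognized character. The stock recognizer output is usually a single score per text line, so the `chars`/`scores` inputs, the function name `summarize_char_scores`, and the 0.5 threshold are all hypothetical and only for illustration.

```python
def summarize_char_scores(chars, scores, threshold=0.5):
    """Summarize per-character confidences for one recognized word.

    chars:     recognized characters (string or list of str), hypothetical input
    scores:    one confidence in [0, 1] per character, hypothetical input
    threshold: illustrative cut-off below which a character is flagged
    """
    if len(chars) != len(scores):
        raise ValueError("chars and scores must have the same length")
    if len(scores) == 0:
        raise ValueError("at least one character is required")

    # Collect (position, character, score) for every character below the cut-off.
    flagged = [
        (i, c, s)
        for i, (c, s) in enumerate(zip(chars, scores))
        if s < threshold
    ]
    return {
        "min": min(scores),
        "mean": sum(scores) / len(scores),
        "flagged": flagged,
    }


if __name__ == "__main__":
    summary = summarize_char_scores("HELLO", [0.99, 0.42, 0.97, 0.98, 0.95])
    print(summary["min"], summary["flagged"])
```

Logging the flagged positions over a validation set may show whether the low scores cluster on particular characters (for example, visually similar glyphs) or on particular positions in the crop, which can guide where to add training data.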
I am trying to fine-tune PPOCR recognition. When I fix the height of my training images at 48 and keep the original aspect ratio, the widths range from 376 to 460. What image size should I set for PPOCR recognition during training?
My second question is: should I resize the cropped text-region images while keeping the same aspect ratio before passing them to OCR for inference?
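To make the two image-size questions concrete, here is a small arithmetic sketch. It assumes the common "resize to a fixed height while keeping the aspect ratio, cap at a maximum width, then pad on the right" scheme. The function name and the `max_w` values are illustrative assumptions. 320 appears as the width in many shipped PP-OCR recognition configs (`image_shape: [3, 48, 320]`), but check your own config before relying on it.

```python
def resized_and_padded_width(orig_h, orig_w, target_h=48, max_w=320):
    """Return (resized_w, pad_w) for a ratio-preserving resize to target_h.

    The resized width is capped at max_w. When the cap is hit, the image is
    squashed horizontally and the aspect ratio is no longer preserved.
    """
    if orig_h <= 0 or orig_w <= 0:
        raise ValueError("image dimensions must be positive")
    # Integer ceiling of target_h * orig_w / orig_h, avoiding float rounding.
    ideal_w = (target_h * orig_w + orig_h - 1) // orig_h
    resized_w = min(max_w, ideal_w)
    pad_w = max_w - resized_w  # columns of padding added on the right
    return resized_w, pad_w


if __name__ == "__main__":
    # Widths from the question (376 to 460 at height 48).
    for w in (376, 460):
        print(w, resized_and_padded_width(48, w, max_w=320),
              resized_and_padded_width(48, w, max_w=480))
```

Under these assumptions, a 320-wide target would squash every image in the 376 to 460 range, while a target of 480 would keep the aspect ratio and pad the remainder. Whatever width is chosen, applying the same resize-and-pad rule to the cropped regions at inference keeps the input distribution consistent with training.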
The model I fine-tuned predicts well on my custom data, but the minimum prediction score among the individual characters of a word is low.