yolo dataset format to paddleocr dataset format #13177
I want to fine-tune PaddleOCR. I have an image and label dataset in YOLOv8 format. How do I convert it to the PaddleOCR format and folder structure, and then fine-tune the model?
Replies: 2 comments 2 replies
-
Please refer to PaddleOCR/tools/infer/predict_system.py, lines 124 to 144 (at commit 4336771).
-
The annotation file format supported by the text detection algorithm in PaddleOCR uses one line per image: the image path and its annotation information, separated by '\t'.
We do not provide tools for data conversion, but you can easily convert the YOLO format to the PaddleOCR detection format. If you do not have a field for text content, you can set the transcription field to an empty string, such as "".
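For illustration, here is a rough sketch of such a conversion. It is not an official tool. It assumes each YOLO label row is a plain bounding box (`class cx cy w h`, normalized to the image size), that you already know each image's width and height, and that the detection label line is the image path, a tab, then a JSON list of `{"transcription", "points"}` entries with the four corners listed clockwise from the top-left. The function name `yolo_to_paddle_line` and its parameters are made up for this example.

```python
import json


def yolo_to_paddle_line(image_path, yolo_label_text, img_w, img_h, transcription=""):
    """Build one PaddleOCR detection label line from the text of one YOLO label file.

    Hypothetical helper: assumes rows of the form `class cx cy w h` (normalized).
    Rows with any other number of fields (for example polygon rows) are skipped.
    """
    annotations = []
    for row in yolo_label_text.splitlines():
        parts = row.split()
        if len(parts) != 5:
            continue  # blank line or not a plain bounding-box row
        _cls, cx, cy, w, h = parts
        # Scale normalized values to pixel units.
        cx, w = float(cx) * img_w, float(w) * img_w
        cy, h = float(cy) * img_h, float(h) * img_h
        x1, y1 = round(cx - w / 2), round(cy - h / 2)
        x2, y2 = round(cx + w / 2), round(cy + h / 2)
        annotations.append({
            # Empty string when there is no text content for the box.
            "transcription": transcription,
            # Four corners, clockwise starting at the top-left.
            "points": [[x1, y1], [x2, y1], [x2, y2], [x1, y2]],
        })
    # Image path and JSON-encoded annotations, separated by a tab.
    return image_path + "\t" + json.dumps(annotations, ensure_ascii=False)


if __name__ == "__main__":
    # One box centered in a 100x200 image.
    print(yolo_to_paddle_line("imgs/a.jpg", "0 0.5 0.5 0.2 0.1\n", 100, 200))
```

You would call this once per image and write the returned lines into a single label text file. The image path should be relative to whatever data directory you set in your training config.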
Text detection reference document: https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_en/detection_en.md
Text recognition r…