Skip to content
Discussion options

You must be logged in to vote

The annotation file format supported by the text detection algorithm in PaddleOCR is as follows, separated by '\t':

" image path                    Image annotation information encoded by json.dumps"
ch4_test_images/img_61.jpg    [{"transcription": "MASA", "points": [[310, 104], [416, 141], [418, 216], [312, 179]]}, {...}]

We do not provide tools for data conversion, but you can easily convert the yolo format to the paddleocr detection format.

If you do not have a field for text content, you can set the transcription field to an empty character, such as "".
Text detection reference document: https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_en/detection_en.md
Text recognition r…

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@wdcs-krishpatel
Comment options

Comment options

You must be logged in to vote
1 reply
@wdcs-krishpatel
Comment options

Answer selected by wdcs-krishpatel
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants