Text Detection Dataset Annotation & end-to-end evaluation #15400
Answered by
liuhongen1234567
May 31, 2025
Hello, the default annotation for the text detection model is a 4-point box; a rotated detection box is not needed. For training details, please refer to section 4.1, "Dataset and Pretrained Model Preparation", in https://paddlepaddle.github.io/PaddleOCR/latest/en/version3.x/module_usage/text_detection.html#4-custom-development

The specific label format is as follows:

images/train_img_61.jpg [{"transcription": "###", "points": [[427, 293], [469, 293], [468, 315], [425, 314]]}, {"transcription": "###", "points": [[480, 291], [651, 289], [650, 311], [479, 313]]}, {"transcription": "Ave", "points": [[655, 287], [698, 287], [696, 309], [652, 309]]}, {"transcription": "West", "points": [[701, 285], [759, 285], [759, 308], [701, 308]]}, {"transcription": "YOU", "points": [[1044, 531], [1074, 536], [1076, 585], [1046, 579]]}, {"transcription": "CAN", "points": [[1077, 535], [1114, 539], [1117, 595], [1079, 585]]}, {"transcription": "PAY", "points": [[1119, 539], [1160, 543], [1158, 601], [1120, 593]]}, {"transcription": "LESS?", "points": [[1164, 542], [1252, 545], [1253, 624], [1166, 602]]}, {"transcription": "Singapore's", "points": [[1032, 177], [1185, 73], [1191, 143], [1038, 223]]}, {"transcription": "no.1", "points": [[1190, 73], [1270, 19], [1278, 91], [1194, 133]]}]
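For anyone checking their own label file against this format, here is a minimal Python sketch that parses one label line and verifies that every box has exactly four points. It rests on two assumptions that this post does not state: that the image path and the JSON list are separated by a tab character (as the linked PaddleOCR documentation describes), and that a transcription of "###" marks a region to be ignored, following the ICDAR-style convention. The function name `parse_det_label` is hypothetical and is not part of PaddleOCR.

```python
import json


def parse_det_label(line):
    """Split one detection label line into (image_path, annotations).

    Assumes the layout '<image path>\t<JSON list>' (tab separator is an
    assumption taken from the PaddleOCR docs, not from this post).
    """
    image_path, sep, payload = line.rstrip("\n").partition("\t")
    if not sep:
        raise ValueError("expected a tab between the image path and the JSON list")

    annotations = json.loads(payload)
    for ann in annotations:
        # Each box is a 4-point polygon given as [x, y] pairs.
        if len(ann["points"]) != 4:
            raise ValueError(f"box for {ann['transcription']!r} does not have 4 points")
    return image_path, annotations


# Two boxes copied from the sample label above.
sample = (
    "images/train_img_61.jpg\t"
    '[{"transcription": "###", "points": [[427, 293], [469, 293], [468, 315], [425, 314]]}, '
    '{"transcription": "Ave", "points": [[655, 287], [698, 287], [696, 309], [652, 309]]}]'
)

path, anns = parse_det_label(sample)

# Assumption: "###" marks a region that is ignored, so filter it out here.
readable = [a["transcription"] for a in anns if a["transcription"] != "###"]
print(path, readable)
```

Running a check like this over every line of a label file before training can catch malformed JSON or boxes with the wrong number of points early.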
Answer selected by
Jeep-Z