Text Detection Dataset Annotation & end-to-end evaluation #15400

Jeep-Z · 2025-05-26T08:58:52Z

Jeep-Z
May 26, 2025

I wanna finetune text detection model and use 4 points polygon annotation. Should it be converted to rotated rectangle? Or 4 points polygon annotation also supported?
Is there an end-to-end evaluation module?

May 31, 2025

Hello, the default annotation for the text detection model is a 4-point box, and there is no need for a rotated detection box. For detailed training, please refer to section 4.1 “Dataset and Pretrained Model Preparation.” in https://paddlepaddle.github.io/PaddleOCR/latest/en/version3.x/module_usage/text_detection.html#4-custom-development
The specific label format is as follows:

images/train_img_61.jpg	[{"transcription": "###", "points": [[427, 293], [469, 293], [468, 315], [425, 314]]}, {"transcription": "###", "points": [[480, 291], [651, 289], [650, 311], [479, 313]]}, {"transcription": "Ave", "points": [[655, 287], [698, 287], [696, 309], [652, 309]]}, {"transcription": "West", "points"…

View full answer

liuhongen1234567 · 2025-05-31T12:26:06Z

liuhongen1234567
May 31, 2025
Collaborator

Hello, the default annotation for the text detection model is a 4-point box, and there is no need for a rotated detection box. For detailed training, please refer to section 4.1 “Dataset and Pretrained Model Preparation.” in https://paddlepaddle.github.io/PaddleOCR/latest/en/version3.x/module_usage/text_detection.html#4-custom-development
The specific label format is as follows:

images/train_img_61.jpg	[{"transcription": "###", "points": [[427, 293], [469, 293], [468, 315], [425, 314]]}, {"transcription": "###", "points": [[480, 291], [651, 289], [650, 311], [479, 313]]}, {"transcription": "Ave", "points": [[655, 287], [698, 287], [696, 309], [652, 309]]}, {"transcription": "West", "points": [[701, 285], [759, 285], [759, 308], [701, 308]]}, {"transcription": "YOU", "points": [[1044, 531], [1074, 536], [1076, 585], [1046, 579]]}, {"transcription": "CAN", "points": [[1077, 535], [1114, 539], [1117, 595], [1079, 585]]}, {"transcription": "PAY", "points": [[1119, 539], [1160, 543], [1158, 601], [1120, 593]]}, {"transcription": "LESS?", "points": [[1164, 542], [1252, 545], [1253, 624], [1166, 602]]}, {"transcription": "Singapore's", "points": [[1032, 177], [1185, 73], [1191, 143], [1038, 223]]}, {"transcription": "no.1", "points": [[1190, 73], [1270, 19], [1278, 91], [1194, 133]]}]

3 replies

liuhongen1234567 May 31, 2025
Collaborator

For end-to-end evaluation, you can refer to E2EMetric in PGNet

PaddleOCR/configs/e2e/e2e_r50_vd_pg.yml

Line 63 in fd5b4e1

Metric:

Jeep-Z Jun 1, 2025
Author

For end-to-end evaluation, I want to evaluate detection and recognition in pipeline. Thx, I will evaluate det & rec one by one.

liuhongen1234567 Jun 1, 2025
Collaborator

Hello, PaddleOCR provides complete end-to-end inference code, such as python tools/infer/predict_system.py --image_dir="testB" --det_model_dir="./ch_PP-OCRv4_det_infer/" --rec_model_dir="./ch_PP-OCRv4_rec_infer/" --draw_img_save_dir="output" or the PP-OCRv5 pipeline https://paddlepaddle.github.io/PaddleOCR/main/en/version3.x/pipeline_usage/OCR.html#1-ocr-pipeline-introduction. Unfortunately, as far as I know, there is no corresponding end-to-end evaluation code provided. You can refer to E2EMetric and the end-to-end output results to implement the end-to-end evaluation yourself.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Text Detection Dataset Annotation & end-to-end evaluation #15400

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Text Detection Dataset Annotation & end-to-end evaluation #15400

Uh oh!

Uh oh!

Jeep-Z May 26, 2025

Replies: 1 comment · 3 replies

Uh oh!

liuhongen1234567 May 31, 2025 Collaborator

Uh oh!

liuhongen1234567 May 31, 2025 Collaborator

Uh oh!

Jeep-Z Jun 1, 2025 Author

Uh oh!

liuhongen1234567 Jun 1, 2025 Collaborator

Jeep-Z
May 26, 2025

Replies: 1 comment 3 replies

liuhongen1234567
May 31, 2025
Collaborator

liuhongen1234567 May 31, 2025
Collaborator

Jeep-Z Jun 1, 2025
Author

liuhongen1234567 Jun 1, 2025
Collaborator