Replies: 5 comments
-
Hello, you can set the side-length limit by adding two parameters here: limit_type_side: 960 and limit_type: max.
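To make the effect of these two settings concrete, here is a minimal sketch of what a max-side limit of 960 would do to an input image. It assumes the common behavior for this kind of detector preprocessing: the image is shrunk only when its longer side exceeds the limit, and each side is then rounded to a multiple of 32. The helper name `resize_shape`, the `stride` argument, and the 720x1280 input size are illustrative assumptions, not PaddleOCR's actual code.

```python
def resize_shape(h, w, limit=960, stride=32):
    """Return (new_h, new_w) under a 'max'-type side limit.

    Hypothetical helper: assumes the image is only shrunk when its longer
    side exceeds `limit`, and that each side is then rounded to the nearest
    multiple of `stride` (with a floor of one stride).
    """
    longer = max(h, w)
    ratio = limit / longer if longer > limit else 1.0
    new_h = max(int(round(h * ratio / stride)) * stride, stride)
    new_w = max(int(round(w * ratio / stride)) * stride, stride)
    return new_h, new_w


# Assumed input size of 720x1280 (height x width), for illustration only.
shape = resize_shape(720, 1280)
print(shape)
```

Under these assumptions a 720x1280 image is scaled so that its longer side becomes 960, while an image already within the limit keeps its size apart from the rounding.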
-
Eval results after adding them:
-
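I ran into the same problem. I think it comes from how the ICDAR2015 dataset is annotated and how the IoU metrics are defined. For example, below is the ground truth for img_10.jpg from the ICDAR2015 validation set, and these are the text boxes predicted by PP-OCRv5_server_det. Computing IoU-based metrics for this example gives very low values, because the ground-truth boxes are all word-level while the detected boxes are line-level, so they cannot be matched. Seen from the OCR pipeline as a whole, though, line-level detection is perfectly fine and does not affect the recognition model downstream. All we can say is that the ICDAR2015 annotation style and the IoU metric definition do not reflect the actual capability of PP-OCRv5_server_det. The annotations used to train PP-OCRv5_server_det were probably mostly line-level, which does not match the annotation style of the ICDAR2015 dataset.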
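The mismatch can be illustrated with a small sketch. It uses hypothetical axis-aligned boxes (real ICDAR2015 boxes are quadrilaterals) and assumes a match requires IoU of at least 0.5, the threshold commonly used for this kind of evaluation. The coordinates and the names `iou`, `gt_words`, and `pred_line` are made up for illustration.

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0


# Hypothetical word-level ground truth: three words on one text line.
gt_words = [(0, 0, 100, 30), (110, 0, 200, 30), (210, 0, 300, 30)]
# Hypothetical line-level prediction covering the whole line.
pred_line = (0, 0, 300, 30)

IOU_THRESHOLD = 0.5  # assumed matching threshold
ious = [iou(pred_line, gt) for gt in gt_words]
matched = sum(v >= IOU_THRESHOLD for v in ious)

print([round(v, 3) for v in ious])
print("matched ground-truth boxes:", matched)
```

Even though the predicted line fully covers all three words, each word box is only a fraction of the line's area, so no pair reaches the threshold. In this toy case the prediction counts as a false positive and all three words count as misses.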
-
Hello, that is indeed the case. v5 detection works at text-line granularity, not word granularity.
-
But retraining on the ICDAR data does not help either. Aren't the annotations all four-point annotations? It shouldn't have anything to do with the annotation; this is just text detection, after all.
-
🔎 Search before asking
🐛 Bug (Problem description)
W0527 22:25:26.478688 3286537 gpu_resources.cc:119] Please NOTE: device: 0, GPU Compute Capability: 8.6, Driver API Version: 12.8, Runtime API Version: 12.6
W0527 22:25:26.479807 3286537 gpu_resources.cc:164] device: 0, cuDNN Version: 9.5.
[2025/05/27 22:25:26] ppocr INFO: resume from /PP-OCRv5_server_det_pretrained
[2025/05/27 22:25:26] ppocr INFO: metric in ckpt ***************
[2025/05/27 22:25:26] ppocr INFO: is_float16:False
eval model:: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 500/500 [00:34<00:00, 14.44it/s]
[2025/05/27 22:26:01] ppocr INFO: metric eval ***************
[2025/05/27 22:26:01] ppocr INFO: precision:0.5834818775995246
[2025/05/27 22:26:01] ppocr INFO: recall:0.47279730380356283
[2025/05/27 22:26:01] ppocr INFO: hmean:0.5223404255319148
[2025/05/27 22:26:01] ppocr INFO: fps:21.754756046859978
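As a quick sanity check on the log above, the reported hmean is consistent with the harmonic mean of the reported precision and recall. The snippet below only redoes that arithmetic with the logged values and does not call any PaddleOCR code.

```python
# Values copied from the eval log above.
precision = 0.5834818775995246
recall = 0.47279730380356283

# Harmonic mean of precision and recall (the F1 score).
hmean = 2 * precision * recall / (precision + recall)
print(f"{hmean:.6f}")  # 0.522340
```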
🏃‍♂️ Environment
PaddleOCR 3.0
🌰 Minimal Reproducible Example
python3 tools/eval.py -c configs/det/PP-OCRv5/PP-OCRv5_server_det_train.yml -o Global.checkpoints="PP-OCRv5_server_det_pretrained"