PP-OCRv5误检测率升高 #15576

eritpchy · 2025-05-23T09:53:17Z

eritpchy
May 23, 2025

🔎 Search before asking

I have searched the PaddleOCR Docs and found no similar bug report.
I have searched the PaddleOCR Issues and found no similar bug report.
I have searched the PaddleOCR Discussions and found no similar bug report.

🐛 Bug (问题描述)

@Sunting78 测试案例来了
用例我都写好了, 按下面仓库里面执行就都有了

问题:

V5的模型通常表现大面积的误识别, 这和V4不一样 V4通常是小面积的
误识别的区域通常表现为人脸, 手部等区域
仓库里面只截图了V5误识别比较明显的帧, 两边都明显误识别的我就跳过了
V5 mobile能识别省略号 server不能识别, 详见:
problem/output_server_det/test3/processed_test3.mp4_000003.867.jpg
V5 server漏识别的, v4相同部位能稳定识别, 详见:
problem/output_server_det/test1/processed_test1.mp4_000012.812.jpg
problem/output_server_det/test1/processed_test1.mp4_000058.885.jpg
problem/output_server_det/test1/processed_test1.mp4_000106.066.jpg

奇怪的问题:

在上面截图中 V5-mobile继承了V4-server 一样的错误, 反过来另外两个却没有这个现象, 所以V5-mobile是基于V4-server学习出来的? (我只是猜测, 并不是很关心)

当然, 上述案例有可能是因为V5(大幅?)增强了手写文字识别导致的, 因此我想问问未来有没有可能出个专门针对印刷体的det (ch-doc-det?)

🏃‍♂️ Environment (运行环境)

Docker python:3.11 image
paddlepaddle-gpu==3.0.0
paddleocr==3.0.0

cpu, onnxruntime-directml 都一样都测过了
Windows python3.12 + paddleocr==2.10.0 + onnx 表现也一样

以上用例均用命令行进行字幕检测
paddleocr text_detection -i {frames_dir} --save_path={v5_results_dir} --device gpu:0 --model_name {v5_model_name}
paddleocr text_detection -i {frames_dir} --save_path={v4_results_dir} --device gpu:0 --model_name {v4_model_name}

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

https://github.com/eritpchy/PaddleOCR-3.0.0-TestCase/

Sunting78 · 2025-05-23T10:36:59Z

Sunting78
May 23, 2025
Collaborator

您好，感谢您的反馈，我们先自测一下，尝试是否可以通过后处理调节

0 replies

leo-q8 · 2025-05-26T03:56:31Z

leo-q8
May 26, 2025
Collaborator

@eritpchy v5 的迭代初衷更多希望一个模型解决手写、印刷体、中英文等多个场景，数据的丰富度要比 v4 高很多，所以在不同的场景可能表现差异也比较大。我们在文档场景里测试确实效果会比 V4 好很多。但在您的这个场景确实也带来了更高的误检，可以结合合模型的特点和使用场景调一下后处理的参数，比如我观察到在您的这些场景中误检的框的dt_score都不会很大，而真正字幕的框 dt_score基本都在 0.9 左右。后续我们也会持续优化，争取带来更好的体验，非常感谢您的反馈！

0 replies

leo-q8 · 2025-05-26T06:49:14Z

leo-q8
May 26, 2025
Collaborator

@eritpchy 我在测试指令中加入了 --box_thresh 0.8 来过滤误检，测试了一下test2.mp4，server 模型的效果还可以，您再试试看呢？

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

PP-OCRv5误检测率升高 #15576

Uh oh!

{{title}}

Uh oh!

Replies: 3 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

PP-OCRv5误检测率升高 #15576

Uh oh!

eritpchy May 23, 2025

🔎 Search before asking

🐛 Bug (问题描述)

🏃‍♂️ Environment (运行环境)

🌰 Minimal Reproducible Example (最小可复现问题的Demo)

Replies: 3 comments

Uh oh!

Sunting78 May 23, 2025 Collaborator

Uh oh!

Uh oh!

leo-q8 May 26, 2025 Collaborator

Uh oh!

leo-q8 May 26, 2025 Collaborator

eritpchy
May 23, 2025

Sunting78
May 23, 2025
Collaborator

leo-q8
May 26, 2025
Collaborator

leo-q8
May 26, 2025
Collaborator