为什么使用paddleocr.PaddleOCR和paddleocr.TextDetection预测同一张图片会得到不同的文本框 #15705

lin-contextere · 2025-06-12T18:32:04Z

lin-contextere
Jun 12, 2025

我是用paddleocr.PaddleOCR和paddleocr.TextDetection预测同一张图片，两种方法我都用的默认设置，在检测部分，两种方法应该都是用的'PP-OCRv5_server_det'模型，但是两种方法会得到不同的文本框。我想问一下是两种方法的默认模型参数设置不一样吗？如果是的话，我应该如何查看和比较默认设置？

版本： PaddleOCRv3

复现代码：
from paddleocr import PaddleOCR, TextDetection

ocr = PaddleOCR(
use_doc_orientation_classify=False,
use_doc_unwarping=False,
use_textline_orientation=False)

det = TextDetection()

img_path = "0000189.jpg"

ocr_result = ocr.predict(input=img_path)
for res in ocr_result:
res.save_to_img("output")
res.save_to_json(save_path="./output/ocr_res.json")

det_result = det.predict(input=img_path)
for res in det_result:
res.save_to_img("output")
res.save_to_json(save_path="./output/det_res.json")

原图：

OCR结果：

ocr_res.json

Detection结果：

det_res.json

OCR结果包含6个文本框，Detection结果包含3个文本框。

liuhongen1234567 · 2025-06-13T14:25:59Z

liuhongen1234567
Jun 13, 2025
Collaborator

您好，OCR产线中的文本检测并不是默认配置，详细可参考 https://github.com/PaddlePaddle/PaddleX/blob/b592f3760d815a245358989c15c08b82bcf50162/paddlex/configs/pipelines/OCR.yaml#L25 。文本检测模块的默认配置可以通过下载推理模型，查看inference.yaml 获取

2 replies

lin-contextere Jun 13, 2025
Author

收到，十分感谢！

liuhongen1234567 Jun 13, 2025
Collaborator

我这边又看了一下，检测模块用的是960 max，OCR产线用的是64 min,两者的缩放策略不一样，这么改检测模块的配置参数，两者应该是对齐的

from paddleocr import TextDetection
model = TextDetection(model_name="PP-OCRv5_server_det",limit_side_len=64, limit_type="min")
output = model.predict("/paddle/env/454535939-d8fe26ab-156c-4ab6-bf63-0d9611d2616b.jpg", batch_size=1)
for res in output:
    res.print()
    res.save_to_img(save_path="./output1/")
    res.save_to_json(save_path="./output1/res.json")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

为什么使用paddleocr.PaddleOCR和paddleocr.TextDetection预测同一张图片会得到不同的文本框 #15705

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

为什么使用paddleocr.PaddleOCR和paddleocr.TextDetection预测同一张图片会得到不同的文本框 #15705

Uh oh!

lin-contextere Jun 12, 2025

Replies: 1 comment · 2 replies

Uh oh!

Uh oh!

liuhongen1234567 Jun 13, 2025 Collaborator

Uh oh!

lin-contextere Jun 13, 2025 Author

Uh oh!

liuhongen1234567 Jun 13, 2025 Collaborator

lin-contextere
Jun 12, 2025

Replies: 1 comment 2 replies

liuhongen1234567
Jun 13, 2025
Collaborator

lin-contextere Jun 13, 2025
Author

liuhongen1234567 Jun 13, 2025
Collaborator