Why does the same file give very different results on the demo and via a pipeline call? #15002

lixjohn · 2025-04-11T07:19:42Z

This is the result I got from the demo link:
https://aistudio.baidu.com/community/app/91660/webUI?source=appMineRecent

But when I call the default pipeline from Python, the result is very different. Here is the code:
from paddlex import create_pipeline

pipeline = create_pipeline(pipeline="OCR")

output = pipeline.predict(
    input="1.jpg",
    use_doc_orientation_classify=False,
    use_doc_unwarping=False,
    use_textline_orientation=False,
)
for res in output:
    res.print()
    res.save_to_img(save_path="./output/")
    res.save_to_json(save_path="./output/")

The output is:

The result of the call is clearly very different from the demo result. Which parameters do I need to set here?

GreatV · 2025-04-11T07:22:16Z

Maintainer

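The noticeable difference between the OCR results you get on the demo page and in your Python code (using the default pipeline) most likely comes from the following causes, each with a key parameter to set:

① Different default parameter settings are the main cause
The online demo sets a high limit on the image's long side (for example 4000), whereas your Python code calls create_pipeline(pipeline="OCR") with the default configuration, whose image preprocessing parameters (such as det_max_side_len) may differ. As a result, the model receives input images at a different resolution.

In your Python code, specify the maximum image side length explicitly, for example: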

from paddlex import create_pipeline
pipeline = create_pipeline(pipeline="OCR", det_max_side_len=4000)

Or, with another interface (such as the native paddleocr one):

from paddleocr import PaddleOCR
ocr = PaddleOCR(det_max_side_len=4000, ocr_version="PP-OCRv4", use_angle_cls=True)
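To see why this limit matters, here is a minimal sketch of how a long-side cap changes the resolution the detector works on. It assumes the preprocessing shrinks an image only when its long side exceeds the limit and then rounds each side to a multiple of 32. `limited_size` is a hypothetical helper, not part of paddlex or paddleocr, and the actual resize rule and the name of the parameter that controls it depend on the installed version.

```python
def limited_size(height, width, max_side_len, multiple=32):
    """Return the (height, width) a detector would see under a long-side limit.

    Assumption: the image is shrunk only when its long side exceeds the limit,
    and each side is then rounded to a multiple of `multiple` (at least one).
    """
    long_side = max(height, width)
    ratio = max_side_len / long_side if long_side > max_side_len else 1.0

    def snap(value):
        # Scale one side, then snap it to the nearest multiple.
        return max(round(value * ratio / multiple) * multiple, multiple)

    return snap(height), snap(width)


# Compare a low limit with the limit of 4000 mentioned above.
for limit in (960, 4000):
    print(limit, limited_size(1920, 3840, limit))
```

Under these assumptions a 1920x3840 image is reduced to 480x960 with a limit of 960 but keeps its full size with a limit of 4000, which is enough to make small text unreadable in one case and readable in the other.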

② The online demo may enable some enhancement features that your code does not
According to your code:

output = pipeline.predict(
    input="1.jpg",
    use_doc_orientation_classify=False,
    use_doc_unwarping=False,
    use_textline_orientation=False,
)

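You disabled document orientation classification, image unwarping, and text line orientation correction. In the online demo these may be enabled by default. If your input image is rotated or has distortion such as affine warping, disabling them will reduce recognition quality.

Try setting these options to True: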

output = pipeline.predict(
    input="1.jpg",
    use_doc_orientation_classify=True,
    use_doc_unwarping=True,
    use_textline_orientation=True,
)
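If it is unclear which of the three options is responsible, one approach is to try every on/off combination and compare the outputs. The sketch below only uses the predict arguments and res.print() call already shown in this thread. `flag_combinations` and `sweep` are hypothetical helper names, and `pipeline` is assumed to be the object returned by create_pipeline(pipeline="OCR").

```python
from itertools import product

FLAGS = (
    "use_doc_orientation_classify",
    "use_doc_unwarping",
    "use_textline_orientation",
)


def flag_combinations():
    """Yield every on/off combination of the three flags as a kwargs dict."""
    for values in product((False, True), repeat=len(FLAGS)):
        yield dict(zip(FLAGS, values))


def sweep(pipeline, image_path):
    """Run pipeline.predict once per flag combination and print each result."""
    for kwargs in flag_combinations():
        print(kwargs)
        for res in pipeline.predict(input=image_path, **kwargs):
            res.print()
```

A call such as sweep(create_pipeline(pipeline="OCR"), "1.jpg") runs eight predictions, so it is only practical for a single test image.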

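③ Model version consistency
Make sure your local OCR pipeline uses the same OCR version as the demo. For example, the demo uses PP-OCRv4, while the default for paddlex's create_pipeline may be an older model.

You can specify the PP-OCRv4 model explicitly, for example with PaddleOCR from paddleocr: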

from paddleocr import PaddleOCR
ocr = PaddleOCR(use_angle_cls=True, det_db_box_thresh=0.3, det_db_score_mode="slow",
                use_space_char=True, max_text_length=512, ocr_version="PP-OCRv4", det_max_side_len=4000)
ocr.ocr("1.jpg")
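Before comparing models, it can help to record which package versions are installed locally, since defaults and parameter names change between releases. This is a small sketch using the standard library only. The list of distribution names is an assumption about what may be installed, and the package version alone does not reveal which model weights the pipeline loads.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages=("paddlex", "paddleocr", "paddlepaddle", "paddlepaddle-gpu")):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


print(installed_versions())
```

Including this output when reporting a problem makes it easier for others to reproduce the local setup.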

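④ Image resolution and compression
Make sure the image you read locally has not been compressed from its original resolution. In some cases the default image loading method changes the characteristics of the original image, which affects recognition.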
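One way to rule out re-encoding is to check that the local file and the copy uploaded to the demo are byte-for-byte identical. The sketch below compares SHA-256 digests with the standard library. `file_sha256` and `same_file` are hypothetical helper names, and any second file path is whatever copy you sent to the demo.

```python
import hashlib


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file(path_a, path_b):
    """Return True when both files contain exactly the same bytes."""
    return file_sha256(path_a) == file_sha256(path_b)
```

If the digests differ, the image was modified somewhere along the way (for example resized or recompressed), and the two results are not directly comparable.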

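⑤ The demo uses an optimized inference environment
The backend of the online demo may use an optimized inference environment, GPU support, and special preprocessing strategies that are not reflected when create_pipeline is called directly on your machine. To improve accuracy, see the official documentation on high-performance deployment (https://github.com/PaddlePaddle/PaddleOCR/blob/develop/deploy/README.md).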

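Summary of suggestions:

Explicitly set det_max_side_len=4000
Try setting the use_doc_orientation_classify / use_doc_unwarping / use_textline_orientation options to True
Use the official paddleocr interface and specify ocr_version="PP-OCRv4"
Check that the way the image is read preserves its original quality
If necessary, compare against the command line tool or the high-performance deployment option provided by paddleocr

Related GitHub issue:
The problem you ran into is essentially the same as one raised by another community user; see:
#14154

If you still have questions, please post the complete code with your local parameter settings so the problem can be narrowed down further.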

Response generated by 🤖 feifei-bot | chatgpt-4o-latest

⚠️ This is an AI-generated response. Please verify any critical information.
