PaddleX/latest/pipeline_deploy/serving #3524
29 comments · 114 replies
-
Hello, after starting the service with `paddlex --serve --pipeline image_classification`, what command shuts it down?
-
`paddlex --serve --pipeline {pipeline name or pipeline config file path} [{other CLI options}]`: if I'm deploying multiple pipelines, do I need to start multiple services?
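A `paddlex --serve` invocation takes a single `--pipeline`, so one commonly suggested approach is to run one service per pipeline, each on its own port (a command sketch; the `--port` flag also appears later in this thread, but verify the options against your PaddleX version):

```shell
# Sketch: one serving process per pipeline, each listening on its own port.
paddlex --serve --pipeline OCR --port 8080 &
paddlex --serve --pipeline image_classification --port 8081 &
```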
-
Does serving deployment support high-performance inference on real-time data?
-
I'm testing high-stability serving deployment with the general OCR SDK. The local Docker container starts fine, `client.py` passes against GRPCInferenceService, and the Metrics Service is reachable, but every HTTPService request fails with 400 Bad Request. Is there API documentation for HTTPService? What are the request parameters, and how do I call HTTPService correctly?
-
All of the SDK download links are dead; please fix them.
-
Hello
-
Deployed via Docker. Why does `λ localhost ~/PaddleX paddlex --serve --pipeline OCR`
-
How do I call a high-stability serving deployment over HTTP?
-
Hello, this SDK can't be downloaded again.
-
Hello, if my CUDA version is 12.4, how do I get a matching build of ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.1-gpu? This image only supports CUDA 11.8, right?
-
How do I enable high-performance inference for serving deployment?
-
Running step 1.1 `paddlex --install serving` keeps failing. OS: AlmaLinux 9.6. On Linux you strongly recommend installing PaddleX via Docker, so why is there no tutorial here for running serving deployment with a Docker-installed PaddleX? Please advise, thanks!

```
Using cached future-1.0.0-py3-none-any.whl (491 kB)
[notice] A new release of pip is available: 25.0.1 -> 25.1.1
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
```
-
https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/serving.html#23

The adjusted pipeline_config.yaml:

```yaml
pipeline_name: OCR
text_type: general
use_doc_preprocessor: True
SubPipelines:
SubModules:
```
-
For a model that still needs to be downloaded after the service starts, what can I do when the download fails due to network issues? Can I mount the model manually?

```
I0718 07:56:17.887946 7 grpc_server.cc:4117] Started GRPCInferenceService at 0.0.0.0:8001
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
```
-
In section 2.4.2, "Manually constructing an HTTP request", the request body shown is malformed: the JSON inside "data" should be a String, not a nested object.
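A minimal sketch of what such a request body might look like, assuming a Triton-style KServe v2 HTTP endpoint with the `input`/`output` TYPE_STRING tensors shown later in this thread (the URL, model name, and the `"file"` payload key are hypothetical; check them against your SDK's documentation):

```python
import base64
import json
import urllib.request

# Hypothetical endpoint and model name; adjust to your deployment.
url = "http://localhost:8000/v2/models/ocr/infer"

# Stand-in image bytes; replace with open("your.jpg", "rb").read().
image_b64 = base64.b64encode(b"\x89PNG fake image bytes").decode("ascii")

# Key point from the comment above: the pipeline's JSON payload travels as a
# single *string* element inside "data" (strings are BYTES tensors in the
# KServe v2 protocol), not as a nested JSON object.
inner = json.dumps({"file": image_b64})  # the "file" key is an assumption

payload = {
    "inputs": [
        {
            "name": "input",   # matches the TYPE_STRING / dims [1] config
            "shape": [1, 1],
            "datatype": "BYTES",
            "data": [inner],
        }
    ],
    "outputs": [{"name": "output"}],
}

req = urllib.request.Request(
    url,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# resp = urllib.request.urlopen(req)  # uncomment against a live server
```

Sending the inner JSON as an object instead of a string is exactly the kind of mismatch that produces a 400 Bad Request from the HTTP frontend.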
-
Could someone take a look at this issue when you have a moment:
After running the above, two problems show up.
-
Using
-
Can high-stability serving deployment be done without Docker? Could you publish a concrete tutorial?
-
Hi, I want to deploy two different OCR models (the general one and one I trained myself) in a single Triton instance. How should I modify the server.sh script? I see it configures the path to pipe_line_config.yaml, but with two models there are two pipeline config files. How should that be configured, or do I need to start two Docker containers?
-
I fine-tuned a Cascade-FasterRCNN-ResNet50-FPN model with PaddleX (single-model evaluation AP: 63) and deployed the object_detection pipeline via serving, pointing the model path at my custom path. Calling the endpoint over HTTP from a C# client, a single 3072×2048 image takes 2890 ms. We're on Windows and would rather not install high-performance inference through WSL. Is there another way to speed up inference? Being able to load the model directly on the inference machine would be best, like the old PaddleX GUI that supported native-library inference. Will a future PaddleX release support that?
-
Does the 3.2 image, docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.2-gpu, support CUDA 12?
-
A question: with the pipeline deployed via serving, the container logs show a Time taken of about 100 ms, but calling over HTTP from C# takes about 400 ms. What factors contribute to this gap, and how can I get closer to the container's inference speed? C# client: inference time 456.5396 ms.
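The gap between in-container time and client-observed time typically includes request serialization, payload transfer (base64-encoded images can run to megabytes), connection setup, and response parsing. A self-contained sketch of how one might isolate the client-side overhead against a toy server (the 100 ms sleep is a stand-in for the reported server-side Time taken, not PaddleX itself):

```python
import json
import threading
import time
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Toy server whose 100 ms sleep stands in for the in-container "Time taken".
class SlowHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        self.rfile.read(length)      # consume the request body
        time.sleep(0.1)              # simulated server-side inference time
        body = b'{"ok": true}'
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):    # keep the demo quiet
        pass

server = HTTPServer(("127.0.0.1", 0), SlowHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()
url = f"http://127.0.0.1:{server.server_port}/"

# A ~1 MB body approximates a base64-encoded image payload.
payload = json.dumps({"data": "x" * 1_000_000}).encode("utf-8")

start = time.perf_counter()
resp = urllib.request.urlopen(urllib.request.Request(url, data=payload))
resp.read()
elapsed = time.perf_counter() - start

# Everything beyond the server's 100 ms is client-side overhead:
# serialization, connection setup, payload transfer, response parsing.
overhead = elapsed - 0.1
server.shutdown()
```

Measuring the same split in the real C# client (time spent before the request leaves vs. total round trip, and whether connections are reused across requests) usually points at where the extra ~300 ms goes.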
-
Is there anything special about the shape parameter? Where can I find the details?
-
Is high-performance deployment currently unsupported on arm64? Corresponding Paddle info:
Launch command: `nohup paddlex --serve --pipeline OCR --port 38080 --use_hpip > /logs/ocr.log 2>&1 &`
The exception:
-
```
root@localhost:/work# paddlex --pipeline OCR \
Creating model: ('PP-LCNet_x1_0_doc_ori', None)
```

Why does OCR with the official command extract no text from this image? Where is the problem? Without the NPU, the same image parses fine. Image URL: https://img1.baidu.com/it/u=1855070411,442203363&fm=253&app=138&f=JPEG?w=800&h=1422
-
Comparing the commands directly shows the problem. What is going wrong here?

```
root@localhost:/work# paddlex --pipeline OCR \
Creating model: ('PP-LCNet_x1_0_doc_ori', None)
C++ Traceback (most recent call last): No stack trace in paddle, may be caused by external reasons.
Error Message Summary: FatalError: Aborted (core dumped)
Creating model: ('PP-LCNet_x1_0_doc_ori', None)
corrupted size vs. prev_size
```
-
Verified: on an H20, the ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlex/hps:paddlex3.2-gpu image (CUDA 11.8) fails to recognize images (the recognized image is empty and the returned rec_texts is empty). Is there a CUDA 12.6 image available?
-
Are 50-series GPUs still unsupported for high-performance inference and high-stability deployment?
-
Hi, I'm running a high-stability serving deployment of an OCR service on a V100 server. The instance configuration is:

```
backend: "python"
max_batch_size: 16
input [
  {
    name: "input"
    data_type: TYPE_STRING
    dims: [ 1 ]
  }
]
output [
  {
    name: "output"
    data_type: TYPE_STRING
    dims: [ 1 ]
  }
]
instance_group [
  {
    count: 2
    kind: KIND_GPU
    gpus: [ 2, 3, 4 ]
  }
]
```

The instances run fine. But when I monitor GPU usage in real time with nvidia-smi during asynchronous concurrent calls (via gRPC), inference only ever runs on one card at a time: it infers on GPU 2 for a few seconds, then switches to GPU 3 for a few seconds, and so on, never computing on multiple GPUs in parallel. What causes this, and is there a fix?
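For reference, Triton's model configuration also accepts one `instance_group` entry per GPU, which makes the instance-to-GPU mapping explicit (a config sketch based on Triton's documented `instance_group` semantics, not a confirmed fix for the behavior above):

```
instance_group [
  { count: 2, kind: KIND_GPU, gpus: [ 2 ] },
  { count: 2, kind: KIND_GPU, gpus: [ 3 ] },
  { count: 2, kind: KIND_GPU, gpus: [ 4 ] }
]
```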
-
PaddleX/latest/pipeline_deploy/serving
https://paddlepaddle.github.io/PaddleX/latest/pipeline_deploy/serving.html