Release v3.3.13 · PaddlePaddle/PaddleX

2026.1.12 v3.3.13 released

PaddleOCR-VL adds a new concatenate_pages method for concatenating multi-page parsing results, supporting preservation of multi-level heading structures and merging tables that span across pages.
GenAIClient now supports specifying a custom model name during construction.
PaddleOCR-VL-0.9B supports local inference and allows passing min_pixels and max_pixels parameters on each prediction.
@alealexpro100 fixed an issue where the cyrillic_PP-OCRv5_mobile_rec model could not enable high-performance inference under PaddlePaddle 3.1.1 + CUDA 12.
@szepeviktor fixed an issue where the width and height were displayed in the wrong order in logs when the image size exceeded the limit for text detection models.
@metax666 @duqimeng PP-StructureV3 now supports running on MetaX GPUs.
Added support for using the internally integrated PaddleOCR-VL-0.9B in vLLM and SGLang (requires newer versions of vLLM and SGLang).
Fixed errors in the application packaging documentation and updated sample code to be compatible with the latest APIs.

PaddleOCR-VL 新增 concatenate_pages 方法，用于拼接多页解析结果，支持保留多级标题结构、合并跨页表格。
GenAIClient在构造时支持指定自定义模型名称。
PaddleOCR-VL-0.9B 支持本地推理，并允许在每次预测时传入 min_pixels 和 max_pixels 参数。
@alealexpro100 修复了 cyrillic_PP-OCRv5_mobile_rec 模型在 PaddlePaddle 3.1.1 + CUDA 12 环境下无法启用高性能推理的问题。
@szepeviktor 修复了文本检测模型在图像尺寸超过限制时，日志中宽高显示顺序错误的问题。
@metax666 @duqimeng PP-StructureV3 现已支持在沐曦 GPU 上运行。
支持使用 vLLM、SGLang 内部集成的 PaddleOCR-VL-0.9B（依赖高版本 vLLM 和 SGLang）。
修复应用打包文档中的错误，更新示例代码以适配最新接口。

Full Changelog: v3.3.12...v3.3.13