v3.3.13
·
2 commits
to release/3.3
since this release
2026.1.12 v3.3.13 released
- PaddleOCR-VL adds a new
concatenate_pagesmethod for concatenating multi-page parsing results, supporting preservation of multi-level heading structures and merging tables that span across pages. - GenAIClient now supports specifying a custom model name during construction.
- PaddleOCR-VL-0.9B supports local inference and allows passing
min_pixelsandmax_pixelsparameters on each prediction. - @alealexpro100 fixed an issue where the
cyrillic_PP-OCRv5_mobile_recmodel could not enable high-performance inference under PaddlePaddle 3.1.1 + CUDA 12. - @szepeviktor fixed an issue where the width and height were displayed in the wrong order in logs when the image size exceeded the limit for text detection models.
- @metax666 @duqimeng PP-StructureV3 now supports running on MetaX GPUs.
- Added support for using the internally integrated PaddleOCR-VL-0.9B in vLLM and SGLang (requires newer versions of vLLM and SGLang).
- Fixed errors in the application packaging documentation and updated sample code to be compatible with the latest APIs.
2025.1.12 v3.3.13 发布
- PaddleOCR-VL 新增
concatenate_pages方法,用于拼接多页解析结果,支持保留多级标题结构、合并跨页表格。 GenAIClient在构造时支持指定自定义模型名称。- PaddleOCR-VL-0.9B 支持本地推理,并允许在每次预测时传入
min_pixels和max_pixels参数。 - @alealexpro100 修复了
cyrillic_PP-OCRv5_mobile_rec模型在 PaddlePaddle 3.1.1 + CUDA 12 环境下无法启用高性能推理的问题。 - @szepeviktor 修复了文本检测模型在图像尺寸超过限制时,日志中宽高显示顺序错误的问题。
- @metax666 @duqimeng PP-StructureV3 现已支持在沐曦 GPU 上运行。
- 支持使用 vLLM、SGLang 内部集成的 PaddleOCR-VL-0.9B(依赖高版本 vLLM 和 SGLang)。
- 修复应用打包文档中的错误,更新示例代码以适配最新接口。
Full Changelog: v3.3.12...v3.3.13