Releases: PaddlePaddle/PaddleOCR
v3.2.0
2025.8.21 v3.2.0 released
-
Significant Model Additions:
- Introduced training, inference, and deployment for PP-OCRv5 recognition models in English, Thai, and Greek. The PP-OCRv5 English model delivers an 11% improvement in English scenarios compared to the main PP-OCRv5 model, with the Thai and Greek recognition models achieving accuracies of 82.68% and 89.28%, respectively.
-
Deployment Capability Upgrades:
- Full support for PaddlePaddle framework versions 3.1.0 and 3.1.1.
- Comprehensive upgrade of the PP-OCRv5 C++ local deployment solution, now supporting both Linux and Windows, with feature parity and identical accuracy to the Python implementation.
- High-performance inference now supports CUDA 12, and inference can be performed using either the Paddle Inference or ONNX Runtime backends.
- The high-stability service-oriented deployment solution is now fully open-sourced, allowing users to customize Docker images and SDKs as required.
- The high-stability service-oriented deployment solution also supports invocation via manually constructed HTTP requests, enabling client-side code development in any programming language.
-
Benchmark Support:
- All production lines now support fine-grained benchmarking, enabling measurement of end-to-end inference time as well as per-layer and per-module latency data to assist with performance analysis.
- Documentation has been updated to include key metrics for commonly used configurations on mainstream hardware, such as inference latency and memory usage, providing deployment references for users.
-
Bug Fixes:
- Resolved the issue of failed log saving during model training.
- Upgraded the data augmentation component for formula models for compatibility with newer versions of the albumentations dependency, and fixed deadlock warnings when using the tokenizers package in multi-process scenarios.
- Fixed inconsistencies in switch behaviors (e.g.,
use_chart_parsing
) in the PP-StructureV3 configuration files compared to other pipelines.
-
Other Enhancements:
- Separated core and optional dependencies. Only minimal core dependencies are required for basic text recognition; additional dependencies for document parsing and information extraction can be installed as needed.
- Enabled support for NVIDIA RTX 50 series graphics cards on Windows; users can refer to the installation guide for the corresponding PaddlePaddle framework versions.
- PP-OCR series models now support returning single-character coordinates.
- Added AIStudio, ModelScope, and other model download sources, allowing users to specify the source for model downloads.
- Added support for chart-to-table conversion via the PP-Chart2Table module.
- Optimized documentation descriptions to improve usability.
2025.8.21 v3.2.0 发布
-
重要模型新增:
- 新增 PP-OCRv5 英文、泰文、希腊文识别模型的训练、推理、部署。其中 PP-OCRv5 英文模型较 PP-OCRv5 主模型在英文场景提升 11%,泰文识别模型精度 82.68%,希腊文识别模型精度 89.28%。
-
部署能力升级:
- 全面支持飞桨框架 3.1.0 和 3.1.1 版本。
- 全面升级 PP-OCRv5 C++ 本地部署方案,支持 Linux、Windows,功能及精度效果与 Python 方案保持一致。
- 高性能推理支持 CUDA 12,可使用 Paddle Inference、ONNX Runtime 后端推理。
- 高稳定性服务化部署方案全面开源,支持用户根据需求对 Docker 镜像和 SDK 进行定制化修改。
- 高稳定性服务化部署方案支持通过手动构造HTTP请求的方式调用,该方式允许客户端代码使用任意编程语言编写。
-
Benchmark支持:
- 全部产线支持产线细粒度 benchmark,能够测量产线端到端推理时间以及逐层、逐模块的耗时数据,可用于辅助产线性能分析。
- 文档中补充各产线常用配置在主流硬件上的关键指标,包括推理耗时和内存占用等,为用户部署提供参考。
-
Bug修复:
- 修复模型训练时训练日志保存失败的问题。
- 对公式模型的数据增强部分进行了版本兼容性升级,以适应新版本的 albumentations 依赖,并修复了在多进程使用 tokenizers 依赖包时出现的死锁警告。
- 修复 PP-StructureV3 配置文件中的
use_chart_parsing
等开关行为与其他产线不统一的问题。
-
其他升级:
- 分离必要依赖与可选依赖。使用基础文字识别功能时,仅需安装少量核心依赖;若需文档解析、信息抽取等功能,用户可按需选择安装额外依赖。
- 支持 Windows 用户使用英伟达 50 系显卡,可根据安装文档安装对应版本的 paddle 框架。
- PP-OCR 系列模型支持返回单文字坐标。
- 模型新增 AIStudio、ModelScope 等下载源。可指定相关下载源下载对应的模型。
- 支持图表转表 PP-Chart2Table 单功能模块推理能力。
- 优化部分使用文档中的描述,提升易用性。
New Contributors
Full Changelog: v3.1.1...v3.2.0
v3.1.1
2025.8.15 v3.1.1 released
-
Bug Fixes:
- Added the missing methods
save_vector
,save_visual_info_list
,load_vector
, andload_visual_info_list
in thePP-ChatOCRv4
class. - Added the missing parameters
glossary
andllm_request_interval
to thetranslate
method in thePPDocTranslation
class.
- Added the missing methods
-
Documentation Improvements:
- Added a demo to the MCP documentation.
- Added information about the PaddlePaddle and PaddleOCR version used for performance metrics testing in the documentation.
- Fixed errors and omissions in the production line document translation.
-
Others:
- Changed the MCP server dependency to use the pure Python library
puremagic
instead ofpython-magic
to reduce installation issues. - Retested PP-OCRv5 performance metrics with PaddleOCR version 3.1.0 and updated the documentation.
- Changed the MCP server dependency to use the pure Python library
2025.8.15 v3.1.1 发布
-
bug修复:
- 补充
PP-ChatOCRv4
类缺失的save_vector
、save_visual_info_list
、load_vector、load_visual_info_list
方法。 - 补充
PPDocTranslation
类的translate
方法缺失的glossary 和
llm_request_interval 参数。
- 补充
-
文档优化:
- 补充 MCP 文档中的 demo。
- 补充文档中测试性能指标使用的飞桨框架与 PaddleOCR 版本。
- 修复文档翻译产线文档中的错漏。
-
其他:
- 修改 MCP 服务器依赖,使用纯 Python 库
puremagic
代替python-magic
,减少安装问题。 - 使用 3.1.0 版本 PaddleOCR 重新测试 PP-OCRv5 性能指标,更新文档。
- 修改 MCP 服务器依赖,使用纯 Python 库
Full Changelog: v3.1.0...v3.1.1
v3.1.0
2025.6.29 v3.1.0 released
-
Key Models and Pipelines:
- Added PP-OCRv5 Multilingual Text Recognition Model, which supports the training and inference process for text recognition models in 37 languages, including French, Spanish, Portuguese, Russian, Korean, etc. Average accuracy improved by over 30%. Details
- Upgraded the PP-Chart2Table model in PP-StructureV3, further enhancing the capability of converting charts to tables. On internal custom evaluation sets, the metric (RMS-F1) increased by 9.36 percentage points (71.24% -> 80.60%).
- Newly launched document translation pipeline, PP-DocTranslation, based on PP-StructureV3 and ERNIE 4.5 Turbo, which supports the translation of Markdown format documents, various complex-layout PDF documents, and document images, with the results saved as Markdown format documents. Details
-
New MCP server: Details
- Supports both OCR and PP-StructureV3 pipelines.
- Supports three working modes: local Python library, AIStudio Community Cloud Service, and self-hosted service.
- Supports invoking local services via stdio and remote services via Streamable HTTP.
-
Documentation Optimization: Improved the descriptions in some user guides for a smoother reading experience.
2025.6.29 v3.1.0 发布
-
重要模型和产线:
- 新增 PP-OCRv5 多语种文本识别模型,支持法语、西班牙语、葡萄牙语、俄语、韩语等 37 种语言的文字识别模型的训推流程。平均精度涨幅超30%。详情
- 升级 PP-StructureV3 中的 PP-Chart2Table 模型,图表转表能力进一步升级,在内部自建测评集合上指标(RMS-F1)提升 9.36 个百分点(71.24% -> 80.60%)。
- 新增基于 PP-StructureV3 和 ERNIE 4.5 Turbo 的文档翻译产线 PP-DocTranslation,支持翻译 Markdown 格式文档、各种复杂版式的 PDF 文档和文档图像,结果保存为 Markdown 格式文档。详情
-
新增MCP server:详情
- 支持 OCR 和 PP-StructureV3 两种工具;
- 支持本地Python库、星河社区云服务、自托管服务三种工作模式;
- 支持通过 stdio 调用本地服务,通过 Streamable HTTP 调用远程服务。
-
文档优化: 优化了部分使用文档描述,提升阅读体验。
v3.0.3
- Bug修复:
- 修复
enable_mkldnn
参数不生效的问题,恢复CPU默认使用MKL-DNN推理的行为。 - 随PaddleX 3.0.3 版本的其他修复
- 修复
v3.0.2
-
功能新增:
- 模型默认下载源从
BOS
改为HuggingFace
,同时也支持用户通过更改环境变量PADDLE_PDX_MODEL_SOURCE
为BOS
,将模型下载源设置为百度云对象存储BOS。 - PP-OCRv5、PP-StructureV3、PP-ChatOCRv4等pipeline新增C++、Java、Go、C#、Node.js、PHP 6种语言的服务调用示例。
- 优化PP-StructureV3产线中版面分区排序算法,对复杂竖版版面排序逻辑进行完善,进一步提升了复杂版面排序效果。
- 优化模型选择逻辑,当指定语言、未指定模型版本时,自动选择支持该语言的最新版本的模型。
- 为MKL-DNN缓存大小设置默认上界,防止缓存无限增长。同时,支持用户配置缓存容量。
- 更新高性能推理默认配置,支持Paddle MKL-DNN加速。优化高性能推理自动配置逻辑,支持更智能的配置选择。
- 调整默认设备获取逻辑,考虑环境中安装的Paddle框架对计算设备的实际支持情况,使程序行为更符合直觉。
- 新增PP-OCRv5的Android端示例,详情。
- 模型默认下载源从
-
Bug修复:
- 修复PP-StructureV3部分CLI参数不生效的问题。
- 修复部分情况下
export_paddlex_config_to_yaml
无法正常工作的问题。 - 修复save_path实际行为与文档描述不符的问题。
- 修复基础服务化部署在使用MKL-DNN时可能出现的多线程错误。
- 修复Latex-OCR模型的图像预处理的通道顺序错误。
- 修复文本识别模块保存可视化图像的通道顺序错误。
- 修复PP-StructureV3中表格可视化结果通道顺序错误。
- 修复PP-StructureV3产线中极特殊的情况下,计算overlap_ratio时,变量溢出问题。
-
文档优化:
- 更新文档中对
enable_mkldnn
参数的说明,使其更准确地描述程序的实际行为。 - 修复文档中对
lang
和ocr_version
参数描述的错误。 - 补充通过CLI导出产线配置文件的说明。
- 修复PP-OCRv5性能数据表格中的列缺失问题。
- 润色PP-StructureV3在不同配置下的benchmark指标。
- 更新文档中对
-
其他:
- 放松numpy、pandas等依赖的版本限制,恢复对Python 3.12的支持。
v3.0.1
- 优化部分模型和模型配置:
- 更新 PP-OCRv5默认模型配置,检测和识别均由mobile改为server模型。为了改善大多数的场景默认效果,配置中的参数
limit_side_len
由736改为64 - 新增文本行方向分类
PP-LCNet_x1_0_textline_ori
模型,精度99.42%,OCR、PP-StructureV3、PP-ChatOCRv4产线的默认文本行方向分类器改为该模型 - 优化文本行方向分类
PP-LCNet_x0_25_textline_ori
模型,精度提升3.3个百分点,当前精度98.85%
- 更新 PP-OCRv5默认模型配置,检测和识别均由mobile改为server模型。为了改善大多数的场景默认效果,配置中的参数
- 优化3.0.0版本部分存在的问题
- 优化CLI使用体验: 当使用PaddleOCR CLI不传入任何参数时,给出用法提示。
- 新增参数: PP-ChatOCRv3、PP-StructureV3支持
use_textline_orientation
参数。 - CPU推理速度优化: 所有产线CPU推理默认开启MKL-DNN。
- C++推理支持: PP-OCRv5的检测和识别串联部分支持C++推理
- 修复3.0.0版本部分存在的问题
- 修复由于公式识别、表格识别模型无法使用MKL-DNN导致PP-StructureV3在部分cpu推理报错的问题
- 修复在部分GPU环境中推理报
FatalError: Process abort signal is detected by the operating system
错误的问题 - 修复部分Python3.8环境的type hint的问题
- 修复
PPStructureV3.concatenate_markdown_pages
方法不存在的问题。 - 修复实例化
paddleocr.PaddleOCR
时同时指定lang
和model_name
时model_name
不生效的问题。
v3.0.0
-
发布全场景文字识别模型PP-OCRv5: 单模型支持五种文字类型和复杂手写体识别;整体识别精度相比上一代提升13个百分点。
-
发布通用文档解析方案PP-StructureV3: 支持多场景、多版式 PDF 高精度解析,在公开评测集中领先众多开源和闭源方案。
-
发布智能文档理解方案PP-ChatOCRv4: 原生支持文心大模型4.5 Turbo,精度相比上一代提升15个百分点。
-
重构部署能力,统一推理接口: PaddleOCR 3.0 融合了飞桨 PaddleX3.0 工具的底层能力,全面升级推理、部署模块,优化 2.x 版本的设计,统一并优化了 Python API 和命令行接口(CLI)。部署能力现覆盖高性能推理、服务化部署及端侧部署三大场景。
-
适配飞桨框架 3.0,优化训练流程: 新版本已兼容飞桨 3.0 的 CINN 编译器等最新特性,静态图模型存储文件名由
xxx.pdmodel
改为xxx.json
。 -
统一模型名称: 对PaddleOCR3.0支持的模型命名体系进行了更新,采用更规范、统一的命名规则,为后续迭代与维护奠定基础。
v2.10.0
What's Changed
- update docs by @cuicheng01 in #14031
- update paddle2onnx doc by @inisis in #14038
- fix gpu memory growth by @zhangyubo0722 in #14037
- updata en docs by @dyning in #14036
- fix nan in PP-OCRv4 by @wangna11BD in #14043
- update a live promotion by @Zhiiixin in #14042
- reset latex ocr by @zhangyubo0722 in #14046
- Update pyproject.toml for add dependency by @Liyulingyue in #14058
- Fix
CMAKE_CXX_FLAGS
optimize flag by @Hirozy in #14059 - fix isnan_v2 is not supported in paddle2onnx by @GreatV in #14060
- ci: Fixed docs multi version error by @SWHL in #14048
- fix hyperlinks by @AmberC0209 in #14073
- fix nan in ppocrv4 for benchmark by @wangna11BD in #14072
- ci: Support seperate update of branch docs by @SWHL in #14079
- ci: fixed main doc ci by @SWHL in #14084
- Allow
create_predictor
function to accept array of ONNX Execution Providers by @Salmondx in #14078 - docs: update quickstart by @SWHL in #14108
- docs: add command line usage documentation of quickstart page by @SWHL in #14110
- docs: add installation documentation of paddle by @SWHL in #14117
- docs: fixed typo by @SWHL in #14118
- image without any text will show a warning by @GreatV in #14132
- doc: remove duplicate paragraphs by @GreatV in #14133
- docs: update paddle2onnx documentations by @GreatV in #14144
- [third-party] Fix the issue of inference errors with KIE mode in ONNX format by @Alex37882388 in #14138
- update tests PR CI github action by @GreatV in #14159
- 移除doc目录下文档,保留fonts和doc_i18n两个目录 by @SWHL in #14156
- 移除ppstructure目录下旧有文档 by @SWHL in #14161
- docs: fixed error image link (#14164) by @SWHL in #14165
- 更新i18n的首页内容到新站点 by @SWHL in #14166
- docs: fix i18n languange code error by @SWHL in #14167
- docs: fix syntax error by @SWHL in #14168
- docs: update i18n docs by @GreatV in #14169
- upgrade to numpy 2.0 and remove imgaug by @GreatV in #13937
- docs: format multi languange docs home page by @SWHL in #14170
- docs: add the missing image by @GreatV in #14180
- Create close_inactive_issues.yaml by @GreatV in #14183
- update hpi config by @zhangyubo0722 in #14076
- Update close_inactive_issues.yaml by @GreatV in #14189
- Update close_inactive_issues.yaml by @GreatV in #14190
- remove lock inactive issues by @GreatV in #14192
- fix benchmark bug by @changdazhou in #14194
- pre-commit autoupdate && pre-commit run --all-files by @cclauss in #14201
- Remove Python 2 compatibility dependency six by @cclauss in #14202
- update quick_start by @AmberC0209 in #14200
- rename train result by @zhangyubo0722 in #14217
- fix benchmark bug by @changdazhou in #14235
- fix benchmark det_r50_vd_pse_v2_0 train error by @GreatV in #14239
- update infer/utility.py to support json format model by @GreatV in #14233
- Support inference for GCU by @EnflameGCU in #14142
- update docs by @AmberC0209 in #14230
- fix: Title text partially missing issue in
recovery_to_markdown.py
by @Coobiw in #14216 - change_support list by @liuhongen1234567 in #14293
- support latexocr static train by @liuhongen1234567 in #14297
- docs: Fix chinese image being displayed on the english readme page by @khanfarhan10 in #14299
- docs: update quick_start and recognition doc by @GreatV in #14302
- add d2s_train_image_shape for static train by @liuhongen1234567 in #14312
- update install command by @AmberC0209 in #14314
- fix: unable to export images without text to docx format by @GreatV in #14306
- paddle.shape return int64 tensor by @wanghuancoder in #14318
- docs: add warning of Apolications part by @SWHL in #14338
- Update algorithm_rec_cppd.md by @GreatV in #14366
- Update 印章弯曲文字识别.md by @BUJIQI in #14368
- update_det_static by @Sunting78 in #14372
- fix:calcute the left_center_pt and right_center_pt from min_area_quad by @fangfangzk in #14363
- add unimernet model by @liuhongen1234567 in #14357
- fix shape64 by @wanghuancoder in #14376
- add slanext models by @liu-jiaxuan in #14374
- fix: replace
rec_image_shape
when manually set by @JesuisTong in #14371 - repair type bug for ppocrv3 by @liuhongen1234567 in #14397
- [WIP]support export with pir and no pir by @zhangyubo0722 in #14379
- Add pp formulanet by @liuhongen1234567 in #14429
- repair formula bug when export by @liuhongen1234567 in #14442
- modify export with pir by @zhangyubo0722 in #14441
- update SLANet inference weights for adapt to paddle3.0b2 by @cuicheng01 in #14467
- fix_server_v4_det_output by @Sunting78 in #14472
- fix label_dict save bug by @zhangyubo0722 in #14273
- add ppocrv4_doc dict by @liuhongen1234567 in #14499
- fix latex_ocr inference by @vivienfanghuagood in #14498
- fix SLANeXt export bug by @liu-jiaxuan in #14512
- add version control for export and modify hpi config by @zhangyubo0722 in #14513
- fix slanext export bug by @liu-jiaxuan in #14519
- repair bug in latexocr cpu infer and typo in bleu score by @liuhongen1234567 in #14552
- Fix language error and spelling mistakes in the documentation by @timminator in #14571
- Keep GitHub Actions up to date with GitHub's Dependabot by @cclauss in #14569
- repair train bug in multi gpu by @liuhongen1234567 in #14576
- build(deps): bump the github-actions group with 3 updates by @dependabot in #14573
- remove max inplace grad by @phlrain in #14596
- build(deps): bump pypa/gh-action-pypi-publish from 1.12.3 to 1.12.4 in the github-actions group by @dependabot in #14603
- CPP: emplace_back() replaces many push_back()...to improve performance by @nonwill in #14610
- Add Thai character dictionary for OCR recognition by @Thanajade in #14620
- CPP: Make functions mostly noexcept to improve runtime performance by @nonwill in #14613
- CPP: tidied file header includes by @nonwill in #14621
*...
v2.9.1
v2.9.0
What's Changed
- fix: table recognition content is not escaped properly by @GreatV in #13277
- fix bug when layout_predictor is None by @GreatV in #13279
- add url in pyproject, and update version number by @GreatV in #13274
- unifying data types in the SLAHead by @GreatV in #13276
- add PaddleX info to README by @TingquanGao in #13308
- Update expired link in quickstart.md by @ZeddYu in #13253
- optimize func: get_infer_gpuid by @GreatV in #13275
- fix slice op parameters not being passed correctly by @GreatV in #13319
- Solve ModuleNotFoundError: No module named 'tools.infer' by @myhloli in #13348
- Add hardware docs by @nepeplwu in #13329
- add paddlex link by @TingquanGao in #13316
- Fix the dictionary bug in tablerec inference by @Topdu in #13362
- add bn_dict.txt by @taeefnajib in #13373
- add missing docstring in paddleocr.py using copilot by @jzhang533 in #13344
- line 445 program.py by @ManikSinghSarmaal in #13389
- fix layout recovery import error by @GreatV in #13434
- Latexocr paddle by @liuhongen1234567 in #13401
- [doc]add amp train notes for detection train by @andyjiang1116 in #13481
- remove some of the less common dependencies by @GreatV in #13461
- docs: Add a new document site by @SWHL in #13375
- Update mkdocs.yml by @GreatV in #13487
- chore: Update issue template by @SWHL in #13505
- chore: Update bug report template by @SWHL in #13508
- Fix cpp_infer "--enable_mkldnn=false" not effective by @hiroi-sora in #13539
- Enable Main Branch Support for PaddleX by @zhangyubo0722 in #13523
- docs: Update README by @SWHL in #13543
- docs: Update README_en by @SWHL in #13545
- 修改错别字 by @MonkeyBrothers in #13544
- docs: Remove old applications docs by @SWHL in #13551
- fix: 'numpy' has no attribute 'astype' by @laolitou in #13554
- add latexocr docs and fix some typos by @GreatV in #13532
- chore(Issue_template): Add validation of Environment and MPE code by @SWHL in #13559
- skip text files when running test ci by @GreatV in #13561
- fix bug for paddlepaddle3.0 by @changdazhou in #13568
- docs: Update the pdf file path in the operation demonstration by @Gmgge in #13575
- support benchmark for paddlepaddle3.0 by @changdazhou in #13574
- improve the reading experience of some documents by @GreatV in #13562
- update dive into OCR book link by @GreatV in #13581
- docs: Shorten the image path and remove dupliate images by @SWHL in #13585
- docs: Fix docs errors by @SWHL in #13588
- skip text files when running test ci on push by @GreatV in #13582
- docs: Add android_demo docs by @SWHL in #13601
- fix download bug when use multi gpus by @changdazhou in #13610
- disable automatic checks for new version albumentations by @GreatV in #13583
- 修复LaTeXOCR 在paddleX中的一些问题 by @liuhongen1234567 in #13646
- update docs and remove out-of-date event by @GreatV in #13660
- setuptools 72.2.0 result in that MANIFEST.in is invalid by @TingquanGao in #13670
- update docs and remove old docs by @GreatV in #13662
- update docs and fix markdown render error by @GreatV in #13678
- chore: Update issue template by @SWHL in #13679
- cache Python dependencies and PaddleOCR files by @GreatV in #13682
- Add files via upload by @lingskr in #13685
- Update ch_PP-OCRv4_rec_distillation.yml by @jiqirenfeile in #13692
- Remove channel links from docs by @zhangyubo0722 in #13674
- Code Style Unification by @zhangyubo0722 in #13697
- docs: Remove doc/datasets directory and fix docs/datasets documents by @SWHL in #13700
- Provides Vietnamese dictionary and corpus by @lingskr in #13698
- Modify the data processing part of LaTeXOCR and replace the absolute path by a relative path by @liuhongen1234567 in #13702
- use setuptools-scm extracts PaddleOCR versions by @GreatV in #13716
- Repair the bug in the inference script for LaTeX OCR by @liuhongen1234567 in #13750
- fixed: mkldnn -> onednn by @achieve-dream1221 in #13757
- remove unused enumerate by @Kayzwer in #13760
- update applications/overview.md by @GreatV in #13763
- Fix setting of make border epoch by @Sunting78 in #13783
- Fix doc link in docs by @Topdu in #13792
- Add support for Hebrew Language and Alphabet by @johnlockejrr in #13797
- Add Syriac script support by @johnlockejrr in #13800
- update KIE docs by @GreatV in #13799
- fix the CI running errors in tests. by @GreatV in #13846
- Fix pir dy2st train by @0x45f in #13853
- fix SRN algorithm infer error by @GreatV in #13851
- update pretrain for benchmark by @changdazhou in #13820
- fix bugs for SLANet infer by @liu-jiaxuan in #13861
- fix version by @TingquanGao in #13895
- set --image_dir to be required by @GreatV in #13896
- support export after save model by @zhangyubo0722 in #13844
- fix hubserving run error by @GreatV in #13918
- fix lateocr bug by @zhangyubo0722 in #13920
- 1.在ppstructure管道中添加latex_ocr公式识别功能;2.添加pdf转markdown文件功能 by @ztyf-lq in #13868
- updata 2.9, adding new models and supporting all-in-one full developm… by @dyning in #13932
- updata 2.9, adding new models and supporting all-in-one full developm… by @dyning in #13933
- adding new models and supporting all-in-one full development tools by @dyning in #13934
- Update quick_start.md with html, not md by @dyning in #13935
- Update quick_start.md for paddlex by @dyning in #13936
- pdf to markdown document by @ztyf-lq in #13942
- Update algorithm_rec_vitstr_en.md by @GreatV in #13947
- update a live promotion by @Zhiiixin in #13954
- ci: Support multi version docs by @SWHL in #13957
- docs: Add tip of old documents by @SWHL in #13960
- ci: Fix mike error by @SWHL in #13962
- Update README.md, fixed broken quick start link by @Kozmosa in #13965
- fix broken link by @GreatV in #13970
- [NPU] cherrypick13983 by @Wang...