PaddleOCR/latest/version3.x/module_usage/text_recognition #15543

2025-06-03T05:34:46Z

giscus[bot]
bot Jun 3, 2025

PaddleOCR/latest/version3.x/module_usage/text_recognition

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

https://paddlepaddle.github.io/PaddleOCR/latest/version3.x/module_usage/text_recognition.html

SHOUshou0426 · 2025-06-03T05:34:47Z

SHOUshou0426
Jun 3, 2025 — with giscus

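Hello, I have a question: the JSON printed by the PaddleOCR sample code does not include the detection model's per-box confidence score (similar to the conf value of a detect model); it only contains the rec scores.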

0 replies

SHOUshou0426 · 2025-06-03T06:06:11Z

SHOUshou0426
Jun 3, 2025

The output above does not include the det (detection) model's score for each box.

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

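Thank you for the feedback. This is indeed an omission in the current version, and we will support this parameter in the next wheel release. Alternatively, you can use the text detection module directly to obtain it; there, dt_scores is the detection box score.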
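A minimal sketch of the workaround suggested above. It assumes the standalone `TextDetection` module from `paddleocr` 3.x and assumes each prediction result can be indexed like a dict with parallel `dt_polys` and `dt_scores` entries (the `dt_scores` key comes from this thread). The model name, the image path, and the helper names `pair_boxes_with_scores` and `detect_with_scores` are illustrative, not part of the library.

```python
def pair_boxes_with_scores(det_result):
    """Return (polygon, score) pairs from one detection result.

    Assumes `det_result` is dict-like with parallel `dt_polys` and
    `dt_scores` sequences.
    """
    polys = list(det_result["dt_polys"])
    scores = list(det_result["dt_scores"])
    if len(polys) != len(scores):
        raise ValueError("dt_polys and dt_scores must have the same length")
    return list(zip(polys, scores))


def detect_with_scores(image_path, model_name="PP-OCRv5_server_det"):
    """Run the standalone detection module and collect per-box scores.

    The import stays inside the function so the helper above can be
    used without paddleocr installed.
    """
    from paddleocr import TextDetection  # assumes paddleocr 3.x

    model = TextDetection(model_name=model_name)
    pairs = []
    for res in model.predict(image_path, batch_size=1):
        pairs.extend(pair_boxes_with_scores(res))
    return pairs
```

Calling `detect_with_scores("your_image.png")` would then give one `(polygon, score)` pair per detected box, in the order the detection module returns them.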

SHOUshou0426 · 2025-06-03T06:39:06Z

SHOUshou0426
Jun 3, 2025

How can I add it myself? I looked at ocr.py and could not find where the JSON is returned; everything is inherited.

0 replies

SHOUshou0426 · 2025-06-03T07:05:56Z

SHOUshou0426
Jun 3, 2025

Hello, I see that pipeline.py stores some of the rec information. Could I simply add the det information there as well?

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

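Probably not. The detection boxes also go through a sorting step, so if you add the det information directly, the recognized boxes and the scores may not line up.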
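To illustrate the alignment concern, here is a simplified sketch of sorting boxes while applying the same permutation to their scores. The sort key (top-left point, top-to-bottom then left-to-right) is a stand-in chosen for illustration; it is not the pipeline's actual sorting logic, which this thread describes as more complex.

```python
def sort_boxes_with_scores(boxes, scores):
    """Sort boxes top-to-bottom, then left-to-right, keeping scores aligned.

    Each box is a list of (x, y) points; the first point is the sort key.
    """
    if len(boxes) != len(scores):
        raise ValueError("boxes and scores must have the same length")
    # Sort indices rather than the boxes themselves, so the same
    # permutation can be applied to every parallel list.
    order = sorted(range(len(boxes)), key=lambda i: (boxes[i][0][1], boxes[i][0][0]))
    sorted_boxes = [boxes[i] for i in order]
    sorted_scores = [scores[i] for i in order]
    return sorted_boxes, sorted_scores, order


boxes = [
    [(50, 40), (90, 40), (90, 60), (50, 60)],  # lower text line
    [(10, 10), (40, 10), (40, 30), (10, 30)],  # upper text line
]
scores = [0.62, 0.97]
sorted_boxes, sorted_scores, order = sort_boxes_with_scores(boxes, scores)
print(order)          # [1, 0]
print(sorted_scores)  # [0.97, 0.62]
```

If only the boxes were reordered and the scores were appended afterwards in their original order, the 0.62 score would end up attached to the upper text line, which is the mismatch the reply warns about.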

SHOUshou0426 · 2025-06-03T07:27:45Z

SHOUshou0426
Jun 3, 2025

Isn't the dt_scores I read directly already the sorted result? I printed some values and they appear to be the final result: det_scores_list = [item["dt_scores"] for item in det_results]

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

I don't think so; a further sort is performed at this point.

SHOUshou0426 · 2025-06-03T07:28:55Z

SHOUshou0426
Jun 3, 2025

From what I can see, the code below is already the final det result; everything after it only stores rec results into result.

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

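What exactly do you need the detection score for? If it is only for filtering, you can use the text_det_box_thresh parameter to drop bounding boxes below the threshold. The intermediate sorting process is fairly complex, and it is not easy to recover the sorted indices.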

SHOUshou0426 · 2025-06-03T07:45:44Z

SHOUshou0426
Jun 3, 2025

These are the values after my change. So far they look correct; please verify on your side by printing them alongside the output of the standalone detection sample code. However, I found a problem while checking whether the det conf values are consistent: the detection results from the det sample code differ from my end-to-end results, even though the models are the same. Below are the inference results from the det sample code.

0 replies

SHOUshou0426 · 2025-06-03T07:54:13Z

SHOUshou0426
Jun 3, 2025

Mainly to see what conf values the false-positive boxes have, so that I can choose a reasonable thresh for filtering.
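A small sketch of how per-box scores could be used to pick such a threshold: count how many boxes would survive at several candidate values. The scores below are made up for illustration, and the "keep when score >= threshold" comparison is an assumption about how the filter behaves.

```python
def survivors_per_threshold(scores, thresholds):
    """Count how many detection boxes would remain at each threshold.

    A box is kept when its score is greater than or equal to the
    threshold (assumed comparison).
    """
    return {t: sum(1 for s in scores if s >= t) for t in thresholds}


# Hypothetical dt_scores for one image; the two low values stand in for
# false-positive boxes.
dt_scores = [0.98, 0.95, 0.91, 0.64, 0.58]
table = survivors_per_threshold(dt_scores, [0.5, 0.6, 0.7])
for thresh, kept in table.items():
    print(f"thresh={thresh:.1f} keeps {kept} of {len(dt_scores)} boxes")
```

A value chosen this way could then be passed as text_det_box_thresh, the parameter mentioned earlier in the thread.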

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

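Is there perhaps a problem with what you posted? I can only see the text content on my side. It is normal for the results not to match the standalone detection output: the end-to-end pipeline and OCR apply some additional processing to the bounding boxes.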

SHOUshou0426 · 2025-06-03T08:34:46Z

SHOUshou0426
Jun 3, 2025

OK. I'd like to ask about using paddleslim to int8-quantize paddleocr models. GitHub describes it as quantization with no accuracy loss; can the sample code still be used for inference after quantization?

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

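It should be possible, but a model exported by paddleslim needs some modification. Models exported from that repository probably do not conform to the current paddleocr 3.0 conventions; for example, you need to add inference.yaml manually.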

SHOUshou0426 · 2025-06-03T08:49:51Z

SHOUshou0426
Jun 3, 2025

Does paddleslim guarantee int8 quantization with no accuracy loss, that is, the same accuracy at a higher speed?

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

That repository is somewhat old and may not work with the latest models. You can give it a try on your side.

SHOUshou0426 · 2025-06-03T08:59:34Z

SHOUshou0426
Jun 3, 2025

My goal is to improve the overall inference speed of paddleocr. I am already using trt and the most lightweight network. The paddleslim repository is somewhat old and probably does not fit the v3 models and the new inference framework. One last question: when the detection model produces false detections, I would like to filter boxes by area before sending them to the rec model. Where would that go? I don't see it in the pipeline.

1 reply

liuhongen1234567 Jun 3, 2025
Collaborator

Hello, DBPostProcess in /usr/local/lib/python3.10/dist-packages/paddlex/inference/models/text_detection/processors.py has a detection-box filtering step; I am not sure whether that is what you need.
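For the area-based filtering asked about above, a minimal standalone sketch is shown here. It is not taken from DBPostProcess or any PaddleX code; it computes each polygon's area with the shoelace formula and keeps only boxes above a hypothetical `min_area`, which could be applied to detected polygons before they are passed to recognition.

```python
def polygon_area(points):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def filter_by_area(polys, min_area):
    """Keep polygons whose area is at least `min_area` (pixels squared)."""
    return [p for p in polys if polygon_area(p) >= min_area]


polys = [
    [(0, 0), (100, 0), (100, 20), (0, 20)],  # 100 x 20 text line
    [(5, 5), (9, 5), (9, 8), (5, 8)],        # 4 x 3 speck
]
kept = filter_by_area(polys, min_area=50)
print(len(kept))  # 1
```

The `min_area` value depends on image resolution and font size, so it would need to be tuned on real false positives.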

SHOUshou0426 · 2025-06-03T09:28:43Z

SHOUshou0426
Jun 3, 2025

OK, thank you.

0 replies

xushuwen2001 · 2025-06-20T07:45:30Z

xushuwen2001
Jun 20, 2025 — with giscus

Hello, is there an open-source pretrained recognition model for the rec_mv3_none_none_ctc.yml config?

1 reply

liuhongen1234567 Jun 20, 2025
Collaborator

Hello, models from the paddleocr 2.0 era can be found under the https://github.com/PaddlePaddle/PaddleOCR/blob/4602329be9432db4328f28a3e16a04a9eb8e823e/docs/version2.x/ directory. For example, the rec_mv3_none_none_ctc.yml you are looking for is covered in this document: https://github.com/PaddlePaddle/PaddleOCR/blob/4602329be9432db4328f28a3e16a04a9eb8e823e/docs/version2.x/algorithm/text_recognition/algorithm_rec_rosetta.md

chenmo7760 · 2025-06-22T12:00:05Z

chenmo7760
Jun 22, 2025 — with giscus

Hello, how can I accurately measure the time taken by text detection, classification, and recognition separately? Thanks!

0 replies

zhourongming · 2025-08-01T08:33:52Z

zhourongming
Aug 1, 2025 — with giscus

Which layers does PP-OCRv4_server_rec contain? Is it possible to freeze specific layers during training, and if so, how is that configured?

0 replies

SHOUshou0426 · 2025-08-05T07:04:03Z

SHOUshou0426
Aug 5, 2025 — with giscus

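Hello, I trained an ocrv4-rec model using en_dict, whose keys contain no symbols at all. Why does the dict in the exported inference.yml contain many symbols? When I removed the symbols, the model's recognition results became completely wrong. How can I solve this? I want to make sure the final output contains no punctuation.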

0 replies

ctgushiwei · 2025-08-11T05:39:37Z

ctgushiwei
Aug 11, 2025 — with giscus

Fine-tuning the rec model with the example from the custom development section raises an error. Environment:
paddle2onnx 2.0.1
paddleocr 3.1.0
paddlepaddle-gpu 3.1.0
paddlex 3.1.3

Traceback (most recent call last):
  File "/data/ppocr/PaddleOCR-main/./tools/train.py", line 272, in <module>
    main(config, device, logger, vdl_writer, seed)
  File "/data/ppocr/PaddleOCR-main/./tools/train.py", line 225, in main
    program.train(
  File "/data/ppocr/PaddleOCR-main/tools/program.py", line 312, in train
    for idx, batch in enumerate(train_dataloader):
  File "/usr/local/lib/python3.10/dist-packages/paddle/io/reader.py", line 621, in __iter__
    return _DataLoaderIterMultiProcess(self)
  File "/usr/local/lib/python3.10/dist-packages/paddle/io/dataloader/dataloader_iter.py", line 431, in __init__
    self._init_workers()
  File "/usr/local/lib/python3.10/dist-packages/paddle/io/dataloader/dataloader_iter.py", line 448, in _init_workers
    self._data_queue = multiprocessing.Queue()
  File "/usr/lib/python3.10/multiprocessing/context.py", line 103, in Queue
    return Queue(maxsize, ctx=self.get_context())
  File "/usr/lib/python3.10/multiprocessing/queues.py", line 43, in __init__
    self._rlock = ctx.Lock()
  File "/usr/lib/python3.10/multiprocessing/context.py", line 68, in Lock
    return Lock(ctx=self.get_context())
  File "/usr/lib/python3.10/multiprocessing/synchronize.py", line 162, in __init__
    SemLock.__init__(self, SEMAPHORE, 1, 1, ctx=ctx)
  File "/usr/lib/python3.10/multiprocessing/synchronize.py", line 57, in __init__
    sl = self._semlock = _multiprocessing.SemLock(
OSError: [Errno 28] No space left on device
Exception ignored in: <function _DataLoaderIterMultiProcess.__del__ at 0x7fc79c14cf70>
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/paddle/io/dataloader/dataloader_iter.py", line 809, in __del__
    self._try_shutdown_all()
  File "/usr/local/lib/python3.10/dist-packages/paddle/io/dataloader/dataloader_iter.py", line 587, in _try_shutdown_all
    if not self._shutdown:
AttributeError: '_DataLoaderIterMultiProcess' object has no attribute '_shutdown'

1 reply

liuhongen1234567 Aug 11, 2025
Collaborator

Hello, judging from the error, it looks like your disk has run out of space.
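A quick way to check free space is sketched below. The traceback fails while creating a multiprocessing semaphore, and on Linux these are commonly backed by shared memory mounted at `/dev/shm`, so a full `/dev/shm` (for example, in a container with a small shared-memory allocation) is a possible cause in addition to a full disk. That is an assumption about the environment, not something confirmed in this thread, and the helper name `free_megabytes` is illustrative.

```python
import shutil


def free_megabytes(path):
    """Return free space at `path` in MiB using shutil.disk_usage."""
    return shutil.disk_usage(path).free / (1024 * 1024)


# "/dev/shm" is an assumption about a typical Linux setup; adjust the
# paths for your environment.
for path in ("/", "/dev/shm"):
    try:
        print(f"{path}: {free_megabytes(path):.0f} MiB free")
    except OSError as exc:
        print(f"{path}: unavailable ({exc})")
```

If either location reports little or no free space, freeing it or reducing the number of data-loader worker processes may be worth trying.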
