docx文档解析报错:RecursionError: maximum recursion depth exceeded #15667
-
环境:python3.11 Traceback (most recent call last):
File "/root/output/test.py", line 8, in <module>
output = pipeline.predict(input_file)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddleocr/_pipelines/pp_structurev3.py", line 207, in predict
return list(
^^^^^
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/pipelines/_parallel.py", line 129, in predict
yield from self._pipeline.predict(
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/pipelines/layout_parsing/pipeline_v2.py", line 1041, in predict
for batch_data in self.batch_sampler(input):
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/base_batch_sampler.py", line 80, in __call__
yield from self.sample(input)
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/image_batch_sampler.py", line 115, in sample
yield from self.sample(file_list)
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/image_batch_sampler.py", line 115, in sample
yield from self.sample(file_list)
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/image_batch_sampler.py", line 115, in sample
yield from self.sample(file_list)
[Previous line repeated 989 more times]
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/image_batch_sampler.py", line 81, in sample
batch = ImgBatch()
^^^^^^^^^^
File "/root/base_paddleOCR/lib64/python3.11/site-packages/paddlex/inference/common/batch_sampler/image_batch_sampler.py", line 29, in __init__
super().__init__()
RecursionError: maximum recursion depth exceeded |
Beta Was this translation helpful? Give feedback.
Answered by
liuhongen1234567
Jun 10, 2025
Replies: 2 comments 2 replies
-
您好,当前版本paddleocr暂不支持传入docx,只支持pdf格式,可以另存为pdf试试 |
Beta Was this translation helpful? Give feedback.
1 reply
Answer selected by
SWHL
-
呃,是什么场景需要对docx文档进行解析呢?是需要把word文档转为markdown格式吗? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
您好,当前版本paddleocr暂不支持传入docx,只支持pdf格式,可以另存为pdf试试