"list index out of range" error during text recognition training #12912
Replies: 4 comments
Hi, has your problem been solved?
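I'm hitting the same problem. substr clearly has two elements, and the dictionary contains this character, but it still raises the error. Output and error: data_line.decode('utf-8')= ABC03052_12.jpg 出 substr = data_line.strip('/n').split(self.delimiter)= ['ABC03052_12.jpg', '出']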
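I ran into the same problem and spent a long time debugging it yesterday, so here is my fix. In my case, the .txt files produced by splitting the dataset had blank lines between the entries. In PPOCRLabel/gen_ocr_train_val_test.py, lines 45-56, change train_txt.write("{}\t{}\n".format(image_copy_path, image_label)) to train_txt.write("{}\t{}".format(image_copy_path, image_label)), i.e. drop the \n newline. After re-splitting the dataset, training no longer reports the error.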
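Here is a minimal sketch of why the extra \n can matter. It assumes each image_label still carries the trailing newline of the line it was read from (the thread doesn't show how image_label is built, so treat that as an assumption). write_entries, parse, and the sample entries are hypothetical names for illustration only; parse uses the same strip/split as the dataset code quoted in this thread.

```python
import io


def write_entries(entries, add_newline):
    """Mimic the writer: each entry is a (path, label) pair.

    Assumption: labels already end with a newline character.
    """
    buf = io.StringIO()
    fmt = "{}\t{}\n" if add_newline else "{}\t{}"
    for path, label in entries:
        buf.write(fmt.format(path, label))
    return buf.getvalue()


def parse(line, delimiter="\t"):
    # Same split as the dataset code quoted in this thread.
    return line.strip("\n").split(delimiter)


# Hypothetical entries whose labels already end with a newline.
entries = [("a.jpg", "出\n"), ("b.jpg", "入\n")]

doubled = write_entries(entries, add_newline=True)   # blank line after each entry
single = write_entries(entries, add_newline=False)   # one entry per line

# Lines that would make substr[1] raise IndexError.
bad = [ln for ln in doubled.splitlines() if len(parse(ln)) < 2]
print(len(bad), "unparsable lines with the extra newline")
```

With the extra newline, every entry is followed by an empty line, and an empty line splits into a single-element list, so substr[1] fails. If your labels do not end with a newline, removing \n would instead glue all entries onto one line, so check the generated file before retraining.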
Please provide the following information to quickly locate the problem
[2023/12/09 20:45:37] ppocr ERROR: When parsing line crop_img/table246_crop_11.jpg 6.车辆识别代号/车架号LHGCV1676P9038532
, error happened with msg: Traceback (most recent call last):
File "D:\PaddleOCR-release-2.7\ppocr\data\simple_dataset.py", line 158, in __getitem__
data['ext_data'] = self.get_ext_data()
File "D:\PaddleOCR-release-2.7\ppocr\data\simple_dataset.py", line 124, in get_ext_data
label = substr[1]
IndexError: list index out of range
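While fine-tuning the model, training keeps failing with the error shown above. It seems to indicate a problem with the training data, but I can't tell where the problem is or how to fix it. Could anyone help analyze what is causing this? Many thanks! @haobibo @WenmuZhou @ZeyuChen @bingooo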
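One way to narrow this down is to scan the whole label file rather than only the line printed in the log. The traceback shows the exception being raised inside get_ext_data, so the offending line may not be the one named in the "When parsing line" message (that is my reading of the traceback, not something confirmed in this thread). Below is a rough sketch; find_bad_lines and the sample lines are hypothetical, and the tab delimiter is an assumption that should match the delimiter in your config.

```python
def find_bad_lines(lines, delimiter="\t"):
    """Return (line_number, raw_line) for lines that lack a path or a label."""
    bad = []
    for number, raw in enumerate(lines, start=1):
        fields = raw.strip("\n").split(delimiter)
        # A usable line needs a non-empty path and a non-empty label.
        if len(fields) < 2 or not fields[0] or not fields[1]:
            bad.append((number, raw))
    return bad


# Hypothetical sample covering two common causes.
sample = [
    "crop_img/a.jpg\tlabel one\n",
    "crop_img/b.jpg label two\n",   # space instead of the tab delimiter
    "\n",                           # blank line
    "crop_img/c.jpg\tlabel three\n",
]

bad_lines = find_bad_lines(sample)
for number, raw in bad_lines:
    print(f"line {number}: {raw!r}")
```

For a real label file, pass an open file handle instead of the list, for example `with open(label_path, encoding="utf-8") as f: find_bad_lines(f)`, where label_path is your own training label file.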