是否支持doc、docx文件的识别?
#12901
Replies: 1 comment
-
doc docx文件直接解析不就好了 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
G:\conda\envs\dasel\python.exe E:\PaddleOCR\PaddleOCR-release-2.7\second.py
[2023/11/09 21:28:12] ppocr DEBUG: Namespace(alpha=1.0, alphacolor=(255, 255, 255), benchmark=False, beta=1.0, binarize=False, cls_batch_num=6, cls_image_shape='3, 48, 192', cls_model_dir=None, cls_thresh=0.9, cpu_threads=10, crop_res_save_dir='./output', det=True, det_algorithm='DB', det_box_type='quad', det_db_box_thresh=0.6, det_db_score_mode='fast', det_db_thresh=0.3, det_db_unclip_ratio=1.5, det_east_cover_thresh=0.1, det_east_nms_thresh=0.2, det_east_score_thresh=0.8, det_limit_side_len=960, det_limit_type='max', det_model_dir='C:\Users\10362/.paddleocr/whl\det\ch\ch_PP-OCRv4_det_infer', det_pse_box_thresh=0.85, det_pse_min_area=16, det_pse_scale=1, det_pse_thresh=0, det_sast_nms_thresh=0.2, det_sast_score_thresh=0.5, draw_img_save_dir='./inference_results', drop_score=0.5, e2e_algorithm='PGNet', e2e_char_dict_path='./ppocr/utils/ic15_dict.txt', e2e_limit_side_len=768, e2e_limit_type='max', e2e_model_dir=None, e2e_pgnet_mode='fast', e2e_pgnet_score_thresh=0.5, e2e_pgnet_valid_set='totaltext', enable_mkldnn=False, fourier_degree=5, gpu_id=0, gpu_mem=500, help='==SUPPRESS==', image_dir=None, image_orientation=True, invert=False, ir_optim=True, kie_algorithm='LayoutXLM', label_list=['0', '180'], lang='ch', layout=True, layout_dict_path='E:\PaddleOCR\PaddleOCR-release-2.7\ppocr\utils\dict\layout_dict\layout_cdla_dict.txt', layout_model_dir='C:\Users\10362/.paddleocr/whl\layout\picodet_lcnet_x1_0_fgd_layout_cdla_infer', layout_nms_threshold=0.5, layout_score_threshold=0.5, max_batch_size=10, max_text_length=25, merge_no_span_structure=True, min_subgraph_size=15, mode='structure', ocr=True, ocr_order_method=None, ocr_version='PP-OCRv4', output='./output', page_num=0, precision='fp32', process_id=0, re_model_dir=None, rec=True, rec_algorithm='SVTR_LCNet', rec_batch_num=6, rec_char_dict_path='E:\PaddleOCR\PaddleOCR-release-2.7\ppocr\utils\ppocr_keys_v1.txt', rec_image_inverse=True, rec_image_shape='3, 48, 320', rec_model_dir='C:\Users\10362/.paddleocr/whl\rec\ch\ch_PP-OCRv4_rec_infer', recovery=False, save_crop_res=False, save_log_path='./log_output/', scales=[8, 16, 32], ser_dict_path='../train_data/XFUND/class_list_xfun.txt', ser_model_dir=None, show_log=True, sr_batch_num=1, sr_image_shape='3, 32, 128', sr_model_dir=None, structure_version='PP-StructureV2', table=True, table_algorithm='TableAttn', table_char_dict_path='E:\PaddleOCR\PaddleOCR-release-2.7\ppocr\utils\dict\table_structure_dict_ch.txt', table_max_len=488, table_model_dir='C:\Users\10362/.paddleocr/whl\table\ch_ppstructure_mobile_v2.0_SLANet_infer', total_process_num=1, type='ocr', use_angle_cls=False, use_dilation=False, use_gpu=True, use_mp=False, use_npu=False, use_onnx=False, use_pdf2docx_api=False, use_pdserving=False, use_space_char=True, use_tensorrt=False, use_visual_backbone=True, use_xpu=False, vis_font_path='./doc/fonts/simfang.ttf', warmup=False)
2023-11-09 21:28:12 INFO: Loading faiss with AVX2 support.
2023-11-09 21:28:12 INFO: Successfully loaded faiss with AVX2 support.
E1109 21:28:14.377789 10944 analysis_predictor.cc:1716] Allocate too much memory for the GPU memory pool, assigned 8000 MB
E1109 21:28:14.378798 10944 analysis_predictor.cc:1719] Try to shink the value by setting AnalysisConfig::EnableUseGpu(...)
Traceback (most recent call last):
File "E:\PaddleOCR\PaddleOCR-release-2.7\second.py", line 10, in
result = table_engine(img)
File "E:\PaddleOCR\PaddleOCR-release-2.7\paddleocr.py", line 766, in call
res, _ = super().call(
File "E:\PaddleOCR\PaddleOCR-release-2.7\ppstructure\predict_system.py", line 98, in call
cls_res = next(cls_result)
File "G:\conda\envs\dasel\lib\site-packages\paddleclas\paddleclas.py", line 704, in predict_cls
raise ImageTypeError(err)
paddleclas.paddleclas.ImageTypeError: Please input legal image! The type of image supported by PaddleClas are: NumPy.ndarray and string of local path or Ineternet URL
Beta Was this translation helpful? Give feedback.
All reactions