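The same model is deployed twice, once in a notebook and once on the AI service, and both are given the same document. In the notebook, running OCR from the command line through Python works correctly. On the AI server, when it is called from a Python program, some paragraphs in the returned result consist of exclamation marks and a small amount of garbled text. What is the cause? The command has the form mineru -p input.pdf -o output_dir. The backend is vlm-http-client. On NVIDIA everything works. On Ascend, the command-line call in the notebook works, but the call from the Python program returns some abnormal paragraphs containing exclamation marks and other garbled characters.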
Python calling code:
# Use the file stem as the output name
file_name = str(Path(path).stem)
pdf_bytes = read_fn(path)
# Keep only the requested page range
new_pdf_bytes = convert_pdf_bytes_to_bytes_by_pypdfium2(pdf_bytes, start_page_id, end_page_id)
# Strip the leading "vlm-" prefix, so "vlm-http-client" becomes "http-client"
backend = backend[4:]
middle_json, infer_result = vlm_doc_analyze(
    new_pdf_bytes,
    image_writer=None,
    backend=backend,
    server_url=server_url,
    callback_func=bound_callback,
)
Command used in the notebook:
python demo.py -i pdfs -o out4
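
To narrow down which paragraphs are affected, it may help to scan the text output of both runs and compare where the exclamation-mark runs appear. The following is only a rough diagnostic sketch. The helper names `suspicious_ratio` and `find_suspicious_paragraphs` are hypothetical and not part of MinerU, and the 0.5 threshold is an arbitrary assumption.

```python
import re


def suspicious_ratio(paragraph: str) -> float:
    """Fraction of non-whitespace characters that are '!', '!' or U+FFFD."""
    chars = [c for c in paragraph if not c.isspace()]
    if not chars:
        return 0.0
    bad = sum(1 for c in chars if c in "!\uff01\ufffd")
    return bad / len(chars)


def find_suspicious_paragraphs(text: str, threshold: float = 0.5):
    """Return (index, paragraph) pairs whose suspicious ratio meets the threshold.

    Paragraphs are assumed to be separated by blank lines, as in Markdown output.
    The threshold is an arbitrary assumption; tune it for your documents.
    """
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    return [
        (i, p)
        for i, p in enumerate(paragraphs)
        if suspicious_ratio(p) >= threshold
    ]


if __name__ == "__main__":
    sample = "Normal paragraph.\n\n!!!!!!!!!!\n\nAnother one!"
    for index, paragraph in find_suspicious_paragraphs(sample):
        print(index, repr(paragraph))
```

Running this on the output of the notebook run and of the Python-program run for the same PDF would show whether the bad paragraphs always fall on the same pages or regions.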