Replies: 1 comment 2 replies
-
|
你的代码看起来是正确的! 请检查以下几点:
ct = vlm_union_make(pdf_info, MakeMode.MM_MD)
print(type(ct)) # 应该是 <class 'str'>
print(ct[:500]) # 查看前 500 字符
from mineru.data.data_reader_writer import FileBasedDataWriter
image_writer = FileBasedDataWriter("./images")
middle_json, infer_result = vlm_doc_analyze(
new_pdf_bytes,
image_writer=image_writer,
backend=backend,
server_url=server_url,
callback_func=bound_callback
)
pdf_info = middle_json["pdf_info"]
ct = vlm_union_make(pdf_info, MakeMode.MM_MD, img_buket_path="images")
print(f"pdf_info 内容: {pdf_info}")如果 参考:官方 demo 和 vlm_union_make 实现 To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
用mineru-api --host 0.0.0.0 --port的启动方式,调用web接口,就是速度慢,文件大一点就超时。而后使用mineru-vllm-server --port 的方式启动,通过vlm客户端调用,生成的结果不是markdown格式,代码如下:
middle_json, infer_result = vlm_doc_analyze(
new_pdf_bytes,
image_writer=None,
backend=backend,
server_url=server_url,
callback_func=bound_callback
)
pdf_info = middle_json["pdf_info"]
ct = vlm_union_make(pdf_info, MakeMode.MM_MD) 。
我查看了下官方代码:
class MakeMode:
MM_MD = 'mm_markdown'
NLP_MD = 'nlp_markdown'
CONTENT_LIST = 'content_list'
CONTENT_LIST_V2 = 'content_list_v2'
四种模式都尝试了,还是到不到md格式数据,怎么解决
Beta Was this translation helpful? Give feedback.
All reactions