DocxConverter images #1497

rlendvai-intuitech · 2025-12-08T12:53:21Z

rlendvai-intuitech
Dec 8, 2025

Hi team,

I am using MarkItDown with an configured llm_client (GPT-4o). Image captioning works perfectly for .pptx files, where images are analyzed and described by the LLM.

However, when converting .docx files, images seem to be ignored or result in raw/truncated data:image... strings without any description.

Is LLM Vision support currently implemented for the DocxConverter (which seems to use mammoth), or is this a known limitation?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DocxConverter images #1497

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

DocxConverter images #1497

Uh oh!

rlendvai-intuitech Dec 8, 2025

Replies: 0 comments

rlendvai-intuitech
Dec 8, 2025