I'm currently exploring the capabilities of docling for parsing PDF files and converting them into Markdown. In my tests, I noticed that header images were not recognized as furniture, while the footer was correctly identified. I don't want the header image to appear in the exported Markdown file.
Are there options to:
- customize what is recognized as furniture or body within a document?
- customize which part of the document is exported (e.g. with a bounding box or margins)?

Or are there any other options to exclude the header images from the exported Markdown?
Thank you!