Extracting images from PDF #1610
Replies: 2 comments 9 replies
-
Yeah, old baggage is kind of a curse at times. Some help is [...] So using the [...]
-
The [...] Nonetheless, inline images (those with xref=0) are mostly smaller than 4 KB (the PDF recommendation is that they should be). Typically they are used for little eye-catchers, special bullet points and the like. But I have also seen other examples.
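To make the xref=0 point concrete, here is a minimal sketch that splits image entries into inline and referenced ones and flags inline images above the 4 KB guideline. It assumes the entries are dicts shaped like those returned by PyMuPDF's page.get_image_info(xrefs=True), with "xref" and "size" keys; the helper name split_inline and the sample data are made up for illustration.

```python
def split_inline(infos, limit=4096):
    """Split image-info dicts into (inline, referenced, oversized_inline).

    Assumes each dict carries an "xref" key (0 for inline images) and a
    "size" key (bytes), as in the output of page.get_image_info(xrefs=True).
    """
    inline = [d for d in infos if d.get("xref", 0) == 0]
    referenced = [d for d in infos if d.get("xref", 0) != 0]
    # Inline images larger than the recommended limit are legal but unusual.
    oversized = [d for d in inline if d.get("size", 0) > limit]
    return inline, referenced, oversized


# Hypothetical sample data; in practice the list would come from
# page.get_image_info(xrefs=True) on an opened document.
sample = [
    {"xref": 0, "size": 100},
    {"xref": 5, "size": 9000},
    {"xref": 0, "size": 5000},
]
inline, referenced, oversized = split_inline(sample)
print(len(inline), len(referenced), len(oversized))
```

Only referenced images (xref > 0) can be pulled out by xref, so this split is mainly useful for deciding which entries need separate handling.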
-
Hello @JorjMcKie,
I am looking for a way to extract the images that are actually visible in a PDF and to find each one's Rect, i.e. its location plus height and width.
I came across this script to extract images. The method used is doc.get_page_images(pno). However, as stated in this comment, this method may list images that are not shown in the PDF.
I then used this script.
However, I noticed many duplicated entries (based on the Rect value) in the output.
Any ideas to achieve the desired outcome would help.
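One possible approach, offered as a sketch rather than a confirmed fix: ask each page for the images it actually displays via page.get_image_info(xrefs=True), then drop entries that repeat the same xref and bbox. The helper names dedupe_by_bbox and shown_images and the rounding tolerance are assumptions made for illustration, not part of PyMuPDF or of the scripts mentioned above.

```python
def dedupe_by_bbox(infos, ndigits=1):
    """Keep the first entry for each (xref, rounded bbox) pair."""
    seen = set()
    unique = []
    for info in infos:
        # Round coordinates so tiny float differences do not defeat the check.
        key = (info.get("xref", 0), tuple(round(v, ndigits) for v in info["bbox"]))
        if key not in seen:
            seen.add(key)
            unique.append(info)
    return unique


def shown_images(path):
    """List displayed images per page with their rectangle and dimensions."""
    import fitz  # PyMuPDF; imported here so dedupe_by_bbox works without it

    results = []
    with fitz.open(path) as doc:
        for page in doc:
            # get_image_info reports images the page displays, including
            # inline ones (xref == 0).
            for info in dedupe_by_bbox(page.get_image_info(xrefs=True)):
                x0, y0, x1, y1 = info["bbox"]
                results.append({
                    "page": page.number,
                    "xref": info["xref"],
                    "bbox": info["bbox"],
                    "width": x1 - x0,
                    "height": y1 - y0,
                })
    return results
```

Keying on both xref and bbox keeps two different images that happen to share a rectangle, while collapsing the same image reported twice at the same spot. If only the Rect should count, dropping xref from the key would do that.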