Image extraction #2582
Replies: 4 comments 4 replies
-
The method |
Beta Was this translation helpful? Give feedback.
-
Thanks. May I know how page.get_images_rects(xref) computes the bounding box values? |
Beta Was this translation helpful? Give feedback.
-
Had a look. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the explanation |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
There is one image in the PDF but page.get_images() gives the following output
[(18, 0, 354, 118, 8, 'DeviceRGB', '', 'I0', 'DCTDecode'), (19, 0, 1181, 1772, 8, 'DeviceRGB', '', 'I1', 'DCTDecode'), (18, 0, 354, 118, 8, 'DeviceRGB', '', 'I0', 'DCTDecode')]
Can you let me know how the image file and bbox coordinates can be extracted?
In the internal structure it is represented as Form XObject with /BBox [ 0 0 595.276 76.53543 ] but the bounding box is not getting drawn with these values.
example_010.pdf
Beta Was this translation helpful? Give feedback.
All reactions