In your example, the small text-like marks are vector graphics, drawn to look like numbers or symbols from the ZapfDingbats font.
PyMuPDF can extract vector graphics with page.get_drawings() (and redraw them elsewhere), but that is probably not the solution you are looking for. Presumably you need an image that includes those symbols.

To confirm that this is your situation, do this:

page = doc[0]
page.get_images()  # the image has xref 808
[(808, 0, 697, 481, 8, 'DeviceGray', '', 'X17', 'FlateDecode')]
imgbbox = page.get_image_rects(808)
imgbbox
[Rect(58.20399856567383, 110.16300964355469, 392.99700927734375, 341.1340026855469)]

# check if there are dr…
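
As a separate illustration (not the original author's code), here is a minimal sketch of the idea stated above: instead of extracting the embedded image, render the page area it covers, so that any vector graphics drawn on top are included. The helper names `rects_intersect`, `drawings_over_image` and `render_image_area`, and the 150 dpi default, are assumptions made for this example. The sketch only relies on `page.get_image_rects()`, `page.get_drawings()` and `page.get_pixmap()`, and it assumes a PyMuPDF version recent enough to accept the `dpi` argument.

```python
def rects_intersect(a, b):
    """Return True if two (x0, y0, x1, y1) rectangles share a non-empty area."""
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    return max(ax0, bx0) < min(ax1, bx1) and max(ay0, by0) < min(ay1, by1)


def drawings_over_image(drawings, image_rect):
    """Keep the drawing paths whose bounding box overlaps the image rectangle.

    `drawings` is a list of dicts with a "rect" entry, which is the shape
    that page.get_drawings() returns.
    """
    return [d for d in drawings if rects_intersect(d["rect"], image_rect)]


def render_image_area(page, xref, dpi=150):
    """Render the page region covered by an image, overlays included.

    Hypothetical helper: `page` is a PyMuPDF page, `xref` the image xref.
    Returns the rendered pixmap and the drawings found on top of the image.
    """
    image_rect = page.get_image_rects(xref)[0]  # first placement on the page
    overlays = drawings_over_image(page.get_drawings(), image_rect)
    pix = page.get_pixmap(clip=image_rect, dpi=dpi)  # renders everything in clip
    return pix, overlays
```

With the page from the session above, `pix, overlays = render_image_area(page, 808)` followed by `pix.save("image-with-symbols.png")` (a made-up file name) would write a picture of that area. A non-empty `overlays` list indicates that vector graphics sit on top of the image.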

Answer selected by arashrad