Skip to content
Discussion options

You must be logged in to vote

The text matrix Tm in PDF /Contents source is not directly available. Its use is also not intended in PyMuPDF - although you could of course read the /Contents as a stream and dig your way through to all information you want.

The bbox accompanying extracted text always already contains any computations required to deliver the correct value.

So you can use those bboxes (plus the span["origin"] - which is even more important) for inserting the span text.

Replies: 1 comment 11 replies

Comment options

You must be logged in to vote
11 replies
@JorjMcKie
Comment options

@meghanaviyyapu
Comment options

@meghanaviyyapu
Comment options

@JorjMcKie
Comment options

@meghanaviyyapu
Comment options

Answer selected by JorjMcKie
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants