Skip to content
Discussion options

You must be logged in to vote

This is coded like so in the PDF. For whatever weird reason. Maybe to take some invisible notes.
You would have seen the crazy coordinates if you had looked at the words' coordinates.
You can only heal this by specifying the page rectangle as the clip: page.get_text("words", clip=page.rect).
Here is the PDF source code as a proof:

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@kvrameshreddy
Comment options

Answer selected by kvrameshreddy
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1681 on April 20, 2022 16:23.