Skip to content
Discussion options

You must be logged in to vote

This is a frequent and normal thing to happen. Text extraction extracts the text in the same sequence as stored in the file. Many creators do not store content in reading sequence.

You must establish the reading sequence yourself. There is the sort parameter that often helps - please read the documentation.
In other cases you must use your own code to do that by extracting text including coordinates, like get_text("dict").
But using get_text("words") is a good start, if you sort them and concatenate again with a space.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@hifiveszu
Comment options

@JorjMcKie
Comment options

Answer selected by hifiveszu
@hifiveszu
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants