Where exactly comes the block_no from? #2549
Replies: 1 comment
-
All items (text and images) are always extracted in the sequence as they are stored in the pages contents object(s). This is also the sequence by which all PDF viewers process these objects. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
In the documentation is only stated every image gets a block and a block_no assigned. But how about Text Blocks? Where does the deicision come from counts as a block?
Is a page segmentation performed or does block information stem from the time when the pdf processor created the PDF from eg. Word TextFields?
THX
Beta Was this translation helpful? Give feedback.
All reactions