table extraction within a rectangle #3942
Unanswered
renraeldab
asked this question in
Looking for help
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Table._get_header() starts from the first line above the table upwards, which exceeds the "clip" rectangle I provide to page.find_tables(). And it usually makes mistakes.
The page is like:
table title
table content
I have the correct rectangle of the table, which includes the headers. But pymupdf uses the table title as headers, and the markdown looks like:
Beta Was this translation helpful? Give feedback.
All reactions