Extract tagged tables from pdf using 'StructTreeRoot' #3899
Unanswered
mllife
asked this question in
Looking for help
Replies: 1 comment
-
How make the code parse the tables from the StructTreeRoot and extract json or csv using this information? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
So, I am working with pdfs and now I need to see, if I can parse the tables in pdfs (if tagged)
Reference - https://accessible-pdf.info/en/basics/general/overview-of-the-pdf-tags/#table-elements
Attaching a sample pdf with tagged tables;
FishTables-only.pdf
Beta Was this translation helpful? Give feedback.
All reactions