-
Hello @aborruso, I see. Yes, indeed: we have a post-processing step in our table model that removes fully empty columns and rows, because it is not always obvious whether a column is intentionally empty (that depends on the table styling). In your case it is clearly intentional. Thanks for suggesting this!
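To make the behaviour described above concrete, here is a minimal sketch in plain Python of a post-processing step that drops fully empty rows and columns. It is not the actual table-model code: the function name `drop_empty`, the `keep_empty_columns` flag, and the list-of-lists table representation are all assumptions made for illustration.

```python
from typing import List

Table = List[List[str]]


def _is_blank(cell: str) -> bool:
    """A cell counts as empty if it is None or only whitespace."""
    return cell is None or str(cell).strip() == ""


def drop_empty(table: Table, keep_empty_columns: bool = False) -> Table:
    """Hypothetical sketch: remove fully empty rows and, unless
    keep_empty_columns is set, fully empty columns as well."""
    # Drop rows in which every cell is blank.
    rows = [row for row in table if not all(_is_blank(c) for c in row)]
    if keep_empty_columns or not rows:
        return rows

    width = max(len(row) for row in rows)

    def column_is_blank(i: int) -> bool:
        # Cells missing from short rows are treated as blank.
        return all(i >= len(row) or _is_blank(row[i]) for row in rows)

    keep = [i for i in range(width) if not column_is_blank(i)]
    return [[row[i] if i < len(row) else "" for i in keep] for row in rows]


# A table whose middle column is intentionally empty.
page2 = [["A", "", "1"], ["B", "", "2"]]
print(drop_empty(page2))                           # middle column removed
print(drop_empty(page2, keep_empty_columns=True))  # table left as-is
```

With a flag like this, a table whose column is intentionally empty would keep its original width, at the cost of also keeping columns that are empty only because of styling.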
-
Created an issue; we will look into it very soon: #204
-
Hi @maxmnemonic, I'm adding a note here about the same PDF: the last rows on page 1 are not extracted well, and an extra row is produced with incorrectly arranged cells (see below). Would it be useful to create a new issue about this? Thank you.
-
Hi @maxmnemonic @dolfim-ibm, I just wanted to check whether there are any updates on how to retain empty columns during text extraction from PDF files. I am facing issues similar to the ones mentioned here. Your reply would be greatly appreciated.
-
Hi,
I have this sample PDF.
If I try to extract content from it via the CLI or via Python code, the column with no values on page 2 is removed, which makes it impossible to merge the two tables.
Is there no way to output the empty columns as well?
Thank you
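One possible workaround until empty columns are preserved, sketched here in plain Python: put the dropped column back into the page 2 rows before merging them with page 1. This assumes you already know which column index was dropped (for example, by comparing against the page 1 header); `reinsert_empty_columns` and the sample rows are hypothetical and are not part of any library.

```python
from typing import Iterable, List

Table = List[List[str]]


def reinsert_empty_columns(rows: Table, missing_indices: Iterable[int],
                           fill: str = "") -> Table:
    """Hypothetical helper: re-add empty columns at the given positions.

    missing_indices are the column positions in the *full* table, so they
    are inserted in ascending order.
    """
    fixed = []
    for row in rows:
        new_row = list(row)
        for idx in sorted(missing_indices):
            new_row.insert(idx, fill)
        fixed.append(new_row)
    return fixed


# Illustrative data only: page 1 kept all three columns, while the
# extraction of page 2 lost the empty middle column.
page1 = [["name", "note", "value"], ["A", "x", "1"]]
page2 = [["B", "2"], ["C", "3"]]

merged = page1 + reinsert_empty_columns(page2, missing_indices=[1])
for row in merged:
    print(row)
```

This only works when the dropped position is known in advance, so it is a stopgap rather than a substitute for an option that keeps empty columns in the output.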