Using the --words_dir argument #140
Closed
jonah-ramponi
started this conversation in
General
Replies: 1 comment
-
I needed to ensure that I'd rescaled the image to the same dimensions as the pdf. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi there,
I am wondering how this argument should be used. I assume that the user is intended to write some code which goes from (.pdf) -> (.json) where the .json is of the form "_words.json", and the image we are processing is of the form ".png" based on the code. You then provide "--words_dir ./Output" and it will pick up on the relevant json files.
The reason I wonder is that I cannot get the final excel files to fill; it appears to have the correct number of rows and columns, however they are empty. I assume that finding the bounding boxes has been done in a slightly different manner in the internal code and my code, and this is causing the issue.
I just wanted to check if there was a script for producing the "bounding box json" for an image, or if I've missed something obvious. Thanks.
Beta Was this translation helpful? Give feedback.
All reactions