Using the --words_dir argument #140

jonah-ramponi · 2023-09-11T11:36:14Z

Hi there,

I'm wondering how this argument should be used. Based on the code, I assume the user is expected to write their own code that goes from (.pdf) -> (.json), where the .json file is named in the form "_words.json" and the image being processed is of the form ".png". You then pass "--words_dir ./Output" and the relevant JSON files are picked up.

The reason I ask is that I cannot get the final Excel files to fill: they have the correct number of rows and columns, but the cells are empty. I suspect that my code finds the bounding boxes in a slightly different manner from the internal code, and that this is causing the issue.

I just wanted to check whether there is a script for producing the "bounding box json" for an image, or whether I've missed something obvious. Thanks.

jonah-ramponi · 2023-09-12T10:02:52Z

Solved: I needed to ensure that I'd rescaled the image to the same dimensions as the PDF page, so that the word bounding boxes and the image share one coordinate system.
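For anyone hitting the same empty-cell symptom, here is a minimal sketch of the coordinate mismatch. It does the equivalent fix from the other side: instead of resizing the image, it scales the word boxes from PDF page units into image pixels. The `bbox` and `text` keys, the function name, and the page/image sizes are all assumptions made for illustration; check the format that the code reading `--words_dir` actually expects.

```python
def scale_word_boxes(words, pdf_size, image_size):
    """Map word boxes from PDF page coordinates to image pixel coordinates.

    `words` is a list of dicts with a hypothetical "bbox" key holding
    [x0, y0, x1, y1]; any other keys are copied through unchanged.
    """
    pdf_w, pdf_h = pdf_size
    img_w, img_h = image_size
    sx = img_w / pdf_w  # horizontal scale factor
    sy = img_h / pdf_h  # vertical scale factor
    scaled = []
    for word in words:
        x0, y0, x1, y1 = word["bbox"]
        scaled.append({**word, "bbox": [x0 * sx, y0 * sy, x1 * sx, y1 * sy]})
    return scaled


# Example values only: a US Letter page in points, rendered at twice that size.
words = [{"text": "Total", "bbox": [72.0, 100.0, 108.0, 112.0]}]
scaled = scale_word_boxes(words, pdf_size=(612, 792), image_size=(1224, 1584))
print(scaled[0]["bbox"])  # [144.0, 200.0, 216.0, 224.0]
```

If the boxes and the image are in different coordinate systems, the words fall outside the detected cells, which would match a table with the right shape but no content.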

