Replies: 1 comment 1 reply
-
|
Using force-ocr causes all pages to be re-rendered as images in the widest colorspace and highest resolution used on the page, which will typically increase file size. --redo-ocr might be helpful |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi
As soon I start to process the images in the PDF using
ocrmypdfmy PDFs always gets larger than the original. So here are my examples including I test file test-scan.pdf:Only optimization: Ratio 0.81x (as expected)
ocrmypdf --skip-text --optimize 3 ./test-scan.pdf ./test-scan-out.pdfOnly OCR: Ratio 1.64x
ocrmypdf --force-ocr -l eng --optimize 3 ./test-scan.pdf ./test-scan-out.pdfOCR + Image optimization: Ratio 2.85x
ocrmypdf --force-ocr -l eng --rotate-pages --deskew --clean --optimize 3 ./test-scan.pdf ./test-scan-out.pdfWhen I open the PDF in Acrobat and optimize, it usually gets smaller than the original.
Any ideas or hint's what I'm doing wrong?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions