edouard-lopez/pdf2djvu-ocr

pdf2djvu-ocr

IMPORTANT (QUALITY) DISCLAIMER

This script is still young, and the resulting .djvu files are not very good: they are often larger than the original and of medium to low quality. I hope people will help me improve it. Before converting a large number of documents, run some performance and quality benchmarks.
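
As a starting point for such a benchmark, the sketch below compares the size of a converted file with its source. `size_percent` is a hypothetical helper, not part of pdf2djvu-ocr, and the file names in the usage line are placeholders. It measures size only and says nothing about visual or OCR quality.

```shell
# Hypothetical helper (not part of pdf2djvu-ocr).
# Prints the size of the converted file as a percentage of the original.
# A value above 100 means the conversion made the file larger.
size_percent() {
    orig_bytes=$(wc -c < "$1")
    conv_bytes=$(wc -c < "$2")
    if [ "$orig_bytes" -eq 0 ]; then
        echo "original file is empty" >&2
        return 1
    fi
    # Integer arithmetic: the result is rounded down.
    echo $(( conv_bytes * 100 / orig_bytes ))
}
```

For example, `size_percent scan.pdf scan.djvu` could be run on a handful of representative documents before committing to a large batch.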

Description

This script follows the discussion on SuperUser and helps convert scanned PDFs to DjVu with OCR.

Dependencies

The script relies on pdfsandwich, tesseract and pdf2djvu.

Usage

By default, i.e. when called without arguments, the script looks for PDF files in the current working directory (glob: ./*.pdf):

pdf2djvu-ocr

Otherwise, you can specify a path:

pdf2djvu-ocr /path/to/files/**/*.pdf
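
Note that in bash, `**` only recurses into subdirectories when the `globstar` option is enabled (`shopt -s globstar`); without it, `**` behaves like a single `*`. A shell-independent alternative is to let `find` collect the files. The sketch below assumes, as the glob form above implies, that pdf2djvu-ocr accepts several PDF paths as arguments. `list_pdfs` is a hypothetical helper, not part of the script.

```shell
# Hypothetical helper (not part of pdf2djvu-ocr): print every PDF found
# under the given directory, defaulting to the current one.
list_pdfs() {
    find "${1:-.}" -type f -name '*.pdf' | sort
}

# Possible usage, assuming the script accepts several file arguments:
#   find /path/to/files -type f -name '*.pdf' -exec pdf2djvu-ocr {} +
```

Running `list_pdfs /path/to/files` first is a cheap way to check which files a recursive conversion would touch.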

About

Script to help convert from scanned PDF to DjVu+OCR. Dependencies: pdfsandwich tesseract pdf2djvu
