pdf2djvu-ocr

IMPORTANT (QUALITY) DISCLAIMER

This script is still young and the resulting .djvu files are not so good, often bigger than the original and with medium to low quality. I hope people will help me improve this. So before converting huge amount of documents do some performance/quality benchmarking.

Description

This Script follow the discussion on SuperUser to help convert from scanned PDF to DjVu+OCR.

Dependencies

stylerc: bash output style ;
pdfsandwich ;
tesseract-ocr ;
pdf2djvu.

Usage

The default behavior, i.e. call without arguments, will look for PDF files in the current working repository (glob: ./*.pdf) :

pdf2djvu-ocr

Otherwise you can specify a path

pdf2djvu-ocr /path/to/files/**/*.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
LICENSE-MIT.md		LICENSE-MIT.md
README.md		README.md
benchmark.sh		benchmark.sh
makefile		makefile
pdf2djvu-ocr.sh		pdf2djvu-ocr.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pdf2djvu-ocr

IMPORTANT (QUALITY) DISCLAIMER

Description

Dependencies

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

pdf2djvu-ocr

IMPORTANT (QUALITY) DISCLAIMER

Description

Dependencies

Usage

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages