Skip to content

build(deps): bump ocrmypdf from 16.13.0 to 17.0.1 in /pdf#255

Merged
github-actions[bot] merged 1 commit intomainfrom
dependabot/pip/pdf/ocrmypdf-17.0.1
Feb 6, 2026
Merged

build(deps): bump ocrmypdf from 16.13.0 to 17.0.1 in /pdf#255
github-actions[bot] merged 1 commit intomainfrom
dependabot/pip/pdf/ocrmypdf-17.0.1

Conversation

@dependabot
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Feb 6, 2026

Bumps ocrmypdf from 16.13.0 to 17.0.1.

Changelog

Sourced from ocrmypdf's changelog.

v17.0.1

  • Fixed output file size inflation when using pypdfium as rasterizer and force-ocr mode.

v17.0.0

Breaking changes

  • Plugin interface migration: Plugin hooks now receive OcrOptions objects instead of argparse.Namespace objects. Most plugins will continue working due to duck-typing compatibility, but plugin developers should update their type hints from Namespace to OcrOptions.
  • Built-in plugins no longer modify options in-place, improving immutability and code clarity.
  • Lossy JBIG2 removed: The --jbig2-lossy and --jbig2-page-group-size options have been removed due to well-documented risks of character substitution errors. These options are now deprecated and will emit warnings if used. Only lossless JBIG2 compression is supported.
  • PDF/A output behavior change: If neither Ghostscript nor verapdf is installed, --output-type auto (the new default) will produce a standard PDF instead of PDF/A. This is a change from previous versions where Ghostscript was required and PDF/A was always produced. This configuration is rare but users should be aware of the change.

New features

  • pypdfium2 rasterizer: Added optional pypdfium2-based PDF rasterization plugin as an alternative to Ghostscript for page rendering. Use --rasterizer pypdfium to enable (requires pip install pypdfium2). The default --rasterizer auto prefers pypdfium when available and falls back to Ghostscript.

  • Pluggable OCR engines: New --ocr-engine option allows selecting OCR engines:

    • auto (default): Uses Tesseract
    • tesseract: Explicit Tesseract selection
    • none: Skip OCR entirely for PDF processing-only workflows

    This prepares the foundation for future third-party OCR engine plugins.

  • Smart PDF/A conversion: New --output-type auto (now the default) produces best-effort PDF/A output without requiring Ghostscript when the verapdf validator is available. Falls back to traditional Ghostscript conversion when needed.

  • verapdf integration: Added optional verapdf validation for fast PDF/A conversion. When available, OCRmyPDF attempts speculative PDF/A conversion using pikepdf, validates with verapdf, and skips Ghostscript if validation passes.

  • Optional Ghostscript: As a consequence of the changes above, Ghostscript is no longer a required dependency. It is optional.

  • fpdf2 text renderer: Replaced legacy hOCR text renderer with new fpdf2-based implementation, providing better multilingual support and more accurate text positioning.

  • Improved Occulta glyphless font: The new Occulta font provides better handling of zero-width markers and double-width CJK characters for accurate text layer positioning.

  • Expanded multilingual font support: Added FontProvider infrastructure with language-aware font selection for Devanagari (Hindi, Sanskrit, Marathi, Nepali), CJK (Chinese, Japanese, Korean), Arabic script, and many other scripts. System font discovery reduces package size.

  • Simplified mode selection: New --mode (-m) argument consolidates processing options:

... (truncated)

Commits
  • 3abe8f7 v17.0.1 release notes
  • 64f45b7 Fix pypdfium type checking
  • 7e939ad Fix pypdfium rasterizer to respect raster_device colorspace
  • 297fb78 Update uv.lock (for protobuf)
  • ad30dd9 Merge branch 'release/v17'
  • e77f79a Merge branch 'main' of github.com:ocrmypdf/OCRmyPDF
  • c84fc56 Update CLI completions to match current options
  • 0a0756b Tidy long lines and unnested with blocks
  • c5d3ef4 Tighten ruff rules and modernize style
  • 6b37583 Refactor: move ocr_element to a better location
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [ocrmypdf](https://github.com/ocrmypdf/OCRmyPDF) from 16.13.0 to 17.0.1.
- [Release notes](https://github.com/ocrmypdf/OCRmyPDF/releases)
- [Changelog](https://github.com/ocrmypdf/OCRmyPDF/blob/main/docs/release_notes.md)
- [Commits](ocrmypdf/OCRmyPDF@v16.13.0...v17.0.1)

---
updated-dependencies:
- dependency-name: ocrmypdf
  dependency-version: 17.0.1
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot bot added dependencies Pull requests that update a dependency file python Pull requests that update python code labels Feb 6, 2026
@github-actions github-actions bot added the build [Conventional Commits] Changes that affect the build system or external dependencies label Feb 6, 2026
@github-actions github-actions bot enabled auto-merge (squash) February 6, 2026 22:07
@github-actions github-actions bot merged commit 59cc158 into main Feb 6, 2026
54 checks passed
@github-actions github-actions bot deleted the dependabot/pip/pdf/ocrmypdf-17.0.1 branch February 6, 2026 22:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

build [Conventional Commits] Changes that affect the build system or external dependencies dependencies Pull requests that update a dependency file python Pull requests that update python code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants