TejOCR

OCR inside Writer, with predictable output behavior

TejOCR is a LibreOffice Writer extension that performs OCR from:

a Writer-selected image object, or
a local image file.

The extension inserts recognized text based on the selected output mode with fallbacks for UI or session capability differences.

What's New In 0.2.0

OCR Complete is now a structured dialog instead of a dense result dump:
- grouped sections,
- better requested/effective diagnostics,
- cleaner language display,
- scrollable source lists for larger batches.
OCR-inserted Writer text now defaults to 6 pt for cursor insertion, text-box output, and replace-image flows.
Package/install metadata is corrected for stricter LibreOffice environments, including the Windows license-path failure class.
Release docs now include:

UI snapshots

Settings Main UI

Help UI

Setup & Diagnostics UI

Runtime in one screen (ASCII)

┌─────────────────────┐
│ Writer UI/Toolbar   │
│ (menu/commands)     │
└──────────┬──────────┘
           v
    ┌───────────────────────┐
    │ UNO dispatch URL      │
    │ (ProtocolHandler.xcu) │
    └──────────┬────────────┘
               v
      ┌──────────────────────┐
      │ TejOCRService        │
      │ (tejocr_service.py) │
      └─────┬────────┬───────┘
            │        │
            │        ├─ Settings surface
            │        │    -> Settings
            │        │    -> Advanced Engine Parameters
            │        │    -> Setup & Diagnostics
            │        │    -> Help
            │        │    -> A Message
            │        │
            │        └─ OCR run surface
            │             -> OCR options dialog/fallback
            │             -> Preview / Review fallback
            │             -> OCR Complete
            │
            v
      OCR source
   selected image | file | PDF
            v
      ┌──────────────────────────┐
      │ OCR Engine               │
      │ (tejocr_engine.py)       │
      │ - bounded OCR plan       │
      │ - CLI tesseract runtime  │
      │ - PDF page streaming     │
      │ - requested/effective    │
      └─────────┬────────────────┘
                v
      ┌──────────────────────────┐
      │ Output Router            │
      │ (tejocr_output.py)       │
      │ at_cursor | clipboard    │
      │ new_text_box | replace   │
      │ inserted text -> 6 pt    │
      └──────────────────────────┘

%%{init: {"theme":"base","themeVariables":{"lineColor":"#6d28d9","fontSize":"14px","fontFamily":"Inter, Segoe UI, Arial","nodeTextColor":"#111827","textColor":"#111827","lineWidth":"2","signalColor":"#0f766e"}}}%%
flowchart TD
  A["Writer UI/Toolbar"] --> B["Protocol URL via ProtocolHandler.xcu"]
  B --> C["TejOCRService (tejocr_service.py)"]
  C --> D["Settings surface"]
  D --> D1["Settings / Advanced Params / Setup / Help / A Message"]
  C --> E["_perform_ocr_with_options() / _perform_batch_ocr()"]
  E --> F["Option dialog or fallback defaults"]
  F --> G["engine.perform_ocr()"]
  G --> H["resolve plan + preprocess + Tesseract"]
  H --> I["preview/review if enabled"]
  I --> J["handle_ocr_output()"]
  J --> K["at_cursor / clipboard / new_text_box / replace_image"]
  J --> L["OCR Complete dialog"]
  classDef ui fill:#93c5fd,color:#0f172a,stroke:#1d4ed8,stroke-width:2px;
  classDef service fill:#22c55e,color:#052e16,stroke:#15803d,stroke-width:2px;
  classDef engine fill:#f59e0b,color:#0f172a,stroke:#b45309,stroke-width:2px;
  classDef output fill:#db2777,color:#ffffff,stroke:#be185d,stroke-width:2px;
  class A ui;
  class B service;
  class C service;
  class D service;
  class D1 service;
  class E service;
  class F service;
  class G engine;
  class H engine;
  class I output;
  class J output;
  class K output;
  class L output;

replace_image is only valid for Writer-selected-image flow.

What it supports

Input sources
- Selected Writer image
- Local image file (single or multi-select for batch processing)
- Multi-page PDF documents (requires pdftoppm/poppler-utils or mutool installed)
Output modes
- Insert at cursor
- Copy to clipboard
- Insert into new text box
- Replace selected image (selection-only)
Compatibility mode
- If UNO dialog UI services are unavailable, TejOCR switches to fallback prompts and still runs OCR.

Important support note

selected image source -> can use replace_image
file source          -> cannot target an image replacement
                       -> automatically uses insertion-compatible behavior

%%{init: {"theme":"base","themeVariables":{"lineColor":"#6d28d9","fontSize":"14px","fontFamily":"Inter, Segoe UI, Arial","nodeTextColor":"#111827","textColor":"#111827","lineWidth":"2","signalColor":"#0f766e"}}}%%
flowchart LR
  A["selected image"] --> B["replace_image allowed"]
  B --> C["remove graphic and insert text"]
  D["file input"] --> E["replace_image rejected"] --> F["insert-compatible output used"]
  classDef image fill:#2563eb,color:#ffffff,stroke:#1d4ed8,stroke-width:2px;
  classDef action fill:#22c55e,color:#052e16,stroke:#15803d,stroke-width:2px;
  classDef fallback fill:#f97316,color:#ffffff,stroke:#ea580c,stroke-width:2px;
  class A image;
  class B action;
  class C action;
  class D image;
  class E fallback;
  class F action;

Install

Install TejOCR-0.2.0.oxt from extension manager.
Restart LibreOffice.
Open TejOCR → Settings and verify dependency status.

Requirements

LibreOffice 4.0+
Tesseract (tesseract command installed)
LibreOffice Python packages:
- numpy
- pytesseract
- pillow

Platform commands

macOS: brew install tesseract
Linux: sudo apt install tesseract-ocr tesseract-ocr-eng
Windows: install Tesseract (UB Mannheim), then install python deps in LibreOffice Python.

If detection fails, set full Tesseract executable path in Settings.

Typical flows

A) Selected image

Select image in Writer.
Open TejOCR → OCR Selected Image.
Choose language/output/preprocessing.
Run and get result.

B) File image or PDF (Batch processing)

Open TejOCR → OCR Image from File.
Select one or more image files and/or PDF documents.
Choose language/output/preprocessing, and toggle Merge bulk/PDF into single output if desired.
Run and get result. PDFs are rendered page-by-page, and file/PDF batches can run in bounded parallel workers.

Troubleshooting quick wins

Dependency errors

Missing python packages: install in LibreOffice Python and restart.
Missing Tesseract path: verify with Settings and environment/path.

Inserted text lands at document end

This happens when saved cursor anchor is not recoverable.
Use new_text_box mode for reliable placement.

Empty OCR result

increase contrast/binarization settings,
scale up (1.2 to 1.5),
for PDFs, try Accuracy preset to use 300 DPI rendering,
verify installed language data.

Extension card shows raw XML / tiny icon / stale metadata

This usually means metadata cache or manifest mismatch.

Quit LibreOffice.
Clear ~/Library/Application Support/LibreOffice/*/user/uno_packages/cache/uno_packages/.
Uninstall/reinstall from a freshly built .oxt.
Restart and recheck Extension Manager details.

Documentation map

Root index:

TECHNICAL.md: architecture + function-level runtime map.
CODEMAP.md: module ownership map.
DEVELOPER_GUIDE.md: build/packaging guidance.
FUNCTIONALITY.md: user workflow.
CHANGELOG.md: release notes and shipped changes.

Deep docs:

docs/architecture/overview.md
docs/architecture/dispatch-flow.md
docs/reference/method-map.md
docs/reference/uno-apis.md
docs/flow/selected-image-ocr.md
docs/flow/file-image-ocr.md
docs/reference/output-modes.md
docs/reference/ocr-options-and-engine-tuning.md
docs/dev/ocr-hardening-checklist.md
docs/dev/tejocr-ui-alignment-plan.md
docs/dev/security-review.md
docs/troubleshooting/installation.md
docs/troubleshooting/dialog-fallbacks.md

Reading order:

README → TECHNICAL → docs/architecture → docs/flow → docs/troubleshooting

Notes on docs format

All major technical docs are intentionally dual-form:

ASCII flow blocks for terminal/code review readability.
Mermaid diagrams for quick visual scanning and review contexts.

About the Name

Tej (तेज) in Sanskrit and other Indian languages means light, effulgence, sharpness, or brilliance. TejOCR aims to bring clarity and insight to your documents by making the text within images accessible and editable.

Maintainer: Devansh Varshney
GitHub: varshneydevansh
Twitter: @varshneydevansh

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
META-INF		META-INF
__pycache__		__pycache__
dialogs		dialogs
dist		dist
docs		docs
icons		icons
images		images
l10n		l10n
python/tejocr		python/tejocr
scripts		scripts
tests		tests
.gitignore		.gitignore
Addons.xcu		Addons.xcu
CHANGELOG.md		CHANGELOG.md
CODEMAP.md		CODEMAP.md
DEVELOPER_GUIDE.md		DEVELOPER_GUIDE.md
FUNCTIONALITY.md		FUNCTIONALITY.md
LICENSE		LICENSE
LibreOffice Developer's Guide_ Chapter 4 - Extensions - The Document Foundation Wiki.html		LibreOffice Developer's Guide_ Chapter 4 - Extensions - The Document Foundation Wiki.html
ProtocolHandler.xcu		ProtocolHandler.xcu
README.md		README.md
build.py		build.py
build_tejocr.py		build_tejocr.py
description.xml		description.xml
description_en.txt		description_en.txt
description_hi.txt		description_hi.txt
generate_icons.py		generate_icons.py
generate_translations.py		generate_translations.py
install_dependencies.py		install_dependencies.py
run_libreoffice_with_extension.py		run_libreoffice_with_extension.py
technical.md		technical.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TejOCR

What's New In 0.2.0

UI snapshots

Settings Main UI

Help UI

Setup & Diagnostics UI

Runtime in one screen (ASCII)

What it supports

Important support note

Install

Requirements

Platform commands

Typical flows

A) Selected image

B) File image or PDF (Batch processing)

Troubleshooting quick wins

Dependency errors

Inserted text lands at document end

Empty OCR result

Extension card shows raw XML / tiny icon / stale metadata

Documentation map

Notes on docs format

About the Name

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TejOCR

What's New In 0.2.0

UI snapshots

Settings Main UI

Help UI

Setup & Diagnostics UI

Runtime in one screen (ASCII)

What it supports

Important support note

Install

Requirements

Platform commands

Typical flows

A) Selected image

B) File image or PDF (Batch processing)

Troubleshooting quick wins

Dependency errors

Inserted text lands at document end

Empty OCR result

Extension card shows raw XML / tiny icon / stale metadata

Documentation map

Notes on docs format

About the Name

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages