Auto Image Occlusion - Anki Addon

Automatically detect and occlude text regions in images using Tesseract OCR

Automatically detect text regions in images and create Image Occlusion shapes with a single click. Works seamlessly with Anki's native Image Occlusion feature (Anki 25.09+).

Inspired by: logseq-anki-sync

✨ Features

🪄 One-Click Detection: Auto-detect text regions with a single button click
🎨 Native Integration: Seamlessly integrates with Anki's Image Occlusion toolbar
⌨️ Keyboard Shortcut: Quick access via Ctrl+Shift+A
🧠 Smart Detection: Line-based detection with PSM 12 (sparse text with OSD)
🎯 Collision Detection: Automatically skips existing occlusions
📏 Text Length Filtering: Skips detected lines that are much shorter than the average line length
🔧 Configurable: Adjust confidence, size thresholds, and filters
🚀 Persistent UI: Button automatically reappears when selecting new images
🐍 Python Backend: Uses pytesseract for reliable, fast OCR processing

📦 Installation

Prerequisites

1. Anki 25.09 or Later

Ensure you have Anki 25.09+ which includes native Image Occlusion support.

2. Tesseract OCR

Install Tesseract OCR on your system:

Linux:

sudo apt-get install tesseract-ocr

macOS:

brew install tesseract

Windows:

Download installer from GitHub releases
Run installer and note the installation path
Add to PATH: C:\Program Files\Tesseract-OCR

2.1. Additional Language Data (Optional)

Tesseract supports 100+ languages. To use non-English languages, you need to download additional language data files.

Download Language Data:

Visit tessdata repository or tessdata_best repository
Download .traineddata files for your language(s)
Common examples: spa.traineddata (Spanish), fra.traineddata (French), deu.traineddata (German), chi_sim.traineddata (Chinese Simplified)

Install Language Data:

Windows:

Copy .traineddata files to: C:\Program Files\Tesseract-OCR\tessdata\

Linux (Package Install):

# Option 1: Install via package manager
sudo apt-get install tesseract-ocr-spa  # Spanish
sudo apt-get install tesseract-ocr-fra  # French

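# Option 2: Manual install (the tessdata directory varies by distribution and Tesseract version)
sudo cp *.traineddata /usr/share/tesseract-ocr/tessdata/
# or: /usr/share/tessdata/
# Recent Tesseract versions print the directory in use when you run: tesseract --list-langs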

macOS (Homebrew):

# Option 1: Install via brew
brew install tesseract-lang  # All languages

# Option 2: Manual install
cp *.traineddata /usr/local/share/tessdata/
# or: /opt/homebrew/share/tessdata/ (Apple Silicon)

Verify Installation:

tesseract --list-langs

Example Config for Spanish:

{
    "tesseract_lang": "spa"
}

Example Config for Multiple Languages:

{
    "tesseract_lang": "eng+spa+fra"
}
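Before running OCR, it can help to check that every code in `tesseract_lang` is actually installed. The following is a minimal sketch, not the add-on's own code: `missing_languages` is a hypothetical helper, and the hard-coded `installed` set stands in for whatever `pytesseract.get_languages()` would return on your system.

```python
def missing_languages(tesseract_lang, installed):
    """Return the codes from an 'eng+spa+fra' style string that are not installed."""
    requested = [code.strip() for code in tesseract_lang.split("+") if code.strip()]
    return [code for code in requested if code not in installed]


# Assumption: in the add-on this set could come from pytesseract.get_languages();
# a fixed set is used here so the example runs without Tesseract.
installed = {"eng", "osd", "spa"}

missing = missing_languages("eng+spa+fra", installed)
if missing:
    print("Missing language data for:", ", ".join(missing))
```

Any code reported as missing needs its `.traineddata` file installed as described above.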

Install Addon

Method 1: AnkiWeb (Recommended)

Go to Tools → Add-ons
Click Get Add-ons...
Enter code: 1414192727
Restart Anki

Method 2: Manual Installation

Download or clone this repository
Copy the entire folder to your Anki addons directory:
- Windows: %APPDATA%\Anki2\addons21\auto_image_occlusion
- macOS: ~/Library/Application Support/Anki2/addons21/auto_image_occlusion
- Linux: ~/.local/share/Anki2/addons21/auto_image_occlusion
Restart Anki

🚀 Quick Start

Open Add Cards: In Anki, click Add or press A
Select Image Occlusion: Choose the Image Occlusion note type
Load Your Image: Click the image icon and select your image
Auto-Detect: Click the magic wand button (🪄) or press Ctrl+Shift+A
Wait: OCR processing takes 2-10 seconds depending on image size
Review: Automatically created occlusion boxes appear on text regions
Adjust: Move, resize, or delete boxes as needed
Add: Click "Add" to create your cards

Visual Guide

Demo video: outputfile.mp4

⚙️ Configuration

Access Config

Tools → Add-ons → Auto Image Occlusion Detection → Config

Default Configuration

{
    "tesseract_lang": "eng",
    "min_confidence": 48,
    "min_width": 4,
    "min_height": 4,
    "min_area_percent": 0.0001,
    "button_shortcut": "Ctrl+Shift+A",
    "vertical_merge_factor": 0.65
}

Configuration Options

| Option | Default | Description |
| --- | --- | --- |
| `tesseract_lang` | `"eng"` | OCR language code(s). Use `"eng+fra"` for multiple languages |
| `min_confidence` | `48` | Minimum OCR confidence (0-100). Lower = more detections |
| `min_width` | `4` | Minimum box width in pixels |
| `min_height` | `4` | Minimum box height in pixels |
| `min_area_percent` | `0.0001` | Minimum box area as a fraction of the image area (0.01 = 1%, so the default is 0.01%) |
| `button_shortcut` | `"Ctrl+Shift+A"` | Keyboard shortcut for auto-detection |
| `vertical_merge_factor` | `0.65` | Merge lines within this multiple of the average line height (handles multi-line labels); `0` disables merging |
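To make the thresholds concrete, here is a rough sketch of how the confidence, size, and area options might be applied to a single detected box. It is an illustration under assumptions, not the add-on's implementation: the `Box` class and `passes_filters` function are hypothetical, and `min_area_percent` is treated as a fraction of the image area, as the `0.01 = 1%` note suggests.

```python
from dataclasses import dataclass


@dataclass
class Box:
    left: int
    top: int
    width: int
    height: int
    conf: float  # OCR confidence, 0-100


DEFAULTS = {
    "min_confidence": 48,
    "min_width": 4,
    "min_height": 4,
    "min_area_percent": 0.0001,
}


def passes_filters(box, image_width, image_height, config=DEFAULTS):
    """Return True if the box survives the confidence, size, and area thresholds."""
    if box.conf < config["min_confidence"]:
        return False
    if box.width < config["min_width"] or box.height < config["min_height"]:
        return False
    # Assumption: the option is a fraction of the image area (0.01 = 1%).
    area_fraction = (box.width * box.height) / (image_width * image_height)
    return area_fraction >= config["min_area_percent"]


label = Box(left=40, top=60, width=100, height=20, conf=90)
print(passes_filters(label, image_width=1000, image_height=1000))
```

On a 1000x1000 image the default area threshold corresponds to 100 square pixels, so a 5x5 speck is dropped even when its confidence is high.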

Configuration Examples

For Higher Quality (fewer false positives):

{
    "min_confidence": 60,
    "min_area_percent": 0.001
}

For More Detections (catch more text):

{
    "min_confidence": 35,
    "min_area_percent": 0.00005
}

For Non-English Text (e.g., Spanish):

{
    "tesseract_lang": "spa",
    "min_confidence": 48
}

For Mixed Languages (e.g., English + Chinese):

{
    "tesseract_lang": "eng+chi_sim",
    "min_confidence": 40
}

Disable multi-line merging (treat each line separately):

{
    "vertical_merge_factor": 0
}

More aggressive multi-line merging:

{
    "vertical_merge_factor": 2.5
}
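The effect of `vertical_merge_factor` can be pictured with a small sketch. This is one plausible reading of "merge lines within N times the average height", not the add-on's actual algorithm: `merge_vertically` is a hypothetical function, boxes are `(left, top, right, bottom)` tuples, the horizontal-overlap requirement is an added assumption, and each line is compared only with the most recently merged block.

```python
def merge_vertically(lines, factor=0.65):
    """Greedily merge line boxes that sit close together vertically."""
    if factor <= 0 or len(lines) < 2:
        return list(lines)  # a factor of 0 leaves every line separate

    avg_height = sum(bottom - top for _, top, _, bottom in lines) / len(lines)
    max_gap = factor * avg_height

    merged = []
    for left, top, right, bottom in sorted(lines, key=lambda box: box[1]):
        if merged:
            p_left, p_top, p_right, p_bottom = merged[-1]
            gap = top - p_bottom
            # Assumption: only lines that share some horizontal extent are merged.
            overlaps_horizontally = left < p_right and p_left < right
            if gap <= max_gap and overlaps_horizontally:
                merged[-1] = (
                    min(p_left, left),
                    p_top,
                    max(p_right, right),
                    max(p_bottom, bottom),
                )
                continue
        merged.append((left, top, right, bottom))
    return merged


two_line_label = [(10, 10, 110, 30), (10, 35, 100, 55), (300, 200, 400, 220)]
print(merge_vertically(two_line_label, factor=0.65))
print(merge_vertically(two_line_label, factor=0))
```

With the default factor the first two boxes (5 px apart, average height 20 px) collapse into one block, while the distant third box stays separate.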

🧠 How It Works

Uses Tesseract's PSM 12 (sparse text with OSD) with line-based grouping for reliable detection.

Best for:

✅ Scattered text elements (diagrams, labels)
✅ Dense text documents
✅ Books and articles
✅ Mixed layouts with varied text positioning
✅ Anatomy diagrams
✅ Flowcharts and infographics

Detection Process:

Detects text using sparse text detection (PSM 12, sparse text with OSD)
Groups words by text line for granular detection
Merges vertically adjacent lines
Calculates the average text length for filtering
Filters by confidence threshold (min_confidence, default 48)
Filters by text length (ignores lines shorter than half the average length or shorter than 3 characters)
Detects collisions with existing occlusions (in both the Python backend and the JavaScript frontend)
Creates one occlusion per text block
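The grouping and text-length steps above can be sketched against the dictionary that `pytesseract.image_to_data(..., output_type=pytesseract.Output.DICT)` returns. This is a simplified illustration, not the add-on's code: both function names are hypothetical, the order of the steps is condensed, and the short-line rule is read here as "shorter than half the average, or shorter than 3 characters". The sample dictionary is hand-written so the example runs without Tesseract.

```python
def group_words_by_line(data, min_confidence=48):
    """Group confident words by Tesseract's (block, paragraph, line) indices."""
    lines = {}
    for i, text in enumerate(data["text"]):
        word = text.strip()
        if not word or float(data["conf"][i]) < min_confidence:
            continue
        key = (data["block_num"][i], data["par_num"][i], data["line_num"][i])
        left, top = data["left"][i], data["top"][i]
        right, bottom = left + data["width"][i], top + data["height"][i]
        line = lines.setdefault(key, {"words": [], "box": [left, top, right, bottom]})
        line["words"].append(word)
        box = line["box"]
        box[0], box[1] = min(box[0], left), min(box[1], top)
        box[2], box[3] = max(box[2], right), max(box[3], bottom)
    return [
        {"text": " ".join(line["words"]), "box": tuple(line["box"])}
        for line in lines.values()
    ]


def filter_short_lines(lines, min_chars=3):
    """Drop lines shorter than half the average length or shorter than min_chars."""
    if not lines:
        return []
    avg_len = sum(len(line["text"]) for line in lines) / len(lines)
    threshold = max(min_chars, avg_len / 2)
    return [line for line in lines if len(line["text"]) >= threshold]


# Hand-written stand-in for image_to_data output (same keys, made-up values).
sample = {
    "text": ["", "Left", "ventricle", "x", "Aorta"],
    "conf": [-1, 95, 91, 80, 30],
    "block_num": [1, 1, 1, 2, 3],
    "par_num": [1, 1, 1, 1, 1],
    "line_num": [0, 1, 1, 1, 1],
    "left": [0, 10, 60, 300, 500],
    "top": [0, 10, 12, 200, 400],
    "width": [0, 40, 80, 8, 50],
    "height": [0, 20, 18, 10, 20],
}

grouped = group_words_by_line(sample)
print(filter_short_lines(grouped))
```

In this sample, "Aorta" is dropped for low confidence and the stray "x" is dropped as too short, leaving a single "Left ventricle" line.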

Technical Details:

Uses PSM 12 (sparse text with OSD) - optimized for finding scattered text
Line-based grouping provides reliable granularity
Vertical merging handles multi-line labels (e.g., anatomy diagrams)
Each text block (single or multi-line) becomes a separate occlusion
Collision detection prevents duplicate occlusions
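Collision detection comes down to a rectangle-intersection test. The sketch below shows the idea with axis-aligned `(left, top, right, bottom)` boxes. Both function names are hypothetical, and treating any overlap at all as a collision is an assumption, since the add-on may use a different threshold.

```python
def overlaps(a, b):
    """Return True if two (left, top, right, bottom) rectangles intersect."""
    a_left, a_top, a_right, a_bottom = a
    b_left, b_top, b_right, b_bottom = b
    return a_left < b_right and b_left < a_right and a_top < b_bottom and b_top < a_bottom


def drop_collisions(candidates, existing):
    """Keep only candidate boxes that do not intersect any existing occlusion."""
    return [
        candidate
        for candidate in candidates
        if not any(overlaps(candidate, shape) for shape in existing)
    ]


existing = [(0, 0, 100, 50)]
candidates = [(50, 25, 150, 75), (200, 200, 250, 220)]
print(drop_collisions(candidates, existing))
```

Boxes that only touch along an edge are not counted as overlapping in this version.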

🏗️ Architecture

Module Structure

anki addon/
├── __init__.py                 # Package initialization
├── addon.py                    # Main entry point, registers hooks
├── editor_integration.py       # JavaScript injection logic
├── js_builder.py               # JavaScript code generator
├── message_handler.py          # Python ↔ JavaScript communication
├── ocr_engine.py               # Tesseract OCR wrapper (PSM 12, line-based)
├── config.json                 # Default configuration
├── config.md                   # Configuration documentation
├── manifest.json               # Addon metadata
└── README.md                   # This file

Data Flow

1. User opens IO note
   ↓
2. editor_did_load_note hook fires
   ↓
3. Python injects JavaScript (100ms delay)
   ↓
4. JavaScript initializes:
   - Create window.AutoIOAddon namespace
   - Intercept resetIOImageLoaded()
   - Wait for IO editor (MutationObserver)
   - Add button to toolbar
   ↓
5. User clicks button or presses Ctrl+Shift+A
   ↓
6. JavaScript:
   - Capture image element
   - Convert to base64 DataURL
   - Send via pycmd('autoDetectOCR:...')
   ↓
7. Python:
   - Decode image
   - Run Tesseract OCR (PSM 12 - sparse text with OSD)
   - Group text by lines
   - Calculate average text length
   - Filter by confidence, size, and text length
   - Detect collisions with existing shapes
   - Return non-colliding JSON results
   ↓
8. JavaScript:
   - Transform coordinates (image → canvas)
   - Double-check overlapping regions (safety measure)
   - Create Rectangle shapes
   - Add to maskEditor
   - Redraw canvas
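Steps 6 and 7 hinge on Python unpacking the message that JavaScript sends. The following is a minimal sketch of that decoding step only. It assumes the payload after the `autoDetectOCR:` prefix is a plain base64 data URL, which the flow above suggests but does not spell out, and `decode_message` is a hypothetical name. In Anki, such a function would typically be called from a handler registered on `gui_hooks.webview_did_receive_js_message`; that wiring is omitted here.

```python
import base64

PREFIX = "autoDetectOCR:"


def decode_message(message):
    """Return (mime_type, image_bytes) for an auto-detect message, else None."""
    if not message.startswith(PREFIX):
        return None  # not ours; let other handlers deal with it
    payload = message[len(PREFIX):]
    header, separator, encoded = payload.partition(",")
    if not separator or not header.startswith("data:") or not header.endswith(";base64"):
        raise ValueError("expected a base64 data URL")
    mime_type = header[len("data:"):-len(";base64")]
    return mime_type, base64.b64decode(encoded)


# Made-up payload: the first four bytes of a PNG file, base64-encoded.
example = PREFIX + "data:image/png;base64," + base64.b64encode(b"\x89PNG").decode("ascii")
print(decode_message(example))
```

The returned bytes could then be opened with Pillow, for example via `Image.open(io.BytesIO(image_bytes))`, before being passed to Tesseract.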

🔧 Troubleshooting

"Auto-detection failed: OCR timeout"

Symptoms: Error message after clicking button

Solutions:

✅ Reduce very large images (to ~1920px width)
✅ On a slow system, increase the timeout in the JavaScript config
✅ Confirm Tesseract is installed correctly (tesseract --version)
✅ Check Anki debug console for Python errors
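If large images are the cause, downscaling before OCR is straightforward. The helper below is a hypothetical sketch that only computes the target size, keeping the aspect ratio. The 1920px limit comes from the suggestion above, and nothing here reflects what the add-on does internally.

```python
def fit_within_width(width, height, max_width=1920):
    """Return a (width, height) no wider than max_width, preserving aspect ratio."""
    if width <= max_width:
        return width, height
    scale = max_width / width
    return max_width, max(1, round(height * scale))


print(fit_within_width(3840, 2160))
print(fit_within_width(800, 600))
```

With Pillow, the result can be passed straight to `image.resize(fit_within_width(*image.size))`.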

Tesseract Not Found

Symptoms: pytesseract.TesseractNotFoundError

Solutions:

Verify Installation:
```
tesseract --version
```
Add to PATH (Windows):
- System Properties → Environment Variables
- Add C:\Program Files\Tesseract-OCR to PATH
- Restart Anki
Reinstall Tesseract and verify during installation
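When PATH changes are not practical, pytesseract can also be pointed at the executable directly through `pytesseract.pytesseract.tesseract_cmd`. The sketch below only locates the binary. `find_tesseract` is a hypothetical helper, and the Windows default path is taken from the installation notes above.

```python
import shutil
from pathlib import Path

# Default Windows install location mentioned in the installation steps.
WINDOWS_DEFAULT = Path(r"C:\Program Files\Tesseract-OCR\tesseract.exe")


def find_tesseract(extra_candidates=(WINDOWS_DEFAULT,)):
    """Return the path to the tesseract executable, or None if it cannot be found."""
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    for candidate in extra_candidates:
        if Path(candidate).is_file():
            return str(candidate)
    return None


path = find_tesseract()
if path is None:
    print("Tesseract was not found on PATH or in the default location")
else:
    # With pytesseract imported, the binary could be set explicitly:
    # pytesseract.pytesseract.tesseract_cmd = path
    print("Using Tesseract at", path)
```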

No Text Detected

Symptoms: "No text regions detected" message

Solutions:

✅ Lower min_confidence (try 30-40)
✅ Lower min_area_percent (try 0.00005)
✅ Ensure image has clear, readable text
✅ Check if correct language is set (tesseract_lang)
✅ Improve image quality/contrast

Poor Detection Accuracy

Symptoms: Too many false positives or missing text

Solutions:

Too many false positives:

Increase min_confidence (try 55-65)
Increase min_area_percent (try 0.001)

Missing text:

Decrease min_confidence (try 35-40)
Decrease min_area_percent (try 0.00001)
Improve image quality/contrast

🤝 Contributing

Contributions are welcome! To contribute:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Test thoroughly in Anki
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

🙏 Credits

Inspiration

logseq-anki-sync - Original auto-detection concept

Dependencies

Tesseract OCR - Text detection engine
pytesseract - Python wrapper for Tesseract
Pillow - Image processing library

Icons

Magic wand icon from Material Design Icons (mdiAutoFix)

📄 License

GNU AGPL v3+ - Same as Anki's license

This addon is free and open-source. See Anki's license for full details.

Made with ❤️ for the Anki community


BEST8OY/Auto-Image-Occlusion-Anki-Addon
