This repository provides an OCR module for Home Index. It exposes an XML-RPC service that reads images and PDF files with EasyOCR. The extracted text is written back to the document so it can be indexed by Home Index.
docker compose up

The provided docker-compose.yml launches Meilisearch, Home Index and this module. Once running, Home Index connects to the module at http://home-index-read:9000.
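To confirm the module is listening after the stack starts, a plain TCP check against the URL above can help. This is a minimal sketch, not part of the repository: the helper name is_reachable is hypothetical, and it only tests that the port accepts connections, not that the RPC interface responds correctly. The hostname home-index-read is assumed to resolve only inside the compose network, so the check would be run from another container in that network.

```python
import socket
from urllib.parse import urlsplit


def is_reachable(url: str, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds.

    Hypothetical helper for illustration; it does not speak XML-RPC.
    """
    parts = urlsplit(url)
    # The URL must carry an explicit host and port, as the module URL does.
    if parts.hostname is None or parts.port is None:
        raise ValueError(f"URL needs a host and a port: {url!r}")
    try:
        with socket.create_connection((parts.hostname, parts.port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS failures.
        return False


if __name__ == "__main__":
    # URL taken from the text above; resolvable inside the compose network.
    print(is_reachable("http://home-index-read:9000"))
```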
The module's behaviour can be configured with the following environment variables (defaults in brackets):
NAME – module name [read]
LANGUAGES – comma-separated list of languages [en]
MODEL_STORAGE_DIRECTORY – where EasyOCR stores models [/easyocr]
WORKERS – number of EasyOCR workers [1]
BATCH_SIZE – batch size for OCR [6]
GPU – use the GPU if available [torch.cuda.is_available()]
PYTORCH_CUDA_ALLOC_CONF – PyTorch CUDA allocation settings [expandable_segments:True]
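The variables and defaults listed above could be read along the following lines. This is a sketch under stated assumptions, not the code in main.py: the Settings class, load_settings and _detect_gpu are hypothetical names, and the accepted spellings for a true GPU value ("1", "true", "yes", "on") are an assumption. PYTORCH_CUDA_ALLOC_CONF is left out because PyTorch reads it from the environment itself.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping, Optional


def _detect_gpu() -> bool:
    """Return torch.cuda.is_available(), or False when torch is not installed."""
    try:
        import torch
    except ImportError:
        return False
    return bool(torch.cuda.is_available())


@dataclass(frozen=True)
class Settings:
    # Hypothetical container; field names mirror the variables listed above.
    name: str
    languages: List[str]
    model_storage_directory: str
    workers: int
    batch_size: int
    gpu: bool


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build Settings from a mapping (os.environ by default) using the documented defaults."""
    env = os.environ if env is None else env

    gpu_raw = env.get("GPU")
    if gpu_raw is None:
        # Documented default: use the GPU when one is available.
        gpu = _detect_gpu()
    else:
        # Assumed truthy spellings; the real module may parse this differently.
        gpu = gpu_raw.strip().lower() in {"1", "true", "yes", "on"}

    # Split the comma-separated language list and drop empty entries.
    languages = [p.strip() for p in env.get("LANGUAGES", "en").split(",") if p.strip()]

    return Settings(
        name=env.get("NAME", "read"),
        languages=languages,
        model_storage_directory=env.get("MODEL_STORAGE_DIRECTORY", "/easyocr"),
        workers=int(env.get("WORKERS", "1")),
        batch_size=int(env.get("BATCH_SIZE", "6")),
        gpu=gpu,
    )


if __name__ == "__main__":
    print(load_settings())
```

Passing a plain dictionary instead of os.environ, for example load_settings({"LANGUAGES": "en,de", "GPU": "false"}), makes it easy to try different values without changing the process environment.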
packages/home_index_read/main.py – module implementation
Dockerfile – build the Docker image
docker-compose.yml – example compose file for local testing
See the module specification in the Home Index repository for details on the RPC interface.