Home Index Scrape is an Home Index RPC module that extracts metadata from files. It collects information using tools such as ExifTool, FFmpeg, MediaInfo and Apache Tika and returns structured metadata for indexing in Meilisearch.
The repository provides a docker-compose.yml that starts Home Index, Meilisearch, Apache Tika and this scrape module. After installing Docker run:
docker compose upSample files under bind-mounts/files/ will be scanned and metadata is stored next to them in a metadata folder.
Install the dependencies and launch the module directly:
pip install -r requirements.txt
python packages/home_index_scrape/main.pyBy default the server listens on port 9000. Set NAME to change the module name.
Common environment variables used by the module:
NAME– module name (defaultscrape)DEBUG– set toTruefor verbose loggingWAIT_FOR_DEBUGPY_CLIENT– wait for a debugger to attach before startingTIKA_SERVER_ENDPOINT– URL of the Tika server, e.g.http://tika:9998
For a full description of the RPC interface see the Home Index documentation.