PoC bulk search your pdf files using fuzzy text look up.
Requirements
- Docker
- docker-compose
Run this project
Clone project and submodules:
git clone --recurse-submodules https://github.com/HazemBZ/pdf-fuzzSpin up containers:
docker-compose upAccess app at: http://localhost:88
System architectures are described here.
V0-PoC
- CI/CD: docker-compose -> one click project spin up.
- BE: ETL solution for text lookup -> Faster lookups, extract once use forever.
- FE: Handle queries w/ ReactQuery -> DX.
- FE/BE: Files uploader -> QoL.
V1
- BE: Task Queue solution for files processing -> Seperation of concerns.
- FE/FE: Basic file deduplication
- BE: Refactor into pipelines and orchestrators
- BE: Test code
- BE: (docs) Add diagrams
- CI/CD: Auto migration setup