Skip to content

PoC bulk search you pdf files using text look up

Notifications You must be signed in to change notification settings

HazemBZ/pdf-fuzz

Repository files navigation

pdf-fuzz

PoC bulk search your pdf files using fuzzy text look up.

Setup

Requirements

  • Docker
  • docker-compose

Run this project

Clone project and submodules:

git clone --recurse-submodules https://github.com/HazemBZ/pdf-fuzz

Spin up containers:

docker-compose up

Access app at: http://localhost:88

Documentation

System architectures are described here.

TODOs

V0-PoC

  • CI/CD: docker-compose -> one click project spin up.
  • BE: ETL solution for text lookup -> Faster lookups, extract once use forever.
  • FE: Handle queries w/ ReactQuery -> DX.
  • FE/BE: Files uploader -> QoL.

V1

  • BE: Task Queue solution for files processing -> Seperation of concerns.
  • FE/FE: Basic file deduplication
  • BE: Refactor into pipelines and orchestrators
  • BE: Test code
  • BE: (docs) Add diagrams
  • CI/CD: Auto migration setup

About

PoC bulk search you pdf files using text look up

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published