The E-Office Deduplication Hub is a document management platform built with React, TypeScript, and Supabase. It streamlines digital archives by automatically detecting and removing duplicate files in two ways: MD5 hashing catches exact, byte-identical copies, and NLP-based cosine similarity catches files whose content is nearly the same even when the bytes differ.
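To illustrate the two checks, here is a minimal sketch rather than the project's actual code. It assumes a Node.js runtime (for `node:crypto`), plain-text input, and a simple term-frequency vector. The function names, the tokenizer, and the `0.9` threshold are all hypothetical and chosen only for illustration; the real NLP pipeline may differ.

```typescript
import { createHash } from "node:crypto";

// Exact-match check: identical input always yields the same MD5 digest.
function md5Hex(content: string): string {
  return createHash("md5").update(content).digest("hex");
}

// Hypothetical tokenizer (an assumption, not the project's pipeline):
// lowercase the text, split on non-alphanumeric characters, count terms.
function termFrequencies(text: string): Map<string, number> {
  const counts = new Map<string, number>();
  for (const token of text.toLowerCase().split(/[^a-z0-9]+/)) {
    if (token.length === 0) continue;
    counts.set(token, (counts.get(token) ?? 0) + 1);
  }
  return counts;
}

// Cosine similarity of two term-frequency vectors: dot(a, b) / (|a| * |b|).
// Returns 0 when either vector is empty.
function cosineSimilarity(
  a: Map<string, number>,
  b: Map<string, number>,
): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (const [term, count] of a) {
    dot += count * (b.get(term) ?? 0);
    normA += count * count;
  }
  for (const count of b.values()) {
    normB += count * count;
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Try the cheap exact check first, then fall back to content similarity.
// The default threshold is an illustrative assumption.
function isDuplicate(a: string, b: string, threshold = 0.9): boolean {
  if (md5Hex(a) === md5Hex(b)) return true;
  return cosineSimilarity(termFrequencies(a), termFrequencies(b)) >= threshold;
}
```

Running the hash comparison first keeps the common case cheap: the similarity computation is only needed when the digests differ.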
Contributions are welcome! Please feel free to submit a Pull Request.