Skip to content

Commit 6a7e94d

Browse files
committed
Added support for OCR and docx, xlsx, pptx, pdf, jpg, png, gif, zip, tar, rar file types
1 parent 58194d4 commit 6a7e94d

File tree

3 files changed

+10
-4
lines changed

3 files changed

+10
-4
lines changed

readme.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@
1717

1818
### 🦅 HAWK Eye - Highly Advanced Watchful Keeper Eye
1919

20-
HAWK Eye is a powerful and versatile CLI (Command-Line Interface) tool designed to be your vigilant watchkeeper, guarding against potential data breaches and cyber threats across various platforms. Inspired by the precision and vision of majestic birds of prey, HAWK Eye swiftly scans multiple data sources, including S3, MySQL, PostgreSQL, MongoDB, Slack, Redis, Firebase, filesystem, Slack, and Google Cloud buckets (GCS), for Personally Identifiable Information (PII) and secrets.
20+
HAWK Eye is a powerful and versatile CLI (Command-Line Interface) tool designed to be your vigilant watchkeeper, guarding against potential data breaches and cyber threats across various platforms. Inspired by the precision and vision of majestic birds of prey, HAWK Eye swiftly scans multiple data sources, including S3, MySQL, PostgreSQL, MongoDB, Slack, Redis, Firebase, filesystem, Slack, and Google Cloud buckets (GCS), for Personally Identifiable Information (PII) and secrets. It uses text analysis and OCR techniques to go throug most of the documents, database and different file types like docx, xlsx, pptx, pdf, jpg, png, gif, zip, tar, rar, etc.
2121

2222

2323
### Why "HAWK Eye"?

requirements.txt

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,4 +8,10 @@ slack-sdk
88
google-cloud-core
99
google-cloud-storage
1010
pymongo==3.13.0
11-
tinydb==4.8.0
11+
tinydb==4.8.0
12+
pytesseract
13+
Pillow
14+
docx
15+
openpyxl
16+
PyPDF2
17+
patool

setup.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
VERSION = "0.2.6"
1+
VERSION = "0.3.0"
22

33
from setuptools import setup, find_packages
44

@@ -11,7 +11,7 @@
1111
setup(
1212
name='hawk_scanner',
1313
version=VERSION,
14-
description='A powerful scanner to scan your Filesystem, S3, MongoDB, MySQL, PostgreSQL, Redis, Slack, Google Cloud Storage and Firebase storage for PII and sensitive data.',
14+
description='A powerful scanner to scan your Filesystem, S3, MongoDB, MySQL, PostgreSQL, Redis, Slack, Google Cloud Storage and Firebase storage for PII and sensitive data using text and OCR analysis. Hawk-eye can also analyse supports most of the file types like docx, xlsx, pptx, pdf, jpg, png, gif, zip, tar, rar, etc.',
1515
long_description=long_description,
1616
long_description_content_type="text/markdown",
1717
url='https://github.com/rohitcoder/hawk-eye',

0 commit comments

Comments
 (0)