Commit 9081d16

1. main.sh
2. ignore output of txt file
3. ignore sample
4. update docs about how to run code
1 parent 3d00b7b commit 9081d16

7 files changed: +16 -863 lines changed
.gitignore

Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,2 @@
+output/*.txt
+samples/*.pdf

README.md

Lines changed: 10 additions & 0 deletions
@@ -2,6 +2,16 @@
 Python Multiple PDF Documents Text Extraction - Python 3.7
 ![Logo](XPDF.jpg)
 
+## How to run
+sh main.sh
+
+After that, check 'folder/output'
+CTRL + H to replace
+1. \n with space
+2. code <0xsdx>
+3.
+
+
 ## Introduction
 **As a Data Scientist , You may not stick to data format.**
 
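
The README additions above describe a manual find-and-replace pass over the extracted text. A minimal sketch of how that cleanup might be scripted follows. The function name `clean_extracted_text` is hypothetical, and the `<0x..>` pattern is only an assumption about what the placeholder `<0xsdx>` stands for; the third step is left empty in the README and is not guessed at here.

```python
import re


def clean_extracted_text(text: str) -> str:
    """Apply the two cleanup steps listed in the README to extracted text."""
    # Step 1 (from the README): replace newlines with spaces.
    text = text.replace("\r\n", " ").replace("\n", " ")
    # Step 2 (assumption): remove tokens shaped like <0x..>, on the guess
    # that "<0xsdx>" in the README stands for such codes.
    text = re.sub(r"<0x[0-9A-Za-z]+>", "", text)
    # Collapse the runs of spaces left behind by the replacements above.
    return re.sub(r" {2,}", " ", text).strip()
```

The function works on a string, so it could be applied to the contents of a file under `output/` after `main.sh` has run; adjust the pattern once the actual codes in the output are known.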

main.sh

Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+python Scripts/merged.py
+python Scripts/spliter.py
+python Scripts/extract_text.py
+
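
As committed, `main.sh` runs the three scripts one after another and keeps going even if an earlier one fails. The sketch below shows one possible Python equivalent that stops at the first failure. It is not part of this commit; `run_pipeline` and its `runner` parameter are hypothetical names, and the script paths are copied from `main.sh`.

```python
import subprocess
import sys

# Script paths as listed in main.sh, in execution order.
SCRIPTS = [
    "Scripts/merged.py",
    "Scripts/spliter.py",
    "Scripts/extract_text.py",
]


def run_pipeline(scripts=SCRIPTS, runner=subprocess.run):
    """Run each script with the current interpreter, stopping on failure."""
    for script in scripts:
        # check=True makes subprocess.run raise CalledProcessError on a
        # non-zero exit code, so later scripts do not run on stale input.
        runner([sys.executable, script], check=True)
```

Calling `run_pipeline()` from the repository root would mirror `sh main.sh`. The `runner` argument exists only so the ordering can be checked without launching real processes.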

output/Output.txt

Lines changed: 0 additions & 863 deletions
This file was deleted.
-10.1 MB
Binary file not shown.

samples/sample-pdf-file.pdf

-143 KB
Binary file not shown.

samples/sample.pdf

-53.6 KB
Binary file not shown.
