Commit d5a973a

Update README.md
1 parent 3db84cf commit d5a973a

1 file changed: +1 -1 lines changed

README.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ PDFs is good source of data, most of the organization release their data in PDFs
*As you know, PDF processing comes under text analytics.*


-Most of the text analytics libraries or frameworks are designed in Python only; this gives Python leverage in text analytics. One more thing: you can never process a PDF directly in existing frameworks of Machine Learning or Natural Language Processing. Unless they provide an explicit interface for this, **we have to convert the PDF to text first.**
+Most of the text analytics libraries or frameworks are designed in Python only; this gives Python leverage in text analytics. You can never process a PDF directly in existing frameworks of Machine Learning or Natural Language Processing. Unless they provide an explicit interface for this, **we have to convert the PDF to text first.**
## Problematic
Most Python libraries for PDF processing, such as PyPDF2 and pdfminer.six, can handle the text extraction task, but this performance is limited to a sample PDF document.
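
The "convert PDF to text first" step described in the README can be sketched as follows. This is only a minimal illustration, not the repository's own code: it assumes pdfminer.six is installed and uses its `extract_text` helper by default, while the function names `pdf_to_text` and `top_words`, the `extractor` parameter, and the file name `report.pdf` are hypothetical and introduced here for the example.

```python
import re
from collections import Counter
from typing import Callable, List, Optional, Tuple


def pdf_to_text(path: str, extractor: Optional[Callable[[str], str]] = None) -> str:
    """Convert a PDF file to plain text before any text analytics step."""
    if extractor is None:
        # Assumption: pdfminer.six is installed (pip install pdfminer.six).
        from pdfminer.high_level import extract_text
        extractor = extract_text
    return extractor(path)


def top_words(text: str, n: int = 5) -> List[Tuple[str, int]]:
    """Toy text analytics step: the n most frequent lowercase words."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(words).most_common(n)


# Hypothetical usage, assuming a file named report.pdf exists:
#   text = pdf_to_text("report.pdf")
#   print(top_words(text))
```

Keeping the extractor as a parameter makes it easy to swap in another library (for example PyPDF2) when one of them handles a given document poorly.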