This repository was archived by the owner on Jan 16, 2025. It is now read-only.

Commit 66eca1d

Merge pull request #129 from cohere-ai/justin-lee/document-parsing
Add Document Parsing Notebook
2 parents d485cc0 + d96b144 commit 66eca1d

9 files changed: 6569 additions, 0 deletions

notebooks/Document_Parsing_For_Enterprises.ipynb

Lines changed: 1827 additions & 0 deletions
Large diffs are not rendered by default.

notebooks/data/document-parsing/aws-parsed-fda-approved-drug.txt

Lines changed: 838 additions & 0 deletions

Binary file not shown.

notebooks/data/document-parsing/gcp-parsed-fda-approved-drug.txt

Lines changed: 884 additions & 0 deletions

notebooks/data/document-parsing/llamaparse-markdown-parsed-fda-approved-drug.txt

Lines changed: 574 additions & 0 deletions

notebooks/data/document-parsing/llamaparse-text-parsed-fda-approved-drug.txt

Lines changed: 692 additions & 0 deletions

notebooks/data/document-parsing/pytesseract-parsed-fda-approved-drug.txt

Lines changed: 1167 additions & 0 deletions

notebooks/data/document-parsing/results-table.csv

Lines changed: 586 additions & 0 deletions

notebooks/data/document-parsing/unstructured-io-parsed-fda-approved-drug.txt

Lines changed: 1 addition & 0 deletions

0 commit comments
