Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
40 commits
Select commit Hold shift + click to select a range
535e222
Refactor PDFPlumber
pprados Jan 2, 2025
7733591
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 11, 2025
581e015
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 12, 2025
13dad04
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 14, 2025
b72198b
Refactor PDFPlumber
pprados Jan 2, 2025
1437417
Merge remote-tracking branch 'origin/pprados/06-pdfplumber' into ppra…
pprados Feb 24, 2025
60e364f
Merge branch 'langchain-ai:master' into pprados/06-pdfplumber
pprados Feb 24, 2025
2b7ffd6
Refactor PDFPlumber
pprados Jan 2, 2025
cdd366f
Merge remote-tracking branch 'origin/pprados/06-pdfplumber' into ppra…
pprados Feb 24, 2025
f30ac5d
Update key convention strategy in metadata.
pprados Feb 26, 2025
be47099
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 26, 2025
898e2a5
Fix test_parser_with_table
pprados Feb 26, 2025
48ef444
Fix test_parser_with_table
pprados Feb 26, 2025
af4fde3
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 27, 2025
4f9bcf2
Fix test_parser_with_table
pprados Feb 26, 2025
29605cf
Merge remote-tracking branch 'origin/pprados/06-pdfplumber' into ppra…
pprados Feb 27, 2025
2a77d93
Fix test_parser_with_table
pprados Feb 27, 2025
0cf0f70
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 28, 2025
d82925c
Merge branch 'master' into pprados/06-pdfplumber
pprados Feb 28, 2025
bd3a24f
Merge branch 'master' into pprados/06-pdfplumber
pprados Mar 3, 2025
abf2909
Refactor PDFPlumber
pprados Mar 4, 2025
0fd062f
Merge branch 'master' into pprados/06-pdfplumber
pprados Mar 5, 2025
fa47539
Fix revue
pprados Mar 5, 2025
1bc4c91
Remove commented out code
eyurtsev Mar 7, 2025
76b3d6b
Merge legacy and standard metadata keys in pdf parser.
pprados Mar 7, 2025
89903c8
Merge remote-tracking branch 'origin/pprados/06-pdfplumber' into ppra…
pprados Mar 7, 2025
cae829d
Merge remote-tracking branch 'upstream/master' into pprados/06-pdfplu…
pprados Mar 13, 2025
dd909d2
Fix revue
pprados Mar 13, 2025
38b50e3
Fix revue
pprados Mar 13, 2025
09c4c1f
Fix images parser
pprados Mar 26, 2025
e73e5d0
Merge branch 'master' into pprados/06-pdfplumber
pprados Mar 26, 2025
cefe702
Fix dependencies, after this [PR](https://github.com/jsvine/pdfplumbe…
pprados Mar 28, 2025
97fd65b
Fix empty producer
pprados Apr 2, 2025
2ec0aa3
Merge remote-tracking branch 'upstream/master' into pprados/06-pdfplu…
pprados Apr 3, 2025
6b506df
Fix notebooks
pprados Apr 4, 2025
61b68bb
Merge remote-tracking branch 'upstream/master' into pprados/06-pdfplu…
pprados Apr 15, 2025
ada3ba2
Fix notebooks
pprados Apr 15, 2025
028069c
Fix notebooks
pprados Apr 15, 2025
dbb69b6
Merge remote-tracking branch 'upstream/master' into pprados/06-pdfplu…
pprados Apr 29, 2025
f9420e9
Remove type: ignore
pprados Apr 29, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
113 changes: 57 additions & 56 deletions docs/docs/integrations/document_loaders/pdfminer.ipynb

Large diffs are not rendered by default.

1,514 changes: 1,360 additions & 154 deletions docs/docs/integrations/document_loaders/pdfplumber.ipynb

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion docs/docs/integrations/document_loaders/pymupdf.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
"source": [
"# PyMuPDFLoader\n",
"\n",
"This notebook provides a quick overview for getting started with `PyMuPDF` [document loader](https://python.langchain.com/docs/concepts/document_loaders). For detailed documentation of all __ModuleName__Loader features and configurations head to the [API reference](https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.pdf.PyMuPDFLoader.html).\n",
"This sample provides a quick overview for getting started with `PyMuPDF` [document loader](https://python.langchain.com/docs/concepts/document_loaders). For detailed documentation of all PyMuPDFLoader features and configurations head to the [API reference](https://python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.pdf.PyMuPDFLoader.html).\n",
"\n",
" \n",
"\n",
Expand Down
4 changes: 2 additions & 2 deletions libs/community/extended_testing_deps.txt
Original file line number Diff line number Diff line change
Expand Up @@ -59,8 +59,8 @@ openapi-pydantic>=0.3.2,<0.4
oracle-ads>=2.9.1,<3
oracledb>=2.2.0,<3
pandas>=2.0.1,<3
pdfminer-six==20231228
pdfplumber>=0.11
pdfminer-six>=20250324
pdfplumber>=0.11.6
pgvector>=0.1.6,<0.2
playwright>=1.48.0,<2
praw>=7.7.1,<8
Expand Down
Loading
Loading