Skip to content

Commit a6a034e

Browse files
committed
feat: add pdf_bytes_to_text_pages import to enhance PDF processing capabilities
1 parent 302d35f commit a6a034e

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

pdfdol/__init__.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -16,6 +16,7 @@
1616
PdfReader, # just pypdf's PdfReader
1717
PdfFilesReader, # A Mapping giving you a dict-like API to pdf files in a folder (values are lists of text pages).
1818
PdfTextReader, # A Mapping giving you a dict-like API to pdf text in a folder (values are strings (concatenated text pages)).
19+
pdf_bytes_to_text_pages, # A function to convert PDF bytes to a list of text pages
1920
pdf_bytes_to_text, # A function to get the text "equivalent" of pdf bytes
2021
pdf_files_pages_reader_wrap, # A store wrapper to get stores with list of pages as values
2122
pdf_files_text_reader_wrap, # A store wrapper to get stores with aggregated pages text as values

0 commit comments

Comments
 (0)