You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Apr 11, 2025. It is now read-only.
This will export all tables as CSV files at the path specified. Alternatively, you can use ``f='json'``, ``f='excel'``, ``f='html'``, ``f='markdown'`` or ``f='sqlite'``.
83
83
@@ -99,7 +99,7 @@ By default, pypdf_table_extraction only uses the first page of the PDF to extrac
The ``pages`` keyword argument accepts pages as comma-separated string of page numbers. You can also specify page ranges — for example, ``pages=1,4-10,20-30`` or ``pages=1,4-10,20-end``.
105
105
@@ -119,7 +119,7 @@ pypdf_table_extraction supports extracting tables in parrallel using all the ava
119
119
120
120
.. code-block:: console
121
121
122
-
$ camelot --pages all --parallel lattice foo.pdf
122
+
$ pypdf_table_extraction --pages all --parallel lattice foo.pdf
123
123
124
124
.. note:: The reading of the PDF document is parallelized by processing pages by different CPU core.
125
125
Therefore, a document with a low page count could be slower to process in parallel.
@@ -140,7 +140,7 @@ To extract tables from encrypted PDF files you must provide a password when call
pypdf_table_extraction supports PDFs with all encryption types supported by `pypdf`_. This might require installing PyCryptodome. An exception is thrown if the PDF cannot be read. This may be due to no password being provided, an incorrect password, or an unsupported encryption algorithm.
0 commit comments