* version and changelog
* stage test
* version bump
* stage notebook
* pip compile
* add skip support param
* OCD refactor parameters
* add one more todo
* note ticket number
* nvm no need a ticket
* Revert "OCD refactor parameters"
This reverts commit 3d66c6a.
* json looks good need to remove field for csv
* Revert "Revert "OCD refactor parameters""
This reverts commit ddd953b.
* sus notebook output
* okay to have file
* use index
* unit test for param
* readme
* note nit
* no need to add all param in parallel test
* missing dot
* new make pip compile
* Update test_general/api/test_app.py
Co-authored-by: shreyanid <[email protected]>
* pass empty param in readme example
* lets bump to a new version
---------
Co-authored-by: shreyanid <[email protected]>
README.md: 16 additions & 0 deletions
@@ -111,6 +111,22 @@ To extract the table structure from PDF files using the `hi_res` strategy, ensur
 | jq -C . | less -R
 ```

+#### Skip Table Extraction
+
+Currently, we provide support for enabling and disabling table extraction for file types other than PDF files. Set parameter `skip_infer_table_types` to specify the document types that you want to skip table extraction with. By default, we skip table extraction
+for PDFs and Images, which are `pdf`, `jpg` and `png`. Again, please note that table extraction only works with `hi_res` strategy. For example, if you don't want to skip table extraction for images, you can pass an empty value to `skip_infer_table_types` with:
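
The rest of the added README example is not visible in this view. As a rough sketch of what such a request might look like, the script below sends an image with `strategy=hi_res` and an empty `skip_infer_table_types`. The endpoint URL, the input file name, the `extract_tables` helper, and the `[]` encoding of the empty value are all assumptions for illustration, not taken from the diff.

```shell
# Assumed values, not taken from the diff: adjust to your deployment.
API_URL="http://localhost:8000/general/v0/general"   # hypothetical local endpoint
INPUT_FILE="sample-with-table.jpg"                    # hypothetical image containing a table

STRATEGY="hi_res"             # table extraction only works with the hi_res strategy
SKIP_INFER_TABLE_TYPES="[]"   # assumed encoding of an empty value: skip no file types

# Hypothetical helper, wrapped in a function so nothing is sent until it is called.
extract_tables() {
  curl -s -X POST "$API_URL" \
    -H 'accept: application/json' \
    -F "files=@${INPUT_FILE}" \
    -F "strategy=${STRATEGY}" \
    -F "skip_infer_table_types=${SKIP_INFER_TABLE_TYPES}"
}

# Usage: extract_tables | jq -C . | less -R
```

Keeping the parameters in variables makes it easy to switch back to the default behaviour by listing the types to skip instead of passing an empty value.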