Tika OCR Feature Request #7473
captn-hook
started this conversation in
Feature Requests & Suggestions
Replies: 2 comments
-
Apache Tika is very powerful and more lightweight than LLM based OCR, so +1 on this request |
Beta Was this translation helpful? Give feedback.
0 replies
This comment was marked as spam.
This comment was marked as spam.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Tika with Tesseract OCR service.
Summary
Add a locally hostable OCR service. This will allow an entire rag stack to be run without an outside services.
Tika can be loaded as a docker service and requires a simple config to get Tesseract OCR running.
Change Type
New feature (non-breaking change which adds functionality)
Beta Was this translation helpful? Give feedback.
All reactions