File tree Expand file tree Collapse file tree 1 file changed +6
-2
lines changed
Expand file tree Collapse file tree 1 file changed +6
-2
lines changed Original file line number Diff line number Diff line change @@ -7,16 +7,21 @@ The following endpoints are provided by the *documents_extractor*:
77# Requirements
88All required python libraries can be found in the [ pyproject.toml] ( pyproject.toml ) file.
99In addition to python libraries the following system packages are required:
10- ```
10+
11+ ``` shell
1112build-essential
1213make
1314ffmpeg
1415poppler-utils
1516tesseract-ocr
1617tesseract-ocr-deu
1718tesseract-ocr-eng
19+ libleptonica-dev
20+ pkg-config
1821```
1922
23+ The Tesseract data path is set via ` TESSDATA_PREFIX=/usr/share/tesseract-ocr/5/tessdata ` in both prod and dev images.
24+
2025# Endpoints
2126
2227## ` /extract `
@@ -31,4 +36,3 @@ The following types of information will be extracted:
3136A detailed explanation of the deployment can be found in the [ project README] ( ../../README.md ) .
3237The * helm-chart* used for the deployment can be found in the [ infrastructure directory] ( ../../infrastructure/ ) .
3338
34-
You can’t perform that action at this time.
0 commit comments