How to disable OCR and optimize performance in Docling-serve with Docker? #2173
-
Hello everyone, I'm currently using Docling-serve and while Docling is excellent, the processing speed is quite slow for my use case. I would like to optimize Docling-serve by:
I can easily configure these settings in Python code, but I'm not sure how to properly set them up in a Docker deployment for Docling-serve.
Also, I'd like to enable multicore and GPU (not sure if it's needed for plain text though...) to increase processing speed as much as possible. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 8 replies
-
Hi @Cyp9715! I'm Dosu and I’m helping the docling team. To use the DoclingParseV2DocumentBackend engine in Docker, set the environment variable For optimal performance, keep these settings in your Docker command:
Your Docker command would look like: docker run --name docling-serve \
--network=ainet \
--gpus all \
--restart unless-stopped \
-d -p 5001:5001 \
-e DOCLING_SERVE_ENGINE=DoclingParseV2DocumentBackend \
-e DOCLING_SERVE_MAX_SYNC_WAIT=1200 \
-e DOCLING_SERVE_MAX_DOCUMENT_TIMEOUT=1200 \
-e DOCLING_SERVE_ENG_LOC_NUM_WORKERS=2 \
-e OMP_NUM_THREADS=4 \
-e MKL_NUM_THREADS=4 \
quay.io/docling-project/docling-serve-cu128 If you want to further tune performance, you can increase the worker and thread counts to match your hardware. Disabling enrichment steps and image generation (if not needed) can also help (source). For most plain text workflows, CPU multicore scaling is the main lever for speed. To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
-
@dosu
|
Beta Was this translation helpful? Give feedback.
I have opened an issue open-webui/open-webui#17148