Replies: 2 comments 1 reply
-
Hi @inbfor! I'm Dosu and I'm helping the docling team. High memory usage after recognition in docling-serve with EasyOCR is not expected and is usually a sign that memory cleanup isn't happening after each job. The code does not currently perform explicit memory cleanup: there are no calls to `torch.cuda.empty_cache()` after a job finishes. To fix this, add GPU memory cleanup in the `finally` block of the job handler:

```python
import torch
# ... inside the finally block:
torch.cuda.empty_cache()
```

If you create job-specific models or tensors, explicitly delete them with `del` before clearing the cache. For large documents, consider processing in smaller chunks.

To reply, just mention @dosu.
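Putting those pieces together, a cleanup helper might look like the sketch below. This is illustrative only: `release_gpu_memory` and `handle_job` are hypothetical names, not part of the docling-serve API, and the helper degrades gracefully when torch (or a CUDA device) is unavailable.

```python
import gc

# torch is optional here: CPU-only deployments may not have it installed.
try:
    import torch
except ImportError:
    torch = None

def release_gpu_memory() -> bool:
    """Run a GC pass, then return cached CUDA blocks to the driver.

    Returns True only when the CUDA cache was actually cleared.
    """
    gc.collect()
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()
        return True
    return False

# Usage inside a per-job handler (handler/pipeline names are hypothetical):
def handle_job(run_pipeline, document):
    result = None
    try:
        result = run_pipeline(document)
        return result
    finally:
        del result            # drop the per-job reference first
        release_gpu_memory()  # then reclaim cached GPU memory
```

Calling `gc.collect()` before `empty_cache()` matters: cached blocks can only be returned to the driver once Python has dropped its references to the tensors that occupied them.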
-
Are you comparing with the CLI usage of Docling? Could you please post which version of Docling you are comparing against and which version of Docling Serve you are running? The latest releases (from yesterday) just fixed a memory issue in docling-parse-v4, which is the default in Docling Serve but not in the Docling CLI.
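For reference, the installed versions can be printed with a stdlib-only loop; the package names below are assumed to match the pip distribution names:

```shell
# Print installed versions of the relevant packages; prints "not installed"
# for any that are absent instead of failing.
for pkg in docling docling-serve; do
  python3 -c "import importlib.metadata as m; print('$pkg', m.version('$pkg'))" \
    2>/dev/null || echo "$pkg: not installed"
done
```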
-
For some reason, memory usage doesn't go back down after document recognition, and because of this we run into OOM issues. Is this expected, and if not, what can we do? As an OCR engine we use EasyOCR.
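(One way to make this concrete when reporting it is to log the process RSS after each job. A minimal, Linux-only sketch with no third-party dependencies; the function name is illustrative:)

```python
# Linux-only: read the current resident set size from /proc, so per-job
# memory growth can be logged without extra dependencies.
def current_rss_mib() -> float:
    """Current RSS of this process in MiB (VmRSS is reported in KiB)."""
    with open("/proc/self/status") as f:
        for line in f:
            if line.startswith("VmRSS:"):
                return int(line.split()[1]) / 1024.0
    return 0.0

# e.g. after every conversion job:
# print(f"RSS after job: {current_rss_mib():.1f} MiB")
```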