Add cancellation support to IndicesRequestCache #141708

drempapis wants to merge 29 commits into elastic:main
Conversation
Pinging @elastic/es-search-foundations (Team:Search Foundations)
```java
 * @throws TaskCancelledException if the operation was cancelled
 */
private static <T> T blockOnFuture(CompletableFuture<T> future, Consumer<Runnable> cancellationRegistrar) throws ExecutionException,
    InterruptedException {
```
The Javadoc mentions `TaskCancelledException` being thrown, but the method signature doesn't declare it.
That's true, removed.
```java
private void cleanupFailedFuture(CacheSegment segment, K key, CompletableFuture<Entry<K, V>> future) {
    segment.writeLock.lock();
    try {
        if (segment.map != null && segment.map.get(key) == future) {
```
I think that the `segment.map.get(key) != future` case deserves some handling. It's not expected, but maybe at least a log.
I added a debug log for when the key maps to a different future: `Skipped cleanup for key [] because the future was replaced`.
Do you think that is enough, or do you suggest doing something else here?
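As a rough illustration of the behavior discussed above, here is a hypothetical, simplified version of the cleanup. This is not the PR's code: `FailedFutureCleanup` is an invented class, it replaces the segment's write lock with `ConcurrentMap`'s atomic `remove(key, value)`, and it prints instead of using a debug logger.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical sketch: remove a failed future from the map only if the key
// still maps to that exact future, and report when it was already replaced.
public class FailedFutureCleanup<K, V> {
    final ConcurrentMap<K, CompletableFuture<V>> map = new ConcurrentHashMap<>();

    /** Returns true if the failed future was removed, false if it had been replaced. */
    public boolean cleanupFailedFuture(K key, CompletableFuture<V> future) {
        // remove(key, value) is atomic: it only removes if key still maps to future
        boolean removed = map.remove(key, future);
        if (removed == false) {
            // stand-in for a debug log statement
            System.out.println("Skipped cleanup for key [" + key + "] because the future was replaced");
        }
        return removed;
    }
}
```

The atomic compare-and-remove plays the same role as checking `segment.map.get(key) == future` under the write lock: a concurrent writer that replaced the entry is never clobbered by a stale cleanup.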
```java
if (cancellationRegistrar != null) {
    cancellationRegistrar.accept(() -> {
        cancelled.set(true);
        latch.countDown();
```
I'm a bit worried about this if. It assumes that a non-null cancellationRegistrar is passed whenever a task can be cancelled, but doesn't enforce it. If a task is cancelled for a future when cancellationRegistrar is null, this will be a deadlock.
A safer way to do this would be to call latch.countDown() regardless of the value of cancellationRegistrar.
The code is structured as follows:

```java
future.whenComplete((value, throwable) -> {
    if (throwable != null) {
        error.set(throwable);
    } else {
        result.set(value);
    }
    latch.countDown();
});
if (cancellationRegistrar != null) {
    cancellationRegistrar.accept(() -> {
        cancelled.set(true);
        latch.countDown();
    });
}
```
The whenComplete callback, which always calls latch.countDown(), is registered before the cancellationRegistrar check, so the latch is always released when the future completes, regardless of whether a registrar is provided.
When cancellationRegistrar == null, the waiting thread cannot exit early if its task is cancelled; it remains blocked until the computation finishes. This is by design, not a deadlock. Adding latch.countDown() unconditionally would count the latch down immediately, causing latch.await() to return instantly with no result.
I've added a test to prove that when cancellationRegistrar is null, the thread is not deadlocked; it simply cannot exit early and must wait for the future to complete.
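For reference, the pattern above can be sketched as a standalone method. This is an illustrative reconstruction, not the PR's actual implementation: `BlockOnFutureSketch` is a hypothetical class, and `CancellationException` stands in for Elasticsearch's `TaskCancelledException`.

```java
import java.util.concurrent.CancellationException;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

public class BlockOnFutureSketch {

    /** Blocks until the future completes, or until the registered cancellation runnable fires. */
    public static <T> T blockOnFuture(CompletableFuture<T> future, Consumer<Runnable> cancellationRegistrar)
        throws ExecutionException, InterruptedException {
        CountDownLatch latch = new CountDownLatch(1);
        AtomicBoolean cancelled = new AtomicBoolean(false);
        AtomicReference<T> result = new AtomicReference<>();
        AtomicReference<Throwable> error = new AtomicReference<>();
        // Registered unconditionally: the latch is always released when the future completes.
        future.whenComplete((value, throwable) -> {
            if (throwable != null) {
                error.set(throwable);
            } else {
                result.set(value);
            }
            latch.countDown();
        });
        // Optional early exit: only wired up when a registrar is provided.
        if (cancellationRegistrar != null) {
            cancellationRegistrar.accept(() -> {
                cancelled.set(true);
                latch.countDown();
            });
        }
        latch.await();
        if (cancelled.get() && future.isDone() == false) {
            // stand-in for TaskCancelledException in this sketch
            throw new CancellationException("task cancelled while waiting for cached result");
        }
        if (error.get() != null) {
            throw new ExecutionException(error.get());
        }
        return result.get();
    }
}
```

Calling it with a null registrar on a pending future simply blocks until the future completes; passing a registrar lets an external cancellation hook release the latch early.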
```java
 * @throws InterruptedException if the thread was interrupted
 * @throws TaskCancelledException if the operation was cancelled
 */
private static <T> T blockOnFuture(CompletableFuture<T> future, Consumer<Runnable> cancellationRegistrar) throws ExecutionException,
```
Because this is a static method and has locking, I think it would be nice to test it in CacheTests.java.
+1, added more tests
Related GitHub issue: #108703

The problem

When expensive queries fill up the search thread pool, threads can become blocked in IndicesRequestCache.getOrCompute waiting for other threads to compute cached results. If those queries are cancelled, the waiting threads don't react to the cancellation and continue blocking indefinitely. This can exhaust the search thread pool, requiring node restarts to recover.

This PR adds cancellation support to the cache's blocking operations, allowing waiting threads to be notified when their task is cancelled. However, it does not prevent the search pool from filling with blocking tasks. A follow-up PR will change the cache to use SubscribableListener for a complete async solution; I haven't worked on that here, to keep the code updates small and discrete.
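To make the failure mode concrete, here is a small standalone demo. It is an assumption-laden simplification, not Elasticsearch code: task cancellation is modeled as a plain flag the task is expected to poll, so a thread parked in `future.get()` never observes it and stays blocked until the shared computation completes.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicBoolean;

public class CancellationFlagDemo {

    /** Cancels the waiter's task via the flag, then reports the waiter's thread state. */
    public static Thread.State parkedStateAfterCancel() throws Exception {
        AtomicBoolean taskCancelled = new AtomicBoolean(false); // flag-style cancellation model
        CompletableFuture<String> shared = new CompletableFuture<>();

        Thread waiter = new Thread(() -> {
            try {
                shared.get(); // parked here; never polls taskCancelled
            } catch (Exception e) {
                // ignored for the demo
            }
        });
        waiter.start();
        taskCancelled.set(true); // "cancel" the task: only a flag is flipped
        Thread.sleep(200);       // give the waiter time to park
        Thread.State state = waiter.getState(); // still WAITING despite cancellation
        shared.complete("done"); // only the computation finishing frees the waiter
        waiter.join();
        return state;
    }

    public static void main(String[] args) throws Exception {
        System.out.println(parkedStateAfterCancel());
    }
}
```

With the PR's cancellation hook, the equivalent waiter would be handed a runnable that releases its latch when the task is cancelled, letting it exit early instead of occupying a search thread.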