Rework cancellation test for batched query execution #133579

benchaplin · 2025-08-26T19:05:21Z

Before this PR: this test makes a search request with allow_partial_search_results=false then manipulates the execution:

Allows one shard to proceed, and blocks the others
Throws an exception during query execution of that shard
Expects the search to be cancelled (because there are no replicas to retry)

Batched query execution handles cancellations in a slightly different manner. Instead of retrying (or failing if there are no replicas) immediately, a batched query must wait for all shards in the batch to complete. Only then does the data node send the batch result to the coordinating node, which handles any shard failures. The old test therefore doesn't work with batched execution: if only one shard is allowed to proceed and it is part of a batched request, the batch never completes.
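To make the difference concrete, here is a minimal, self-contained sketch that models each shard as a `CompletableFuture`. It is not Elasticsearch code: `BatchedFailureSketch`, `perShardFailureSignal`, and `batchedFailureSignal` are hypothetical names, and the model only assumes what is described above (per-shard failures surface immediately, batched failures surface once the whole batch is done).

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;

public class BatchedFailureSketch {

    // Per-shard mode (assumed model): the coordinator learns of a failure
    // as soon as any single shard fails.
    static CompletableFuture<Void> perShardFailureSignal(List<CompletableFuture<String>> shards) {
        CompletableFuture<Void> signal = new CompletableFuture<>();
        for (CompletableFuture<String> shard : shards) {
            shard.whenComplete((result, error) -> {
                if (error != null) {
                    signal.complete(null);
                }
            });
        }
        return signal;
    }

    // Batched mode (assumed model): the data node responds only once every
    // shard in the batch has completed, successfully or not.
    static CompletableFuture<Void> batchedFailureSignal(List<CompletableFuture<String>> shards) {
        CompletableFuture<Void> signal = new CompletableFuture<>();
        CompletableFuture.allOf(shards.toArray(new CompletableFuture<?>[0]))
            .whenComplete((ignored, error) -> {
                if (error != null) {
                    signal.complete(null);
                }
            });
        return signal;
    }

    public static void main(String[] args) {
        CompletableFuture<String> failingShard = new CompletableFuture<>();
        CompletableFuture<String> blockedShard = new CompletableFuture<>();
        List<CompletableFuture<String>> batch = List.of(failingShard, blockedShard);

        CompletableFuture<Void> perShard = perShardFailureSignal(batch);
        CompletableFuture<Void> batched = batchedFailureSignal(batch);

        // One shard fails while the other stays blocked, as in the old test.
        failingShard.completeExceptionally(new IllegalStateException("simulated query failure"));
        System.out.println("per-shard failure seen: " + perShard.isDone()); // true
        System.out.println("batched failure seen:   " + batched.isDone());  // false

        // Only after the blocked shard is released does the batch report the failure.
        blockedShard.complete("ok");
        System.out.println("batched failure seen:   " + batched.isDone());  // true
    }
}
```

In this model, the old test corresponds to never completing `blockedShard`, so the batched signal never fires.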

After this PR: this test makes a search request with allow_partial_search_results=false then manipulates the execution:

Allows one node to proceed, and blocks the others
Throws an exception during query execution of one of the shards on that node
Expects the search to be cancelled (because there are no replicas to retry)
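The node-level rule above could be sketched roughly as follows. This is an illustration only, not the code in this PR: `NodeGateSketch`, `mayProceed`, and `shouldFail` are hypothetical names, and the choice of "first node to arrive is the allowed node" is an assumption made to keep the example small.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;

public class NodeGateSketch {

    // The single node whose shards are allowed to run; null until one is chosen.
    private final AtomicReference<String> allowedNode = new AtomicReference<>();

    // Ensures the simulated exception is thrown for exactly one shard.
    private final AtomicBoolean failureInjected = new AtomicBoolean(false);

    // Assumption: the first node to reach the hook becomes the allowed node.
    // Shards on every other node are expected to block.
    public boolean mayProceed(String nodeId) {
        allowedNode.compareAndSet(null, nodeId);
        return nodeId.equals(allowedNode.get());
    }

    // Returns true for exactly one shard on the allowed node, so the rest of
    // that node's batch can still complete and report the failure.
    public boolean shouldFail(String nodeId) {
        return nodeId.equals(allowedNode.get()) && failureInjected.compareAndSet(false, true);
    }

    public static void main(String[] args) {
        NodeGateSketch gate = new NodeGateSketch();
        System.out.println(gate.mayProceed("node-a")); // true: first node is allowed
        System.out.println(gate.mayProceed("node-b")); // false: other nodes block
        System.out.println(gate.shouldFail("node-a")); // true: one shard fails
        System.out.println(gate.shouldFail("node-a")); // false: remaining shards succeed
    }
}
```

Gating on the node rather than on a single shard is what lets a batched request finish: every shard in the allowed node's batch runs, one of them fails, and the batch result can carry that failure back.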

elasticsearchmachine · 2025-08-26T19:05:45Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

benchaplin · 2025-08-26T19:07:54Z

test/framework/src/main/java/org/elasticsearch/test/AbstractSearchCancellationTestCase.java

-                public void onNewReaderContext(ReaderContext c) {
-                    if (runOnNewReaderContext.get() != null) {
-                        runOnNewReaderContext.get().accept(c);
+                public void onPreQueryPhase(SearchContext c) {


I made this change because SearchContext gave me easier access to the node ID.

Both the onNewReaderContext and onPreQueryPhase hooks run before query execution begins, so either will do the job for this test (the only user of SearchShardBlockingPlugin).

javanna

LGTM. I am only wondering whether we should keep the old test and still run it without batched execution. Maybe that's overkill; up to you, @benchaplin.

benchaplin · 2025-08-28T18:10:16Z

Good shout. I suppose the batched setting will be available for users to disable, so I think we should keep the old test. I've randomized the setting.

elasticsearchmachine · 2025-08-28T21:18:36Z

💚 Backport successful

Status	Branch	Result
✅	9.1

Rework cancellation test for batched query execution

80f534d

javanna approved these changes Aug 27, 2025

Randomize the batched query setting

15c7b8f

Merge branch 'main' into fix_batched_search_cancellation_it

176c91d

benchaplin merged commit 1023131 into elastic:main Aug 28, 2025
33 checks passed

benchaplin mentioned this pull request Aug 28, 2025

[9.1] Rework cancellation test for batched query execution (#133579) #133764

Merged

benchaplin added a commit to benchaplin/elasticsearch that referenced this pull request Aug 28, 2025

Rework cancellation test for batched query execution (elastic#133579)

53be69e

benchaplin mentioned this pull request Aug 28, 2025

[Meta] Batched Query Phase Follow-up Tasks #125788

Open

6 tasks

elasticsearchmachine pushed a commit that referenced this pull request Aug 28, 2025

Rework cancellation test for batched query execution (#133579) (#133764)

b983583

JeremyDahlgren pushed a commit to JeremyDahlgren/elasticsearch that referenced this pull request Aug 29, 2025

Rework cancellation test for batched query execution (elastic#133579)

3346509
