Fix SearchErrorTraceIT and friends to work with batched query execution #132227

benchaplin · 2025-07-30T22:48:05Z

CI is borked in #127150 so creating a duplicate PR here.

Making this work with batched execution and fixing a memory leak: * Fix memory leak by removing listener on first message. There really only is a single message here per node anyway with batched execution in the mix. Either it's a single shard on the data node and we get a single query message or it's multiple shards and we get a single batched message, so fine to remove listener after the first message since all tests do a single request only anyway. * Add a new hook that allows inspection of the actual response. This is needed for batched since batched sends a non-error response even if the data node failed all searches. We had this before in the `onResponseSent` hook but checking the instance after it's been sent over the wire causes needless overhead in the production code so moving to a "before-style" hook here.

elasticsearchmachine · 2025-07-30T22:48:30Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

javanna · 2025-08-12T14:15:27Z

server/src/main/java/org/elasticsearch/transport/TransportMessageListener.java

+     * @param action the request action
+     * @param response response instance
+     */
+    default void onBeforeResponseSent(long requestId, String action, TransportResponse response) {}


I am not sure I follow the issue in the first place, as well as the proposed fix. The proposal is to add a new default method to TransportMessageListener, to then only ever use it in a test? Could you help me better understand what other options we have?

Async/SearchErrorTraceIT forces an error on all shards during a search. Before batched execution, this would trigger onResponseSent(long requestId, String action, Exception error) - called for every failed action response. We could then implement onResponseSent and inspect the exception to assert things (like whether the stack trace was present).

Batched execution sends a non-error response even if all searches fail. We still want to inspect this response to assert the same things that were once captured in the exception. But TransportMessageListener has no hook to inspect non-error payloads, so this PR introduces one.

hey Ben, sorry for the delay. I understand the problem better. I think this is a bit of an intrusive fix though. Isn't there another way to intercept responses in this specific test as opposed to using transport message listener? You may want to reach out to the distrib team and ask for advice on this.

@javanna I've reworked the tests using a nice suggestion from @DaveCTurner - thanks for pushing for a better solution!

javanna

LGTM! good work!

original-brownbear and others added 6 commits April 22, 2025 15:45

Merge branch 'main' into fixup-search-error-trace-it

ce6435c

Convert BytesTransportResponse

b185eb1

Merge branch 'main' into fixup-search-error-trace-it

7684209

Merge branch 'main' into fixup-search-error-trace-it

925d730

Merge branch 'main' into fixup-search-error-trace-it

2ac4e98

benchaplin added >test Issues or PRs that are addressing/adding tests Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch :Search Foundations/Search Catch all for Search Foundations v9.2.0 labels Jul 30, 2025

Merge branch 'main' into fixup-search-error-trace-it

ebca2bb

benchaplin mentioned this pull request Jul 31, 2025

Fix SearchErrorTraceIT and friends to work with batched query execution #127150

Closed

Merge branch 'main' into fixup-search-error-trace-it

a36a3ca

javanna reviewed Aug 12, 2025

View reviewed changes

benchaplin mentioned this pull request Aug 12, 2025

[Meta] Batched Query Phase Follow-up Tasks #125788

Closed

6 tasks

benchaplin requested a review from javanna August 26, 2025 20:09

benchaplin added 3 commits September 2, 2025 15:04

Adjust approach for intercepting transport responses

e647f43

Merge branch 'main' into fixup-search-error-trace-it

c9d625f

Merge branch 'main' into fixup-search-error-trace-it

9e5cfe8

javanna approved these changes Sep 3, 2025

View reviewed changes

benchaplin added 5 commits September 3, 2025 15:38

Merge branch 'main' into fixup-search-error-trace-it

215b030

Merge branch 'main' into fixup-search-error-trace-it

512fce1

Merge branch 'main' into fixup-search-error-trace-it

3e68c7b

Merge branch 'main' into fixup-search-error-trace-it

b0791f1

Merge branch 'main' into fixup-search-error-trace-it

f19d164

benchaplin merged commit 9eadfac into elastic:main Sep 4, 2025
33 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix SearchErrorTraceIT and friends to work with batched query execution #132227

Fix SearchErrorTraceIT and friends to work with batched query execution #132227

Uh oh!

benchaplin commented Jul 30, 2025

Uh oh!

elasticsearchmachine commented Jul 30, 2025

Uh oh!

javanna Aug 12, 2025

Uh oh!

benchaplin Aug 13, 2025 •

edited

Loading

Uh oh!

javanna Aug 27, 2025

Uh oh!

benchaplin Sep 2, 2025

Uh oh!

javanna left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Fix SearchErrorTraceIT and friends to work with batched query execution #132227

Fix SearchErrorTraceIT and friends to work with batched query execution #132227

Uh oh!

Conversation

benchaplin commented Jul 30, 2025

Uh oh!

elasticsearchmachine commented Jul 30, 2025

Uh oh!

javanna Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

benchaplin Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

javanna Aug 27, 2025

Choose a reason for hiding this comment

Uh oh!

benchaplin Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

javanna left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

benchaplin Aug 13, 2025 •

edited

Loading