Avoid expensive status update in LuceneOperator #134079

dnhatn · 2025-09-03T18:18:10Z

With doc_partitioning targeting large shards, updating the status of LuceneOperator can be expensive with many slices. This change uses the keys of the partitioning map instead of iterating over all slices in the queue.
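To illustrate the cost difference described above, here is a minimal sketch. It is not the actual Elasticsearch code: `ShardListSketch`, `shardsFromSlices`, and `shardsFromKeys` are hypothetical names, the slice queue is modeled as a plain list with one shard identifier per slice, and the sketch ignores the distinction between processed and remaining shards.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeSet;
import java.util.stream.Collectors;

// Illustrative sketch only: hypothetical stand-ins, not the real
// LuceneOperator or LuceneSliceQueue classes.
final class ShardListSketch {

    // Slice-scanning approach: the input has one entry per queued slice, so
    // the work grows with the number of slices, even if they share few shards.
    static String shardsFromSlices(List<String> shardIdOfEachSlice) {
        TreeSet<String> distinct = new TreeSet<>(shardIdOfEachSlice);
        return distinct.stream().collect(Collectors.joining(",", "[", "]"));
    }

    // Key-set approach: the map is assumed to hold one entry per shard, so
    // the work grows with the number of shards rather than slices.
    static String shardsFromKeys(Map<String, String> partitioningStrategies) {
        return partitioningStrategies.keySet().stream().sorted().collect(Collectors.joining(",", "[", "]"));
    }

    public static void main(String[] args) {
        // Five slices spread over two shards (made-up identifiers).
        List<String> slices = List.of("index:0", "index:0", "index:0", "index:1", "index:1");
        Map<String, String> strategies = Map.of("index:1", "DOC", "index:0", "DOC");
        System.out.println(shardsFromSlices(slices));
        System.out.println(shardsFromKeys(strategies));
    }
}
```

Both calls produce the same sorted shard list for this input; the second never touches the individual slices.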

dnhatn · 2025-09-03T19:34:02Z

elasticsearchmachine · 2025-09-03T19:34:41Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

martijnvg

LGTM 👍

nik9000 · 2025-09-03T19:53:59Z

x-pack/plugin/esql/compute/src/main/java/org/elasticsearch/compute/lucene/LuceneOperator.java

        sb.append(this.getClass().getSimpleName()).append("[");
-        sb.append("shards = ").append(sortedUnion(processedShards, sliceQueue.remainingShardsIdentifiers()));
+        sb.append("shards = ")
+            .append(sliceQueue.partitioningStrategies().keySet().stream().sorted().collect(Collectors.joining(",", "[", "]")));


I think it was a mistake to stuff status-like things into toString. We have this information in the status already as a Map. Maybe we should just do getClass().getSimpleName() and call it good. If map_page_size isn't in the status we can put it there. This was all useful in a time before we had .status(). But we have it now and we don't need all the string stuff.

++ I can look into this later.
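One way the suggestion above could look, as a rough sketch rather than the project's actual design: `OperatorStatusSketch` and its fields are hypothetical, and the `map_page_size` key is simply the name used in the comment. The structured details live in a map returned by `status()`, and `toString()` is reduced to the simple class name.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: a hypothetical operator, not the real LuceneOperator.
final class OperatorStatusSketch {
    private final List<String> shards; // hypothetical field
    private final int pageSize;        // hypothetical field

    OperatorStatusSketch(List<String> shards, int pageSize) {
        this.shards = List.copyOf(shards);
        this.pageSize = pageSize;
    }

    // Structured status: callers read fields from the map instead of parsing a string.
    Map<String, Object> status() {
        Map<String, Object> status = new LinkedHashMap<>();
        status.put("shards", shards);
        status.put("map_page_size", pageSize);
        return status;
    }

    // toString stays cheap: no iteration over shards or slices.
    @Override
    public String toString() {
        return getClass().getSimpleName();
    }

    public static void main(String[] args) {
        OperatorStatusSketch op = new OperatorStatusSketch(List.of("index:0", "index:1"), 1024);
        System.out.println(op);
        System.out.println(op.status());
    }
}
```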

dnhatn · 2025-09-03T20:34:33Z

@martijnvg @nik9000 Thanks!

Avoid expensive status update in LuceneOperator

b728e01

elasticsearchmachine added the v9.2.0 label Sep 3, 2025

dnhatn added the >non-issue and :Analytics/ES|QL labels Sep 3, 2025

dnhatn requested review from idegtiarenko and martijnvg September 3, 2025 19:34

dnhatn marked this pull request as ready for review September 3, 2025 19:34

elasticsearchmachine added the Team:Analytics label Sep 3, 2025

dnhatn requested a review from nik9000 September 3, 2025 19:36

martijnvg approved these changes Sep 3, 2025

nik9000 approved these changes Sep 3, 2025

dnhatn merged commit c007e8c into elastic:main Sep 3, 2025
33 checks passed

dnhatn deleted the lucene-status branch September 3, 2025 20:34
