Conversation

@nik9000 (Member) commented Sep 30, 2025

Skip filling in the topn values unless the row is competitive. This cuts the runtime of topn pretty significantly. That's important when topn is dominating the runtime, like we see when querying many, many indices at once.
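As a rough illustration of the idea (a toy sketch using a plain java.util.PriorityQueue of sort keys, not the actual TopNOperator classes; all names here are made up), we only pay to materialize a row's values when it beats the current worst entry in the heap:

```java
import java.util.PriorityQueue;

public class CompetitiveTopN {
    // Counts how often we had to "fill values" while keeping the top n keys.
    // In the real operator, filling values is the expensive part that this
    // change skips for rows that can't make it into the heap.
    static int fillCount(int[] sortKeys, int n, PriorityQueue<Integer> heap) {
        int fills = 0;
        for (int key : sortKeys) {
            if (heap.size() < n) {
                heap.add(key);  // heap not full yet: every row is competitive
                fills++;
            } else if (heap.peek() < key) {
                heap.poll();    // evict the current worst row
                heap.add(key);
                fills++;        // competitive: fill in the values now
            }
            // otherwise the row loses to everything in the heap: skip the fill
        }
        return fills;
    }

    public static void main(String[] args) {
        PriorityQueue<Integer> heap = new PriorityQueue<>();
        int fills = fillCount(new int[] {5, 1, 9, 2, 8, 3, 7}, 3, heap);
        System.out.println(fills + " fills for 7 rows");
    }
}
```

With 7 rows and a top 3, only 6 rows ever get their values filled here; with many indices feeding mostly non-competitive rows, the skipped fills add up.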

We can emulate that a little locally with something like:

```
rm -f /tmp/fields
for field in {1..500}; do
    echo -n ',"f'$field'": "foo"' >> /tmp/fields
done

for idx in {1..100}; do
    curl -uelastic:password -XDELETE localhost:9200/test$idx

    echo '{
        "settings": {
            "index.mapping.total_fields.limit": 10000
        },
        "mappings": {
            "properties": {
                "@timestamp": { "type": "date" }
    ' > /tmp/idx
    for field in {1..500}; do
        echo ',"f'$field'": { "type": "keyword" }' >> /tmp/idx
    done
    echo '
                }
        }
    }' >> /tmp/idx
    curl -uelastic:password -XPUT -HContent-Type:application/json localhost:9200/test$idx --data @/tmp/idx

    rm -f /tmp/bulk
    for doc in {1..1000}; do
        echo '{"index":{}}' >> /tmp/bulk
        echo -n '{"@timestamp": '$(($idx * 10000 + $doc)) >> /tmp/bulk
        cat /tmp/fields >> /tmp/bulk
        echo '}' >> /tmp/bulk
    done
    echo
    curl -s -uelastic:password -XPOST -HContent-Type:application/json "localhost:9200/test$idx/_bulk?refresh&pretty" --data-binary @/tmp/bulk | tee /tmp/bulk_result | grep error
    echo
done

while true; do
    curl -s -uelastic:password -XPOST -HContent-Type:application/json 'localhost:9200/_query?pretty' -d'{
        "query": "FROM *",
        "pragma": {
            "max_concurrent_shards_per_node": 100
        }
    }' | jq .took

    curl -s -uelastic:password -XPOST -HContent-Type:application/json 'localhost:9200/_query?pretty' -d'{
        "query": "FROM * | SORT @timestamp DESC",
        "pragma": {
            "max_concurrent_shards_per_node": 100
        }
    }' | jq .took
done
```

Locally this only spends about 12.6% of its time on topn and takes 2.7 seconds. With this fix we spend 3.6% of our time on topn and take 2.5 seconds. That's not a huge improvement overall. 7% is nothing to sneeze at, but it's not great. The topn itself, though, drops from 340 millis to 90 millis.

But in some summary clusters I'm seeing 65% of time spent on topn for queries taking 3 seconds. My kind-of-bad math says this improvement should drop such a query to about 1.6 seconds. Let's hope!
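The back-of-envelope math checks out. A quick sanity check, using only the numbers from the text above (3 seconds total, 65% in topn, and the local 340 ms → 90 ms topn improvement as the scaling factor):

```java
public class EstimateCheck {
    // Time left after scaling the topn portion of the query by the observed
    // local speedup ratio, keeping the non-topn portion unchanged.
    static double estimate(double totalSeconds, double topnShare, double topnRatio) {
        double topn = totalSeconds * topnShare;          // ~1.95 s in topn today
        return (totalSeconds - topn) + topn * topnRatio; // rest + faster topn
    }

    public static void main(String[] args) {
        double after = estimate(3.0, 0.65, 90.0 / 340.0);
        System.out.println(Math.round(after * 100) / 100.0); // rounds to ~1.57
    }
}
```

That lands at roughly 1.57 seconds, which matches the ~1.6 second estimate.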

Hopefully our nightlies will see this and prove my math right.


@elasticsearchmachine (Collaborator)

Pinging @elastic/es-analytical-engine (Team:Analytics)

@elasticsearchmachine added the Team:Analytics label (meta label for the analytical engine team: ESQL/Aggs/Geo) on Sep 30, 2025
@elasticsearchmachine (Collaborator)

Hi @nik9000, I've created a changelog YAML for you.

```
spareValuesPreAllocSize = Math.max(spare.values.length(), spareValuesPreAllocSize / 2);
inputQueue.updateTop(spare);
spare = nextSpare;
}
```
Member Author
I could have used the slightly shorter:

```
Row inserted = spare;
spare = inputQueue.insertWithOverflow(spare);
if (inserted != spare) {
  rowFiller.writeValues(i, spare);
  spareValuesPreAllocSize = Math.max(spare.values.length(), spareValuesPreAllocSize / 2);
}
```

but this feels less magic. And this is the hot path, so I prefer seeing the guts a little bit.

Also, it cries out for a further optimization where we bail from the loop as soon as inputQueue.size() < inputQueue.topCount and then make another loop with inputQueue.lessThan(inputQueue.top(), spare).
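That two-loop shape could look something like this toy version (again a plain PriorityQueue of sort keys standing in for the real inputQueue; every name here is made up, and it assumes n > 0):

```java
import java.util.PriorityQueue;

public class TwoPhaseTopN {
    // Phase 1: the heap isn't full yet, so every row is inserted with no
    // competitiveness check at all. Phase 2: the heap is full, so compare the
    // cheap sort key against the current worst entry before doing any work.
    static int fillCount(int[] sortKeys, int n, PriorityQueue<Integer> heap) {
        int fills = 0;
        int i = 0;
        // First loop: fill phase, no check needed.
        for (; i < sortKeys.length && heap.size() < n; i++) {
            heap.add(sortKeys[i]);
            fills++;
        }
        // Second loop: steady state, only competitive rows pay for the fill.
        for (; i < sortKeys.length; i++) {
            if (heap.peek() < sortKeys[i]) {
                heap.poll();
                heap.add(sortKeys[i]);
                fills++;
            }
        }
        return fills;
    }

    public static void main(String[] args) {
        PriorityQueue<Integer> heap = new PriorityQueue<>();
        System.out.println(fillCount(new int[] {5, 1, 9, 2, 8, 3, 7}, 3, heap));
    }
}
```

Splitting the phases removes the size check from the hot second loop; whether that's worth the extra code is exactly the trade-off the comment raises.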

Contributor
I think the shorter one has the definite advantage of making the common parts (the code inside the if) more obvious. Perhaps just extract Math.max(spare.values.length(), spareValuesPreAllocSize / 2) to a helper function?

@ivancea (Contributor) left a comment

LGTM!

```
// 1 is for the min-heap itself
assertThat(breaker.getMemoryRequestCount(), is(106L));
// could be less than because we don't always insert
assertThat(breaker.getMemoryRequestCount(), lessThanOrEqualTo(106L));
```
Contributor

We're making a performance improvement and, at the same time, making the tests more lax, with no extra cases (no functional change + less/laxer testing == ⚠️).
Should we add a more specific case for the expected usage? Maybe something less randomized (or not randomized at all), or try to calculate the usage in this test (which feels a bit too intricate).
Just a gut feeling; consider it a nitpick.

Member Author

Let me double check. I was getting 105 sometimes and 106; maybe I should just assert it's either 105 or 106 in that case. I'd assumed it was because we don't insert every time, but the input isn't randomized so I'm not entirely sure why. Checking.

```
spareValuesPreAllocSize = Math.max(spare.values.length(), spareValuesPreAllocSize / 2);

spare = inputQueue.insertWithOverflow(spare);
if (inputQueue.size() < inputQueue.topCount) {
```
Contributor

Nit: maybe worth commenting here that this is an insertWithOverflow() that skips some work if the value is not competitive.

Member Author

👍

@dnhatn (Member) left a comment

Great find!

@nik9000 merged commit ea64bf4 into elastic:main on Oct 3, 2025 (34 checks passed)

nik9000 commented Oct 3, 2025

> Great find!

It was @GalLalouche's find actually. I'd thought we'd do it but, alas, no. Now it's in though!


Labels

:Analytics/ES|QL (AKA ESQL), >enhancement, Team:Analytics (meta label for the analytical engine team: ESQL/Aggs/Geo), v9.3.0


5 participants