ES|QL: Make TopN aggregator use heap sort internally. #134140

przemekwitek · 2025-09-04T14:16:09Z

Until now, the TopN aggregator has been copying the bucket values to the separate array and sorting them using Arrays.sort method call.
This PR makes use of the existing heap structure to sort in-place using standard heap sort algorithm.

elasticsearchmachine · 2025-09-04T15:30:45Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

…elasticsearch into topn_use_heap_sort

ivancea · 2025-09-04T15:47:35Z

We also have the BytesRefBucketedSort and IpBucketedSort classes that aren't autogenerated. The same comment about the sort is there, if you want to give them a try. I think it should be quite similar, if not identical.

Same with

elasticsearch/server/src/main/java/org/elasticsearch/search/sort/BucketedSort.java

Line 187 in 1ba21c2

    
           // TODO we usually have a heap here so we could use that to build the results sorted.

for _search, which is the base of these classes. I'm not sure if it's worth it there though, and it may be a bit different. @nik9000? (In any case, this one doesn't have to be in this PR, I'm just commenting)

nik9000 · 2025-09-04T17:05:14Z

for _search, which is the base of these classes. I'm not sure if it's worth it there though, and it may be a bit different. @nik9000? (In any case, this one doesn't have to be in this PR, I'm just commenting)

I think the testing is pretty good over there. I'd grab there first and, if you have time, grab the next ones. But these are more imporant I think.

nik9000

In BucketedSortTestCase we only use up to three sorted values. It's probably worth adding a case where we use a big list of them here. Just to make sure you didn't off-by-one or round funny or something. Which I'd never catch with code review.

przemekwitek · 2025-09-05T12:07:09Z

We also have the BytesRefBucketedSort and IpBucketedSort classes that aren't autogenerated. The same comment about the sort is there, if you want to give them a try. I think it should be quite similar, if not identical.

Done.

przemekwitek · 2025-09-08T07:44:11Z

In BucketedSortTestCase we only use up to three sorted values. It's probably worth adding a case where we use a big list of them here.

IIUC such a test already exists: BucketedSortTestCase.testManyBucketsManyHits.
This test collects 10000 values in random order and then asserts that the values are returned as sorted.

Do you think there is anything more I should do on this PR?
I've noticed some code duplication wrt heap structures between those classes, but that could be better handled in a separate PR if we decide to do so.

nik9000 · 2025-09-08T15:13:12Z

This test collects 10000 values in random order and then asserts that the values are returned as sorted.

Ah. That's good.

I've noticed some code duplication wrt heap structures between those classes, but that could be better handled in a separate PR if we decide to do so.

Yeah. That's a fine for later thing. Some of that duplication often comes from using the templates. Some amount of duplication is to monomorphize the tight loops. We tend to default to this kind of behavior in the hot loop, which this is, and we'll do microbenchmarks if we're trying to be more careful.

przemekwitek · 2025-09-10T09:37:00Z

Same with

elasticsearch/server/src/main/java/org/elasticsearch/search/sort/BucketedSort.java

Line 187 in 1ba21c2

// TODO we usually have a heap here so we could use that to build the results sorted.

for _search, which is the base of these classes. I'm not sure if it's worth it there though, and it may be a bit different.

Indeed, the code is a bit different, I would say more advanced (it has support for extra values which we may need to port to ES|QL's version at some point).

I did local changes there and run a benchmark.
The version fetched from main has results:

# Warmup Iteration   1: 3.913 ns/op
Iteration   1: 4.008 ns/opING [1m 39s]
Iteration   2: 4.107 ns/opING [1m 49s]
Iteration   3: 4.006 ns/opING [1m 59s]
Iteration   4: 4.163 ns/opING [2m 9s]
Iteration   5: 4.008 ns/opING [2m 19s]
Iteration   6: 4.044 ns/opING [2m 29s]
Iteration   7: 4.101 ns/opING [2m 39s]
Iteration   8: 4.003 ns/opING [2m 49s]
Iteration   9: 4.070 ns/opING [2m 59s]
Iteration  10: 4.147 ns/opING [3m 9s]

Result "org.elasticsearch.benchmark.compute.operator.AggregatorBenchmark.run":
  4.066 ±(99.9%) 0.093 ns/op [Average]
  (min, avg, max) = (4.003, 4.066, 4.163), stdev = 0.061
  CI (99.9%): [3.973, 4.158] (assumes normal distribution)

whereas the local change with heap sort had:

# Warmup Iteration   1: 4.148 ns/op
Iteration   1: 3.989 ns/opING [44s]
Iteration   2: 4.205 ns/opING [54s]
Iteration   3: 4.078 ns/opING [1m 4s]
Iteration   4: 4.300 ns/opING [1m 14s]
Iteration   5: 4.102 ns/opING [1m 24s]
Iteration   6: 3.987 ns/opING [1m 34s]
Iteration   7: 3.933 ns/opING [1m 44s]
Iteration   8: 3.896 ns/opING [1m 54s]
Iteration   9: 3.885 ns/opING [2m 4s]
Iteration  10: 4.241 ns/opING [2m 14s]

Result "org.elasticsearch.benchmark.compute.operator.AggregatorBenchmark.run":
  4.062 ±(99.9%) 0.224 ns/op [Average]
  (min, avg, max) = (3.885, 4.062, 4.300), stdev = 0.148
  CI (99.9%): [3.838, 4.286] (assumes normal distribution)

so it's very similar. The command used was:

./gradlew run -p benchmarks --args "\\.AggregatorBenchmark -pgrouping=none -pop=top -pblockType=vector_longs -pfilter=none -wi 1 -i 10"

ivancea

Thanks for the benchies! I suppose it will be difficult to see great improvements in the benchmarks we have, as ingestion will take most of the time and result collection is done only once. Maybe a larger limit and less input data? Anyway, performance didn't suffer apparently, so I think it's nice as it is!

przemekwitek · 2025-09-10T11:25:25Z

I suppose it will be difficult to see great improvements in the benchmarks we have

FWIW: Apart from the time aspect, we save on some memory as we do not have to allocate this temporary list.

przemekwitek added the WIP label Sep 4, 2025

elasticsearchmachine added the v9.2.0 label Sep 4, 2025

przemekwitek changed the title ~~Make TopN aggregator use heap sort internally.~~ ES|QL: Make TopN aggregator use heap sort internally. Sep 4, 2025

Make TopN aggregator use heap sort internally.

8371793

przemekwitek force-pushed the topn_use_heap_sort branch from 877b852 to 8371793 Compare September 4, 2025 15:22

przemekwitek removed the WIP label Sep 4, 2025

przemekwitek marked this pull request as ready for review September 4, 2025 15:29

przemekwitek added :Analytics/ES|QL AKA ESQL >refactoring labels Sep 4, 2025

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Sep 4, 2025

przemekwitek added 2 commits September 4, 2025 17:31

Make TopN aggregator use heap sort internally.

afeaf73

Merge branch 'topn_use_heap_sort' of https://github.com/przemekwitek/…

12fa01f

…elasticsearch into topn_use_heap_sort

ivancea requested review from ivancea and nik9000 September 4, 2025 15:44

nik9000 reviewed Sep 4, 2025

View reviewed changes

Use heap sort in BytesRefBucketedSort and IpBucketedSort too.

f2c9c08

Use "start" rather than "rootIndex"

5629d4a

ivancea approved these changes Sep 10, 2025

View reviewed changes

przemekwitek merged commit 20d6049 into elastic:main Sep 10, 2025
33 checks passed

przemekwitek deleted the topn_use_heap_sort branch September 10, 2025 11:25

GalLalouche mentioned this pull request Sep 17, 2025

SampleBooleanAggregatorFunctionTests.testSimpleWithCranky fails #134918

Closed

przemekwitek mentioned this pull request Sep 23, 2025

Fix SampleXXXAggregatorFunctionTests.testSimpleWithCranky test #135261

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ES|QL: Make TopN aggregator use heap sort internally. #134140

ES|QL: Make TopN aggregator use heap sort internally. #134140

Uh oh!

przemekwitek commented Sep 4, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Sep 4, 2025

Uh oh!

ivancea commented Sep 4, 2025 •

edited

Loading

Uh oh!

nik9000 commented Sep 4, 2025

Uh oh!

nik9000 left a comment

Uh oh!

przemekwitek commented Sep 5, 2025

Uh oh!

przemekwitek commented Sep 8, 2025 •

edited

Loading

Uh oh!

nik9000 commented Sep 8, 2025

Uh oh!

przemekwitek commented Sep 10, 2025

Uh oh!

ivancea left a comment

Uh oh!

przemekwitek commented Sep 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ES|QL: Make TopN aggregator use heap sort internally. #134140

ES|QL: Make TopN aggregator use heap sort internally. #134140

Uh oh!

Conversation

przemekwitek commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Sep 4, 2025

Uh oh!

ivancea commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nik9000 commented Sep 4, 2025

Uh oh!

nik9000 left a comment

Choose a reason for hiding this comment

Uh oh!

przemekwitek commented Sep 5, 2025

Uh oh!

przemekwitek commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nik9000 commented Sep 8, 2025

Uh oh!

przemekwitek commented Sep 10, 2025

Uh oh!

ivancea left a comment

Choose a reason for hiding this comment

Uh oh!

przemekwitek commented Sep 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

przemekwitek commented Sep 4, 2025 •

edited

Loading

ivancea commented Sep 4, 2025 •

edited

Loading

przemekwitek commented Sep 8, 2025 •

edited

Loading