ESQL: Add support for exponential_histogram in code generation #137459

JonasKunz · 2025-10-31T14:02:03Z

Adds support for the exponential_histogram type in the Evaluator, Aggregator and Grouping Aggregator code generation. MvEvaluator and ConvertEvaluator have not been touched yet; we'll get to those when we actually need them for exponential histograms.

The code generation is changed as follows:

The "scratch" concept, which was previously only used for BytesRef, has been generalized to also work with exponential histograms, because their getter also requires a scratch.
Added support for types that never have vectors (and do not even have a vector type).
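As a rough illustration of the generalized scratch handling, the following Java sketch lets each type description decide whether it needs a scratch variable and emit its own declaration. `TypeDesc`, `scratchDeclaration`, and the scratch class name `ExponentialHistogramScratch` are hypothetical names chosen for this example; they are not taken from the generator's actual API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

final class ScratchSketch {
    /** Hypothetical description of an element type as a code generator might model it. */
    record TypeDesc(String elementType, String scratchClass) {
        /** Returns a declaration statement, or empty if the type needs no scratch. */
        Optional<String> scratchDeclaration(String varName) {
            if (scratchClass == null) {
                return Optional.empty();
            }
            return Optional.of(scratchClass + " " + varName + " = new " + scratchClass + "();");
        }
    }

    static final TypeDesc LONG = new TypeDesc("LONG", null);
    static final TypeDesc BYTES_REF = new TypeDesc("BYTES_REF", "BytesRef");
    // Assumed scratch class name, for illustration only.
    static final TypeDesc EXP_HISTO = new TypeDesc("EXPONENTIAL_HISTOGRAM", "ExponentialHistogramScratch");

    /** Emits one scratch declaration per parameter that needs one, each with its own variable name. */
    static List<String> declarations(List<TypeDesc> params) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < params.size(); i++) {
            params.get(i).scratchDeclaration("scratch" + i).ifPresent(out::add);
        }
        return out;
    }

    public static void main(String[] args) {
        declarations(List.of(LONG, BYTES_REF, EXP_HISTO)).forEach(System.out::println);
    }
}
```

The point of the design is that the generator no longer checks for one specific type name; any type that reports a scratch class gets a declaration.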

In my understanding, ExponentialHistogramBlocks will never have a corresponding Vector, as they don't have a memory layout which directly benefits from e.g. SIMD. Vectorization still works by first "extracting" the sub-blocks, which might be vector-backed.

For example, MIN(histogram) will be implemented as the surrogate MIN(HISTOGRAM_MIN(histogram)), where HISTOGRAM_MIN is a function that directly returns the min sub-block, which in turn might be a vector.
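The surrogate idea can be sketched in plain Java as follows. This is only a simplified model: `HistogramBlock`, `histogramMin`, and `min` are hypothetical stand-ins, and primitive arrays take the place of the real sub-blocks and vectors.

```java
final class SurrogateMinSketch {
    /** Hypothetical stand-in for a histogram block whose per-position fields live in primitive sub-blocks. */
    record HistogramBlock(double[] mins, double[] maxes, long[] counts) {}

    /** Stand-in for HISTOGRAM_MIN: hands back the min sub-block without touching any buckets. */
    static double[] histogramMin(HistogramBlock block) {
        return block.mins();
    }

    /** Stand-in for MIN over a dense array of doubles. */
    static double min(double[] values) {
        double result = Double.POSITIVE_INFINITY;
        for (double v : values) {
            result = Math.min(result, v);
        }
        return result;
    }

    /** MIN(histogram) expressed as MIN(HISTOGRAM_MIN(histogram)). */
    static double minOfHistograms(HistogramBlock block) {
        return min(histogramMin(block));
    }

    public static void main(String[] args) {
        HistogramBlock block = new HistogramBlock(
            new double[] { 3.0, -1.5, 7.0 },
            new double[] { 9.0, 4.0, 8.0 },
            new long[] { 10, 20, 30 }
        );
        System.out.println(minOfHistograms(block));
    }
}
```

The inner loop only ever sees a dense primitive array, which is the part that can benefit from vectorization even though the histogram type itself has no vector form.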

To see these code-generation changes in action, please have a look at my PoC PR:

HistogramPercentile function and generated Evaluator: this function extracts a single percentile from a single histogram value
Merge, aggregator declaration and the corresponding non-grouping aggregator and grouping aggregator

Note that I'm not planning on exposing those two functions to end-users for now. Instead, they will only be used as surrogates for the existing PERCENTILE agg.

JonasKunz · 2025-10-31T14:27:40Z

...ugin/esql/compute/gen/src/main/java/org/elasticsearch/compute/gen/AggregatorImplementer.java

-            if (intermediateState.stream().map(IntermediateStateDesc::elementType).anyMatch(n -> n.equals("BYTES_REF"))) {
-                builder.addStatement("$T scratch = new $T()", BYTES_REF, BYTES_REF);
+            for (IntermediateStateDesc interState : intermediateState) {
+                interState.addScratchDeclaration(builder);


I think this was buggy prior to my change, but without causing a defect?

If there were multiple BYTES_REF state-members, they would have shared the same scratch, which seems incorrect?

That seems possible. I imagine we don't have any intermediate states with two strings.
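To make the concern about a shared scratch concrete, here is a minimal Java sketch of the aliasing hazard. `Ref` is a simplified, hypothetical stand-in for BytesRef, and `get` mimics a getter that fills the caller's scratch and returns it. Whether the previously generated code ever held two such values at the same time is not established in this thread, so this only shows what could go wrong in principle.

```java
final class SharedScratchSketch {
    /** Simplified stand-in for BytesRef: a mutable view that a getter repoints. */
    static final class Ref {
        String value;
    }

    /** Mimics a getter that fills the caller-provided scratch and returns that same object. */
    static Ref get(String[] storage, int index, Ref scratch) {
        scratch.value = storage[index];
        return scratch;
    }

    /** Both reads go through one scratch, so the second read overwrites the first. */
    static String readWithSharedScratch(String[] a, String[] b, int i) {
        Ref scratch = new Ref();
        Ref first = get(a, i, scratch);
        Ref second = get(b, i, scratch); // repoints the object that `first` refers to
        return first.value + "," + second.value;
    }

    /** One scratch per value keeps the two reads independent. */
    static String readWithSeparateScratches(String[] a, String[] b, int i) {
        Ref first = get(a, i, new Ref());
        Ref second = get(b, i, new Ref());
        return first.value + "," + second.value;
    }

    public static void main(String[] args) {
        String[] a = { "x" };
        String[] b = { "y" };
        System.out.println(readWithSharedScratch(a, b, 0));     // prints "y,y"
        System.out.println(readWithSeparateScratches(a, b, 0)); // prints "x,y"
    }
}
```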

elasticsearchmachine · 2025-10-31T14:28:30Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

nik9000 · 2025-10-31T14:32:25Z

In my understanding, ExponentialHistogramBlocks will never have a corresponding Vector, as they don't have a memory layout which directly benefits from e.g. SIMD. Vectorization still works by first "extracting" the sub-blocks, which might be vector-backed.

That's fine. Maybe they will one day, and we'll do it then.

nik9000 · 2025-10-31T17:33:35Z

...ugin/esql/compute/gen/src/main/java/org/elasticsearch/compute/gen/AggregatorImplementer.java


-        this.hasOnlyBlockArguments = this.aggParams.stream().allMatch(a -> a instanceof BlockArgument);
+        this.tryToUseVectors = aggParams.stream().anyMatch(a -> (a instanceof BlockArgument) == false)
+            && aggParams.stream().noneMatch(a -> a instanceof StandardArgument && a.dataType(false) == null);


I'd prefer making a method in Argument that's, like, hasVector or something.

Fixed in 00d6d78.
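For readers without access to the commit, one possible shape of such a method is sketched below in Java. This is an assumption about the general idea, not the code in 00d6d78: the `Argument`, `StandardArgument`, and `BlockArgument` types here are simplified stand-ins, and the `vectorType` field is hypothetical. The condition mirrors the diff above, with the null check on the data type replaced by a `hasVector()` call.

```java
import java.util.List;

final class HasVectorSketch {
    /** Simplified stand-in for the generator's argument abstraction. */
    interface Argument {
        /** True if values of this argument can be read through a vector. */
        boolean hasVector();
    }

    /** Hypothetical: a null vectorType models a type that has no vector form at all. */
    record StandardArgument(String name, String vectorType) implements Argument {
        @Override
        public boolean hasVector() {
            return vectorType != null;
        }
    }

    /** Arguments that are always consumed as whole blocks. */
    record BlockArgument(String name) implements Argument {
        @Override
        public boolean hasVector() {
            return false;
        }
    }

    /** Same shape as the condition in the diff, expressed through hasVector(). */
    static boolean tryToUseVectors(List<Argument> params) {
        return params.stream().anyMatch(a -> (a instanceof BlockArgument) == false)
            && params.stream().noneMatch(a -> a instanceof StandardArgument && a.hasVector() == false);
    }

    public static void main(String[] args) {
        List<Argument> params = List.of(new StandardArgument("histogram", null));
        System.out.println(tryToUseVectors(params));
    }
}
```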

nik9000 · 2025-10-31T17:35:00Z

...ugin/esql/compute/gen/src/main/java/org/elasticsearch/compute/gen/AggregatorImplementer.java

-            if (intermediateState.stream().map(IntermediateStateDesc::elementType).anyMatch(n -> n.equals("BYTES_REF"))) {
-                builder.addStatement("$T scratch = new $T()", BYTES_REF, BYTES_REF);
+            for (IntermediateStateDesc interState : intermediateState) {
+                interState.addScratchDeclaration(builder);


That seems possible. I imagine we don't have any intermediate states with two strings.

nik9000

LGTM

...l/compute/gen/src/main/java/org/elasticsearch/compute/gen/GroupingAggregatorImplementer.java

JonasKunz added 2 commits October 31, 2025 14:58

Add exponential_histogram support in Evaluator and Aggregator code generation

b5a27e0

Generate code

5466107

JonasKunz added the :Analytics/ES|QL AKA ESQL label Oct 31, 2025

elasticsearchmachine added the v9.3.0 and external-contributor (pull request authored by a developer outside the Elasticsearch team) labels Oct 31, 2025

JonasKunz added the >non-issue label and removed the external-contributor and v9.3.0 labels Oct 31, 2025

Merge branch 'main' into exp-histo-codegen-support

1e7f784

JonasKunz commented Oct 31, 2025

JonasKunz marked this pull request as ready for review October 31, 2025 14:28

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Oct 31, 2025

JonasKunz requested a review from nik9000 October 31, 2025 14:30

JonasKunz added the v9.3.0 label Oct 31, 2025

nik9000 requested changes Oct 31, 2025

JonasKunz added 2 commits November 3, 2025 09:16

Add hasVector method to Argument

00d6d78

Merge remote-tracking branch 'elastic/main' into exp-histo-codegen-support

504f48d

JonasKunz requested a review from nik9000 November 3, 2025 09:13

Merge branch 'main' into exp-histo-codegen-support

1e7c489

nik9000 approved these changes Nov 3, 2025

JonasKunz added 2 commits November 3, 2025 15:57

try to get rid of some instanceofs

6aca146

Merge remote-tracking branch 'elastic/main' into exp-histo-codegen-support

7e0e697

nik9000 approved these changes Nov 3, 2025

JonasKunz enabled auto-merge (squash) November 3, 2025 15:15

JonasKunz merged commit 2047c9a into elastic:main Nov 3, 2025
33 of 34 checks passed

JonasKunz deleted the exp-histo-codegen-support branch November 3, 2025 18:37
