ESQL: Keep ordinals in conversion functions #125357

nik9000 · 2025-03-21T01:14:57Z

Make the conversion functions that process BytesRefs into BytesRefs keep the OrdinalBytesRefVectors when processing. Let's use TO_LOWER as an example. First, the performance numbers:

  (operation)   Score    Error ->  Score    Error  Units
     to_lower  30.662 ±  6.163 -> 30.048 ±  0.479  ns/op
to_lower_ords  30.773 ±  0.370 ->  0.025 ±  0.001  ns/op
     to_upper  33.552 ±  0.529 -> 35.775 ±  1.799  ns/op
to_upper_ords  35.791 ±  0.658 ->  0.027 ±  0.001  ns/op

The test has 8192 positions containing alternating foo and bar. Running TO_LOWER via ordinals is super duper fast: about 0.025 ns/op instead of about 30 ns/op. It's no longer O(positions) and is now O(unique_values).

Let's paint some pictures! OrdinalBytesRefVector is a lookup table. Like this:

+-------+----------+
| bytes | ordinals |
| ----- | -------- |
|  FOO  | 0        |
|  BAR  | 1        |
|  BAZ  | 2        |
+-------+ 1        |
        | 1        |
        | 0        |
        +----------+

That lookup table is one block. When you read it you look up the ordinal and match it to the bytes. Previously TO_LOWER would process each value one at a time and make:

bytes
-----
 foo
 bar
 baz
 bar
 bar
 foo

So it'd run TO_LOWER once per position and it'd make a plain, non-ordinal vector instead of a lookup table. With this change TO_LOWER will now make:

+-------+----------+
| bytes | ordinals |
| ----- | -------- |
|  foo  | 0        |
|  bar  | 1        |
|  baz  | 2        |
+-------+ 1        |
        | 1        |
        | 0        |
        +----------+

We don't even have to copy the ordinals - we can reuse those from the input and just bump the reference count. That's why this goes from O(positions) to O(unique_values).
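
To make the difference concrete, here is a minimal sketch in plain Java. It does not use the real compute-engine classes: `OrdinalsSketch`, `OrdinalVector`, `convertPerPosition` and `convertDictionary` are hypothetical names made up for illustration, and strings stand in for `BytesRef`s. Sharing the `int[]` is only an analogue of bumping the reference count on the input ordinals.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.UnaryOperator;

/** Toy stand-in for an ordinal vector: a small dictionary plus one ordinal per position. */
final class OrdinalsSketch {
    /** Hypothetical type: "bytes" is the lookup table, "ordinals" points into it. */
    record OrdinalVector(List<String> bytes, int[] ordinals) {
        String get(int position) {
            return bytes.get(ordinals[position]);
        }
    }

    /** Old shape: one conversion per position, and the result is a plain list. */
    static List<String> convertPerPosition(OrdinalVector in, UnaryOperator<String> fn) {
        List<String> out = new ArrayList<>(in.ordinals().length);
        for (int p = 0; p < in.ordinals().length; p++) {
            out.add(fn.apply(in.get(p)));
        }
        return out;
    }

    /** New shape: one conversion per dictionary entry; the ordinals array is shared, not copied. */
    static OrdinalVector convertDictionary(OrdinalVector in, UnaryOperator<String> fn) {
        List<String> converted = new ArrayList<>(in.bytes().size());
        for (String v : in.bytes()) {
            converted.add(fn.apply(v));
        }
        return new OrdinalVector(converted, in.ordinals());
    }

    public static void main(String[] args) {
        OrdinalVector in = new OrdinalVector(List.of("FOO", "BAR", "BAZ"), new int[] { 0, 1, 2, 1, 1, 0 });
        OrdinalVector out = convertDictionary(in, s -> s.toLowerCase(Locale.ROOT));
        for (int p = 0; p < out.ordinals().length; p++) {
            System.out.println(out.get(p));
        }
    }
}
```

With the six positions from the pictures above, `convertPerPosition` calls the function six times while `convertDictionary` calls it three times, once per unique value.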

elasticsearchmachine · 2025-03-21T01:15:22Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

elasticsearchmachine · 2025-03-21T01:15:23Z

Hi @nik9000, I've created a changelog YAML for you.

ivancea

LGTM!

ivancea · 2025-03-21T10:54:39Z

...src/test/java/org/elasticsearch/xpack/esql/expression/function/AbstractFunctionTestCase.java

+                Map<BytesRef, Integer> dedupe = new HashMap<>();
+                BytesRefBlock bytesRefBlock = (BytesRefBlock) block;
+                try (
+                    IntBlock.Builder ordinals = block.blockFactory().newIntBlockBuilder(block.getPositionCount());
+                    BytesRefVector.Builder bytes = block.blockFactory().newBytesRefVectorBuilder(block.getPositionCount())
+                ) {
+                    BytesRef scratch = new BytesRef();
+                    for (int p = 0; p < block.getPositionCount(); p++) {
+                        int first = block.getFirstValueIndex(p);
+                        int count = block.getValueCount(p);
+                        if (count == 0) {
+                            ordinals.appendNull();
+                            continue;
+                        }
+                        if (count == 1) {
+                            BytesRef v = bytesRefBlock.getBytesRef(first, scratch);
+                            ordinals.appendInt(dedupe(dedupe, bytes, v));
+                            continue;
+                        }
+                        int end = first + count;
+                        ordinals.beginPositionEntry();
+                        for (int i = first; i < end; i++) {
+                            BytesRef v = bytesRefBlock.getBytesRef(i, scratch);
+                            ordinals.appendInt(dedupe(dedupe, bytes, v));
+                        }
+                        ordinals.endPositionEntry();
+                    }
+                    blocks[b] = new OrdinalBytesRefBlock(ordinals.build(), bytes.build());
+                    bytesRefBlock.decRef();
+                }
+            }


I wonder if we should move this to BlockUtils or BlockTestUtils preemptively, as something like BlockUtils.toOrdinals(BytesRefBlock)? Or onto BytesRefBlock itself.
Maybe not now, but I'm not sure we'll remember this code is here if we need it again in the future.

nik9000 · 2025-03-21

BlockTestUtils seems like a good place for it.
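
For readers skimming the hunk above, the core of that test helper is "assign each distinct value the next free ordinal". A rough sketch of that idea in plain Java follows. It is not the code from the PR: `ToOrdinalsSketch`, `Ordinals` and this `dedupe` are hypothetical stand-ins, strings replace `BytesRef`s, and an empty list stands in for a null position.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch: turn multivalued positions into a dictionary plus ordinals. */
final class ToOrdinalsSketch {
    /** dictionary holds each distinct value once; ordinals holds one list per position. */
    record Ordinals(List<String> dictionary, List<List<Integer>> ordinals) {}

    static Ordinals toOrdinals(List<List<String>> positions) {
        Map<String, Integer> seen = new HashMap<>();
        List<String> dictionary = new ArrayList<>();
        List<List<Integer>> ordinals = new ArrayList<>(positions.size());
        for (List<String> values : positions) {
            // An empty list here plays the role of appendNull() in the real builder.
            List<Integer> ords = new ArrayList<>(values.size());
            for (String v : values) {
                ords.add(dedupe(seen, dictionary, v));
            }
            ordinals.add(ords);
        }
        return new Ordinals(dictionary, ordinals);
    }

    /** Returns the ordinal for v, adding it to the dictionary the first time it is seen. */
    private static int dedupe(Map<String, Integer> seen, List<String> dictionary, String v) {
        Integer ord = seen.get(v);
        if (ord == null) {
            ord = dictionary.size();
            dictionary.add(v);
            seen.put(v, ord);
        }
        return ord;
    }
}
```

Ordinals are handed out in first-seen order, so the dictionary is not sorted. That is fine for a lookup table, but it is an assumption of this sketch rather than something the thread states.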

ivancea · 2025-03-21T11:35:20Z

...sql/compute/gen/src/main/java/org/elasticsearch/compute/gen/ConvertEvaluatorImplementer.java

    ) {
        this.declarationType = (TypeElement) processFunction.getEnclosingElement();
        this.processFunction = new EvaluatorImplementer.ProcessFunction(types, processFunction, warnExceptions);
+        this.canProcessOrdinals = warnExceptions.isEmpty()


About this warnExceptions.isEmpty() check: I would guess that creating a new IntBlock to insert the nulls, instead of reusing the existing ordinals, would still be better than what we do now? Maybe for a follow-up?

nik9000 · 2025-03-21

Right - that's basically what we'd do. It's a little tricky to know where to put it, because you'd process the bytes one by one and not know which positions link to them. I think we'd want a separate path if there are warnExceptions.
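
One possible shape for such a separate path, sketched in plain Java under loud assumptions: this is not in the PR, `WarningPathSketch`, `Result` and `convert` are hypothetical names, strings stand in for `BytesRef`s, and a `null` ordinal stands in for a null position. The idea is to convert the dictionary once, remember which entries failed, then build new ordinals that point failed entries at null.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

/** Hypothetical sketch of a dictionary-level conversion that can fail per entry. */
final class WarningPathSketch {
    /** ordinals uses null for positions whose dictionary entry failed to convert. */
    record Result(List<String> dictionary, Integer[] ordinals, List<String> warnings) {}

    static Result convert(List<String> dictionary, int[] ordinals, Function<String, String> fn) {
        List<String> converted = new ArrayList<>();
        List<String> warnings = new ArrayList<>();
        // remap[old ordinal] = new ordinal, or -1 if the conversion threw.
        int[] remap = new int[dictionary.size()];
        for (int o = 0; o < dictionary.size(); o++) {
            try {
                String v = fn.apply(dictionary.get(o));
                remap[o] = converted.size();
                converted.add(v);
            } catch (RuntimeException e) {
                remap[o] = -1;
                warnings.add(String.valueOf(e.getMessage()));
            }
        }
        // The input ordinals can no longer be shared: failed entries become nulls.
        Integer[] newOrdinals = new Integer[ordinals.length];
        for (int p = 0; p < ordinals.length; p++) {
            int mapped = remap[ordinals[p]];
            if (mapped >= 0) {
                newOrdinals[p] = mapped;
            }
        }
        return new Result(converted, newOrdinals, warnings);
    }
}
```

The conversion itself still runs once per unique value, but building the new ordinals is O(positions) again. The sketch also records one warning per failed dictionary entry rather than one per position, which may not match how warnings are expected to be reported.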

nik9000 added >enhancement :Analytics/ES|QL AKA ESQL v9.1.0 labels Mar 21, 2025

nik9000 requested a review from ivancea March 21, 2025 01:14

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Mar 21, 2025

Update docs/changelog/125357.yaml

d998837

ivancea approved these changes Mar 21, 2025

nik9000 added 4 commits March 21, 2025 09:34

Merge branch 'main' into esql_keep_ords_1

36444b9

Merge branch 'main' into esql_keep_ords_1

4152f19

Move

94553d2

Merge remote-tracking branch 'nik9000/esql_keep_ords_1' into esql_kee…

c208ee0

…p_ords_1

nik9000 enabled auto-merge (squash) March 21, 2025 15:48

Merge branch 'main' into esql_keep_ords_1

ed76a50

nik9000 merged commit c5e7684 into elastic:main Mar 21, 2025
16 of 17 checks passed

nik9000 added the v8.19.0 label May 9, 2025
