Push compute engine value loading down to tsdb codec #132460

martijnvg · 2025-08-05T16:10:50Z

Evolution of #128334, but is targeted for just loading @timestamp and _tsid field values (tsid is missing and will be added soon) in the context of queries like:

TS metrics-hostmetricsreceiver.otel-default | WHERE `metrics.system.memory.utilization` IS NOT NULL AND @timestamp >= \"2025-07-31T08:58:00.000Z\" AND @timestamp <= \"2025-07-31T10:28:00.000Z\" | STATS AVG(AVG_OVER_TIME(`metrics.system.memory.utilization`)) BY host.name, BUCKET(@timestamp, 1h) | LIMIT 10000

Relates to #128445 and #132379

martijnvg · 2025-08-06T14:03:05Z

I benchmarked this change with the following query:

TS metrics-hostmetricsreceiver.otel-default | STATS COUNT(COUNT_OVER_TIME(`@timestamp`)) BY BUCKET(@timestamp, 1h) | LIMIT 10000

This change only optimizes loading of @timestamp and _tsid fields and the query in the description of pr also loads two other fields (host.name and metrics.system.memory.utilization). Loading these other fields now takes the majority of value source loading and so I'm using the query mentioned here to highlight the impact of this improvement.

Without this change default data partition on my local laptop the average query time after 36 executions is: 957 ms. Detailed profiling of value source operator using shard data partition:

{
    "operator": "ValuesSourceReaderOperator[fields = [@timestamp]]",
    "status": {
        "readers_built": {
            "@timestamp:column_at_a_time:BlockDocValuesReader.SingletonLongs": 59
        },
        "values_loaded": 221184000,
        "process_nanos": 713120339, --> ~713 ms
        "pages_received": 59106,
        "pages_emitted": 59106,
        "rows_received": 221184000,
        "rows_emitted": 221184000
    }
}

{
    "operator": "ValuesSourceReaderOperator[fields = [_tsid]]",
    "status": {
        "readers_built": {
            "_tsid:column_at_a_time:BlockDocValuesReader.SingletonOrdinals": 59
        },
        "values_loaded": 221184000,
        "process_nanos": 941064060, --> ~941 ms
        "pages_received": 59106,
        "pages_emitted": 59106,
        "rows_received": 221184000,
        "rows_emitted": 221184000
    }
}

With this change using default data partition on my local laptop the average query time after 36 executions is: 753 ms. Detailed profiling of value source operator using shard data partition:

{
    "operator": "ValuesSourceReaderOperator[fields = [@timestamp]]",
    "status": {
        "readers_built": {
            "@timestamp:column_at_a_time:TimestampBlockLoader.Timestamps": 59
        },
        "values_loaded": 221184000,
        "process_nanos": 250188096, --> ~250ms
        "pages_received": 59106,
        "pages_emitted": 59106,
        "rows_received": 221184000,
        "rows_emitted": 221184000
    }
}

{
    "operator": "ValuesSourceReaderOperator[fields = [_tsid]]",
    "status": {
        "readers_built": {
            "_tsid:column_at_a_time:TSIDBlockLoader.TSIDs": 59
        },
        "values_loaded": 221184000,
        "process_nanos": 244960319, --> ~245ms
        "pages_received": 59106,
        "pages_emitted": 59106,
        "rows_received": 221184000,
        "rows_emitted": 221184000
    }
}

The detailed profiling output seems to suggest that ~3 times improvement. Note that the profile data was captured with data_partitioning set to shard. Otherwise many more ValuesSourceReaderOperator and then results are not comparable between query executions.

That improvement isn't as visible in the overall query time, because still a big part of time spent is in HashAggregateOperator.

Flame graph with simplified query with this change:

…als are dense. Also added block loader for dimension fields.

This is the first of many changes that pushes loading of field values to the es819 doc values codec in case of logsdb/tsdb and when the field supports it. This change first targets reading field values in bulk mode at codec level when doc values type is numeric doc values or sorted doc values, there is only one value per document, and the field is dense (all documents have a value). Multivalued and sparse fields are more complex to support bulk reading for, but it is possible. With this change, the following field types will support bulk read mode at codec level under the described conditions: long, date, geo_point, point and unsigned_long. Other number types like integer, short, double, float, scaled_float will be supported in a followup, but would be similar to long based fields, but required an additional conversion step to either an int or float vector. This change originates from elastic#132460 (which adds bulk reading to `@timestamp`, `_tsid` and dimension fields) and is basically the timestamp support part of it. In another followup, support for single valued, dense sorted (set) doc values will be added for field like _tsid. Relates to elastic#128445

) This is the first of many changes that pushes loading of field values to the es819 doc values codec in case of logsdb/tsdb and when the field supports it. This change first targets reading field values in bulk mode at codec level when doc values type is numeric doc values or sorted doc values, there is only one value per document, and the field is dense (all documents have a value). Multivalued and sparse fields are more complex to support bulk reading for, but it is possible. With this change, the following field types will support bulk read mode at codec level under the described conditions: long, date, geo_point, point and unsigned_long. Other number types like integer, short, double, float, scaled_float will be supported in a followup, but would be similar to long based fields, but required an additional conversion step to either an int or float vector. This change originates from #132460 (which adds bulk reading to `@timestamp`, `_tsid` and dimension fields) and is basically the timestamp support part of it. In another followup, support for single valued, dense sorted (set) doc values will be added for field like _tsid. Relates to #128445

elasticsearchmachine added the v9.2.0 label Aug 5, 2025

martijnvg changed the title ~~Push compute value loading down to tsdb codec~~ Push compute engine value loading down to tsdb codec Aug 5, 2025

martijnvg and others added 22 commits August 7, 2025 09:07

First attempt block building at codec level

1f0a039

iter

42148bf

[CI] Auto commit changes from spotless

4c9038b

doubles

a1f9e5c

[CI] Auto commit changes from spotless

b7a945a

fix BlockAwareSingletonDoubles

51f0224

adjust for notion of offset after rebasing

7d84821

tweak

b01cdc0

BlockAwareSingletonOrdinals

6a59441

tweak

eb2c985

Remove loadDoc(...) block block aware interfaces.

0482c2d

remove unrelated assertions

b07ab01

More specialization for dense singleton longs.

bbe018b

Restructure code

0dda8c4

[CI] Auto commit changes from spotless

3962dc4

introduce dedicated block loader for timestamp field.

ec5458a

adjust breaker

80f71e2

added not optimized block-based ordinal loading for tsid

65ab9b0

add TSIDOrdinalsBuilder

0f6ad87

better initially size the bytesBuilder

4d05511

[CI] Auto commit changes from spotless

fc2e6b9

optimize tsid ordinal loading

2f1aa71

martijnvg force-pushed the block_aware_2 branch from 68d2784 to 2f1aa71 Compare August 7, 2025 02:07

martijnvg and others added 4 commits August 7, 2025 22:15

Add make use of SortedSetDocValues#termsEnum() to load terms if ordin…

969e71a

…als are dense. Also added block loader for dimension fields.

fix bug

a934ddf

Merge remote-tracking branch 'es/main' into block_aware_2

a493da0

[CI] Auto commit changes from spotless

5eb9593

martijnvg and others added 5 commits August 8, 2025 11:13

fix test

e4a57b0

fixed length bug

6a930f3

Merge remote-tracking branch 'es/main' into block_aware_2

873124a

constant block

8e9928d

[CI] Auto commit changes from spotless

51e95a1

martijnvg force-pushed the block_aware_2 branch from edb81ff to 51e95a1 Compare August 9, 2025 16:05

martijnvg mentioned this pull request Aug 10, 2025

Push compute engine value loading for longs down to tsdb codec. #132622

Merged

martijnvg mentioned this pull request Aug 12, 2025

Bulk doc value loading at codec level #128445

Closed

6 tasks

martijnvg closed this Aug 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Push compute engine value loading down to tsdb codec #132460

Push compute engine value loading down to tsdb codec #132460

Uh oh!

martijnvg commented Aug 5, 2025 •

edited

Loading

Uh oh!

martijnvg commented Aug 6, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Push compute engine value loading down to tsdb codec #132460

Push compute engine value loading down to tsdb codec #132460

Uh oh!

Conversation

martijnvg commented Aug 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martijnvg commented Aug 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

martijnvg commented Aug 5, 2025 •

edited

Loading

martijnvg commented Aug 6, 2025 •

edited

Loading