OTLP: add support for histograms #133902

felixbarny · 2025-09-01T06:58:48Z

Adds support for histograms and exponential histograms.

Part of #133057.

elasticsearchmachine · 2025-09-01T06:59:12Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

...lugin/otel-data/src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPoint.java

Co-authored-by: Jonas Kunz <[email protected]>

...lugin/otel-data/src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPoint.java

Co-authored-by: Jonas Kunz <[email protected]>

.../src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPointGroupingContext.java

kkrik-es · 2025-09-08T07:35:57Z

...java/org/elasticsearch/xpack/oteldata/otlp/datapoint/ExponentialHistogramConverterTests.java

+    }
+
+    @ParametersFactory(argumentFormatting = "%1$s")
+    public static List<Object[]> testCases() {


Can we add randomized testing too?

Not sure how this would work. If we create the histogram randomly, how do we determine the expected values/counts?

I guess we could do some sort of randomized smoke testing to check that converting randomly created histograms does not throw exceptions, without validating the counts/values. Is that what you had in mind?

You can generate a random set of values, then build an ExpHisto or bucketed histo out of them and a TDigest as a control. Converting should get you close to the control TDigest, with some error margin?

I assume we'd need to compare the error in calculating percentile values. Comparing the resulting buckets directly doesn't seem feasible as there are no guarantee that an exponential histogram and a TDigest histogram would create an equal number of buckets for a given set of raw values.

This seems to test the algorithm for creating the histograms and calculating the percentiles more so than the actual conversion. Also not sure if there's a good way to estimate a sensible error bound. The error will depend a lot on the parameters of the histogram algorithm, such as the max number of buckets. Both algorithms have different parameters that affect the precision in different ways.

It goes into the direction of comparing the inherent accuracy of TDigest and exponential histograms which seems outside of the scope of converting an externally recorded exponential histogram into the most appropriate TDigest-based representation. We know that there will be a precision loss and we're working on having native support for exponential histograms.

Are you suggesting that the error from the double conversion is unbound? That would be really bad..

Otherwise, let's make sure there's coverage with random datasets for the logic that performs the conversion. Percentile results should be close, with reasonable error margins.

I'm not saying it's unbound, it's just difficult to set a bound on it.
Ideally, the error shouldn't exceed the individual error from exponential histograms plus TDigest. I made some empirical testing using different distributions and had to multiply the combined error by 2 to be on the safe side.

Does that level of testing seem good for now?

kkrik-es · 2025-09-08T07:46:02Z

...a/src/test/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/HistogramConverterTests.java

+        this.valid = valid;
+    }
+
+    public void testHistograms() throws Exception {


Same can we do randomized testing too?

This is more difficult compared to the testing for exponential histograms I did. I don't think there's a reasonable error bound that we can calculate or estimate for explicit boundary histograms as the bucket boundaries are user-configurable.

I'd say we skip this for now.

…istograms

kkrik-es · 2025-09-09T07:26:35Z

.../elasticsearch/xpack/oteldata/otlp/datapoint/ExponentialHistogramConverterAccuracyTests.java

+        }
+        double exponentialHistogramMaxError = QuantileAccuracyTests.getMaximumRelativeError(samples, numBuckets);
+        double combinedRelativeError = rawTDigestMaxError + exponentialHistogramMaxError;
+        assertThat(convertedTDigestMaxError, lessThanOrEqualTo(combinedRelativeError * 2));


Let's add a comment about the 2 factor here.

I assume you ran it 1000 times to make sure this won't be noisy..

I've added a comment.

I assume you ran it 1000 times to make sure this won't be noisy..

Yes I have. I have now also executed 10,000 runs and with the lowest number of buckets and samples. There were some failures, so I've bumped the min number of samples we're testing with.

kkrik-es · 2025-09-09T07:27:36Z

.../elasticsearch/xpack/oteldata/otlp/datapoint/ExponentialHistogramConverterAccuracyTests.java

+    }
+
+    private static TDigest convertToTDigest(ExponentialHistogramDataPoint otlpHistogram) {
+        TDigest result = TDigest.createAvlTreeDigest(arrays, 100);


Nit: I'd use createHybridDigest that's the default.

I've tested it but this makes the error worse so that the assertion frequently fails.

kkrik-es

Nice.

OTLP: add support for histograms

b7584ad

felixbarny requested review from JonasKunz and kkrik-es September 1, 2025 06:58

felixbarny self-assigned this Sep 1, 2025

felixbarny added >non-issue :StorageEngine/TSDB You know, for Metrics labels Sep 1, 2025

felixbarny mentioned this pull request Sep 1, 2025

Add OTLP metrics endpoint #133057

Closed

elasticsearchmachine added Team:StorageEngine v9.2.0 external-contributor Pull request authored by a developer outside the Elasticsearch team labels Sep 1, 2025

[CI] Auto commit changes from spotless

8492a44

JonasKunz approved these changes Sep 1, 2025

View reviewed changes

...lugin/otel-data/src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPoint.java Outdated Show resolved Hide resolved

felixbarny and others added 5 commits September 1, 2025 09:59

Apply suggestions from code review

b3c2f4b

Co-authored-by: Jonas Kunz <[email protected]>

[CI] Auto commit changes from spotless

f9e64c6

Merge remote-tracking branch 'origin/main' into otlp-histograms

444f614

Add missing import

222f9aa

Merge remote-tracking branch 'origin/main' into otlp-histograms

2ed63ea

JonasKunz reviewed Sep 1, 2025

View reviewed changes

...lugin/otel-data/src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPoint.java Outdated Show resolved Hide resolved

felixbarny and others added 6 commits September 1, 2025 12:28

Add buildMetricValue method to histogram data points

95b1bda

[CI] Auto commit changes from spotless

6842cd4

Simplify boolean expression

f159188

Co-authored-by: Jonas Kunz <[email protected]>

Merge remote-tracking branch 'origin/main' into otlp-histograms

cf6a514

Add support for mapping hints

ed203a4

Merge branch 'main' into otlp-histograms

0d588ff

felixbarny requested a review from a team as a code owner September 2, 2025 10:09

felixbarny added 4 commits September 2, 2025 13:09

Merge remote-tracking branch 'origin/main' into otlp-histograms

64cd047

Fix compile error after merge

d3cfe08

Merge remote-tracking branch 'origin/main' into otlp-histograms

8c4c621

Fix compile error after merge

7f5d421

felixbarny and others added 3 commits September 2, 2025 16:34

Merge remote-tracking branch 'origin/main' into otlp-histograms

b26d645

[CI] Auto commit changes from spotless

a8efd23

Merge branch 'main' into otlp-histograms

2e5866d

kkrik-es reviewed Sep 8, 2025

View reviewed changes

.../src/main/java/org/elasticsearch/xpack/oteldata/otlp/datapoint/DataPointGroupingContext.java Show resolved Hide resolved

kkrik-es reviewed Sep 8, 2025

View reviewed changes

felixbarny and others added 9 commits September 8, 2025 10:50

Add rest tests for histograms

f019162

Add comment about converting exponential histograms to TDigest

90cbac2

[CI] Auto commit changes from spotless

ed73673

Add accuracy tests for histogram converter

18e7f76

Merge remote-tracking branch 'felixbarny/otlp-histograms' into otlp-h…

06a0039

…istograms

[CI] Auto commit changes from spotless

ca49987

Test with more distributions

9cf95b0

Merge remote-tracking branch 'origin/main' into otlp-histograms

10d6556

Merge remote-tracking branch 'felixbarny/otlp-histograms' into otlp-h…

35d4c08

…istograms

kkrik-es reviewed Sep 9, 2025

View reviewed changes

kkrik-es approved these changes Sep 9, 2025

View reviewed changes

Add comment about error bound

2bbdd25

felixbarny enabled auto-merge (squash) September 9, 2025 12:09

felixbarny merged commit 63840f1 into elastic:main Sep 9, 2025
34 checks passed

felixbarny deleted the otlp-histograms branch September 9, 2025 12:24

rjernst pushed a commit to rjernst/elasticsearch that referenced this pull request Sep 9, 2025

OTLP: add support for histograms (elastic#133902)

701ffca

Kubik42 pushed a commit to Kubik42/elasticsearch that referenced this pull request Sep 9, 2025

OTLP: add support for histograms (elastic#133902)

af60c2a

OTLP: add support for histograms #133902

OTLP: add support for histograms #133902

Uh oh!

Conversation

felixbarny commented Sep 1, 2025

Uh oh!

elasticsearchmachine commented Sep 1, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kkrik-es left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants