[Downsampling++] Add time series telemetry in xpack usage #134214

gmarouli · 2025-09-05T11:28:17Z

In this PR we introduce telemetry for time series trying to answer the following questions:

How many time series data streams (tsds) does a cluster have?
How many time series indices do they have?
How many tsds are downsampled by ILM?
How many downsampling rounds are being used with ILM?
How many tsds are downsampled by DLM?
How many downsampling rounds are being used with DLM?
How are the numbers differ between serverless and stateful?
Which ILM phase is most commonly used for downsampling?

Fixes: #133953

elasticsearchmachine · 2025-09-05T11:29:03Z

Hi @gmarouli, I've created a changelog YAML for you.

elasticsearchmachine · 2025-09-08T06:09:41Z

Pinging @elastic/es-storage-engine (Team:StorageEngine)

kkrik-es · 2025-09-08T10:36:22Z

...alClusterTest/java/org/elasticsearch/xpack/core/action/TimeSeriesUsageTransportActionIT.java

+        /*
+         * We now add a number of simulated data streams to the cluster state. We mix different combinations of:
+         * - time series and standard data streams & backing indices
+         * - lifecycle with or without downsampling


Suggested change

* - lifecycle with or without downsampling

* - DLM with or without downsampling

kkrik-es · 2025-09-08T10:39:00Z

...alClusterTest/java/org/elasticsearch/xpack/core/action/TimeSeriesUsageTransportActionIT.java

+                var downsamplingConfiguredBy = randomFrom(DownsampledBy.values());
+                boolean isDownsampled = downsamplingConfiguredBy != DownsampledBy.NONE && isTimeSeriesDataStream;
+                // An index/data stream can have both ILM & DLM configured; by default, ILM "wins"
+                boolean hasLifecycle = usually() || (isDownsampled && downsamplingConfiguredBy == DownsampledBy.DLM);


Nit: usually is !rarely so it's almost always.. Maybe use randomDouble() < 0.8 or so, to make it more concrete?

kkrik-es · 2025-09-08T10:56:50Z

...n/core/src/main/java/org/elasticsearch/xpack/core/action/TimeSeriesUsageTransportAction.java

+                tsIndexCount,
+                ilmStats.getDownsamplingStats(),
+                ilmStats.getIlmPolicyStats(),
+                dlmStats.getDownsamplingStats(),


Why dlmStats?

You mean why I picked this variable name?

I thought this call is for ilmStats, surprised to see dlmStats for this arg.

Ah I see. There are two different constructors one that accepts stats for both ILM and DLM and one that only accepts DLM for the serverless use case, this was syntactic sugar to make more explicit that ILM stats would be null.

I will add a comment for clarity

kkrik-es · 2025-09-08T11:04:46Z

...n/core/src/main/java/org/elasticsearch/xpack/core/datastreams/TimeSeriesFeatureSetUsage.java

+                builder.field("phases_in_use", phasesUsedInDownsampling);
+                builder.endObject();
+            }
+            builder.startObject("dlm");


Should we also check for null dlmDownsamplingStats here?

It doesn't hurt, I will add it.

kkrik-es · 2025-09-08T11:06:51Z

...n/core/src/main/java/org/elasticsearch/xpack/core/datastreams/TimeSeriesFeatureSetUsage.java

+        }
+    }
+
+    public record DownsamplingFeatureStats(long dataStreamsCount, long indexCount, long minRounds, double averageRounds, long maxRounds)


Are you planning to add stats on dataset size and reduction, later?

Stats on dataset size and reduction require a lot more infrastructure that we do not have currently. If we do move ahead with these plans then yes, I think we should try to expose them here too.

kkrik-es

Nice and clean. Consider asking Martijn to take a look too.

martijnvg

LGTM 👍

Add time series telemetry in xpack usage

3d81107

gmarouli added >enhancement :StorageEngine/Downsampling Downsampling (replacement for rollups) - Turn fine-grained time-based data into coarser-grained data labels Sep 5, 2025

elasticsearchmachine added the v9.2.0 label Sep 5, 2025

Update docs/changelog/134214.yaml

5e46643

gmarouli added 3 commits September 5, 2025 14:30

Merge branch 'main' into downsampling++/add-basic-telemetry

bd34cb5

Merge branch 'main' into downsampling++/add-basic-telemetry

5ebef0e

Grant new xpack field non-operator access

df8a474

gmarouli marked this pull request as ready for review September 8, 2025 06:09

Merge branch 'main' into downsampling++/add-basic-telemetry

543fe12

gmarouli requested a review from kkrik-es September 8, 2025 06:09

elasticsearchmachine added the Team:StorageEngine label Sep 8, 2025

Polish

69f7642

kkrik-es reviewed Sep 8, 2025

View reviewed changes

kkrik-es approved these changes Sep 8, 2025

View reviewed changes

gmarouli added 2 commits September 8, 2025 14:29

Apply review feedback

1413cc7

Add null check for dlm stats

ae4412f

gmarouli requested a review from martijnvg September 8, 2025 11:33

martijnvg approved these changes Sep 8, 2025

View reviewed changes

gmarouli added 2 commits September 8, 2025 15:14

Add comment about omitting ILM stats

7aa255f

Merge branch 'main' into downsampling++/add-basic-telemetry

9b68b0c

gmarouli added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Sep 8, 2025

elasticsearchmachine merged commit 5374c33 into elastic:main Sep 8, 2025
34 checks passed

gmarouli deleted the downsampling++/add-basic-telemetry branch September 8, 2025 13:55

jdcryans mentioned this pull request Sep 8, 2025

[CI] TimeSeriesFeatureSetUsageTests testEqualsAndHashcode failing #134332

Closed

gmarouli mentioned this pull request Nov 17, 2025

Extend time series telemetry to track the downsampling method #138187

Merged

	* - lifecycle with or without downsampling
	* - DLM with or without downsampling

[Downsampling++] Add time series telemetry in xpack usage #134214

[Downsampling++] Add time series telemetry in xpack usage #134214

Uh oh!

Conversation

gmarouli commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Sep 5, 2025

Uh oh!

elasticsearchmachine commented Sep 8, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kkrik-es left a comment

Choose a reason for hiding this comment

Uh oh!

martijnvg left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gmarouli commented Sep 5, 2025 •

edited

Loading