Add practical tips for downsampling. #3340

gmarouli · 2025-10-06T13:57:34Z

In this PR, we propose to add practical tips for downsampling. For now this includes, a guideline on how to choose the downsampling interval. And then specifically for ILM, an explanation on how downsampling relates with tiers. After elastic/elasticsearch#135834, we should also add here the option to disable force merge.

kkrik-es · 2025-10-06T13:59:43Z

Can we wait for Marci to submit her update, then port these tips to the new structure?

kkrik-es · 2025-10-06T14:02:09Z

manage-data/data-store/data-streams/run-downsampling.md

+## Practical tips
+
+Downsampling requires reading and indexing the contents of a backing index. The following guidelines can help you get the most out of it.
+


Do we need a note about rollover? To avoid creating backing indices that are too big..

I have been going back and forth for this. For ILM it's easy because it's part of the policy, for data stream lifecycle, I would suggest that if we really think that it should be less maybe we should set it to something less. Right?

You mean, update the default? We can do that at a later point, but what about older versions, or ILM configurations with existing rollover overrides? It could still help to suggest a best practice here.

Yes, we could update the default, that would apply on all version unless the user chose to overwrite it. I restructure it a bit so we can have ILM focused recommendations. But if we think it should be reduced, we should consider updating the default for DLM as well.

Let's file a tracking issue for this, so that we don't forget.

gmarouli · 2025-10-06T14:02:21Z

Can we wait for Marci to submit her update, then port these tips to the new structure?

It is created on the updated downsampling page. Right?

github-actions · 2025-10-07T07:51:23Z

🔍 Preview links for changed docs

manage-data/data-store/data-streams/run-downsampling.md

manage-data/data-store/data-streams/run-downsampling.md

Co-authored-by: Kostas Krikellas <[email protected]>

manage-data/data-store/data-streams/run-downsampling.md

Co-authored-by: Kostas Krikellas <[email protected]>

kkrik-es · 2025-10-07T12:44:43Z

manage-data/data-store/data-streams/run-downsampling.md

+
+### Choosing the downsampling interval
+
+When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%.


Suggested change

When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%.

When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds, downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%.

kkrik-es

Let's wait for Marci to have a pass too.

marciw

I gave this a quick edit -- let me know if anything's unclear :)

manage-data/data-store/data-streams/run-downsampling.md

Co-authored-by: Marci W <[email protected]>

gmarouli · 2025-10-08T08:15:44Z

manage-data/data-store/data-streams/run-downsampling.md

+
+### Reduce the index size (ILM only)
+
+When configuring an ILM policy with downsampling, use the [rollover action](elasticsearch://reference/elasticsearch/index-lifecycle-actions/ilm-rollover.md) in the `hot` phase to control index size. Using smaller indices helps to minimize the impact of downsampling on a cluster's performance. 


I think it's important here to say use and not define. When writing an ILM policy a user needs to define a rollover action no matter what. However, if they are using downsampling, they can consider using this to reduce the size of their index. I want us to be careful to not imply that a user needs to define it only if they are trying to reduce the size. @marciw does this make sense?

Yes, I see what you mean. we could try "set the rollover action to run in the hot phase"

("use" doesn't seem idiomatic to me here, so i'm hoping we can find an alternative)

manage-data/data-store/data-streams/run-downsampling.md

gmarouli · 2025-10-08T08:18:20Z

I am adding @leontyevdv as a reviewer because he recently watched a tutorial on how to configure TSDS and he can tell us how well it reads for a user.

marciw

made a few more more comments but approving to unblock 🚀

In this PR, we propose to add practical tips for downsampling. For now this includes, a guideline on how to choose the downsampling interval. And then specifically for ILM, an explanation on how downsampling relates with tiers. After elastic/elasticsearch#135834, we should also add here the option to disable force merge. --------- Co-authored-by: Kostas Krikellas <[email protected]> Co-authored-by: Marci W <[email protected]>

Add practical tips for downsampling

ccceb1e

gmarouli requested review from a team as code owners October 6, 2025 13:57

gmarouli requested review from kkrik-es and marciw October 6, 2025 13:58

kkrik-es reviewed Oct 6, 2025

View reviewed changes

gmarouli added 2 commits October 7, 2025 09:48

Add ILM specific section

542ef9a

Fix link to migrate action

1e32d3c

kkrik-es reviewed Oct 7, 2025

View reviewed changes

manage-data/data-store/data-streams/run-downsampling.md Outdated Show resolved Hide resolved

gmarouli and others added 2 commits October 7, 2025 11:02

Apply review comments

69f7ddf

Co-authored-by: Kostas Krikellas <[email protected]>

Merge branch 'main' into downsampling-practical-tips

4dcb367

kkrik-es reviewed Oct 7, 2025

View reviewed changes

manage-data/data-store/data-streams/run-downsampling.md Outdated Show resolved Hide resolved

kkrik-es reviewed Oct 7, 2025

View reviewed changes

manage-data/data-store/data-streams/run-downsampling.md Outdated Show resolved Hide resolved

gmarouli and others added 2 commits October 7, 2025 14:44

Apply suggestions from code review

2c7aac1

Co-authored-by: Kostas Krikellas <[email protected]>

Rearrange ILM tips

566abe3

kkrik-es reviewed Oct 7, 2025

View reviewed changes

kkrik-es approved these changes Oct 7, 2025

View reviewed changes

marciw reviewed Oct 7, 2025

View reviewed changes

gmarouli and others added 2 commits October 8, 2025 11:05

Apply suggestions from code review

6de3d3d

Co-authored-by: Marci W <[email protected]>

Tweaks

3244f09

gmarouli commented Oct 8, 2025

View reviewed changes

gmarouli requested a review from marciw October 8, 2025 08:16

gmarouli commented Oct 8, 2025

View reviewed changes

manage-data/data-store/data-streams/run-downsampling.md Outdated Show resolved Hide resolved

Merge branch 'main' into downsampling-practical-tips

abf557b

gmarouli requested a review from leontyevdv October 8, 2025 08:18

gmarouli self-assigned this Oct 8, 2025

leontyevdv approved these changes Oct 8, 2025

View reviewed changes

marciw approved these changes Oct 8, 2025

View reviewed changes

gmarouli added 2 commits October 9, 2025 10:13

Apply comments from review

30ddce8

Merge branch 'main' into downsampling-practical-tips

0eed420

gmarouli merged commit b916a47 into main Oct 9, 2025
7 checks passed

gmarouli deleted the downsampling-practical-tips branch October 9, 2025 07:24

		## Practical tips

		Downsampling requires reading and indexing the contents of a backing index. The following guidelines can help you get the most out of it.


		### Choosing the downsampling interval

		When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%.


		### Reduce the index size (ILM only)

		When configuring an ILM policy with downsampling, use the [rollover action](elasticsearch://reference/elasticsearch/index-lifecycle-actions/ilm-rollover.md) in the `hot` phase to control index size. Using smaller indices helps to minimize the impact of downsampling on a cluster's performance.

Add practical tips for downsampling. #3340

Add practical tips for downsampling. #3340

Uh oh!

Conversation

gmarouli commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kkrik-es commented Oct 6, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gmarouli commented Oct 6, 2025

Uh oh!

github-actions bot commented Oct 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔍 Preview links for changed docs

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kkrik-es left a comment

Choose a reason for hiding this comment

Uh oh!

marciw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gmarouli commented Oct 8, 2025

Uh oh!

marciw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

gmarouli commented Oct 6, 2025 •

edited

Loading

github-actions bot commented Oct 7, 2025 •

edited

Loading