-
Notifications
You must be signed in to change notification settings - Fork 153
Add practical tips for downsampling. #3340
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from 7 commits
ccceb1e
542ef9a
1e32d3c
69f7ddf
4dcb367
2c7aac1
566abe3
6de3d3d
3244f09
abf557b
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
@@ -102,6 +102,38 @@ Set `fixed_interval` to your preferred level of granularity. The original time s | |||||||||||||||||
::: | ||||||||||||||||||
:::: | ||||||||||||||||||
|
||||||||||||||||||
## Practical tips | ||||||||||||||||||
|
||||||||||||||||||
Downsampling requires reading and indexing the contents of a backing index. The following guidelines can help you get the most out of it. | ||||||||||||||||||
gmarouli marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||||||||||||||||||
|
||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Do we need a note about rollover? To avoid creating backing indices that are too big.. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I have been going back and forth for this. For ILM it's easy because it's part of the policy, for data stream lifecycle, I would suggest that if we really think that it should be less maybe we should set it to something less. Right? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. You mean, update the default? We can do that at a later point, but what about older versions, or ILM configurations with existing rollover overrides? It could still help to suggest a best practice here. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yes, we could update the default, that would apply on all version unless the user chose to overwrite it. I restructure it a bit so we can have ILM focused recommendations. But if we think it should be reduced, we should consider updating the default for DLM as well. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Let's file a tracking issue for this, so that we don't forget. |
||||||||||||||||||
### Choosing the downsampling interval | ||||||||||||||||||
gmarouli marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
|
||||||||||||||||||
|
||||||||||||||||||
When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%. | ||||||||||||||||||
|
When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%. | |
When choosing the downsampling interval, you need to consider the original sampling rate of your measurements. Ideally, you would like an interval that would reduce your number of documents by a significant amount. For example, if a sensor sends data every 10 seconds, downsampling to 1 minute would reduce the number of documents by 83%, compared to downsampling to 5 minutes by 96%. |
gmarouli marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
gmarouli marked this conversation as resolved.
Outdated
Show resolved
Hide resolved
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would delete this heading and intro sentence, and then promote the "Phases and tiers" and "Reduce the index size" headings to level 3, so that they're parallel with "choose an optimal downsampling interval"
(I suggested specific changes, just explaining why I'd delete these lines)
You can just make it clear in the text that those best practices are for ILM only, instead of having an ILM subsection
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
### Downsampling with Index Lifecycle Management | |
The following tips apply to data streams downsampled by index lifecycle management (ILM). | |
### Downsampling with Index Lifecycle Management | |
The following tips apply to data streams downsampled by index lifecycle management (ILM). |
Uh oh!
There was an error while loading. Please reload this page.