Skip to content

Commit 9564a8b

Browse files
Cold tier time-range should not be specified (#65546)
Whether the cold tier can handle years depends a lot on the use case and for instance our BWC guarantees. This would need to be part of a specific sizing exercise, so in the spirit of not over-promising, the description of the cold tier has been changed to not mention years.
1 parent aa8ebeb commit 9564a8b

File tree

1 file changed

+33
-33
lines changed

1 file changed

+33
-33
lines changed

docs/reference/datatiers.asciidoc

Lines changed: 33 additions & 33 deletions
Original file line numberDiff line numberDiff line change
@@ -2,24 +2,24 @@
22
[[data-tiers]]
33
== Data tiers
44

5-
A _data tier_ is a collection of nodes with the same data role that
6-
typically share the same hardware profile:
5+
A _data tier_ is a collection of nodes with the same data role that
6+
typically share the same hardware profile:
77

88
* <<content-tier, Content tier>> nodes handle the indexing and query load for content such as a product catalog.
99
* <<hot-tier, Hot tier>> nodes handle the indexing load for time series data such as logs or metrics
10-
and hold your most recent, most-frequently-accessed data.
11-
* <<warm-tier, Warm tier>> nodes hold time series data that is accessed less-frequently
10+
and hold your most recent, most-frequently-accessed data.
11+
* <<warm-tier, Warm tier>> nodes hold time series data that is accessed less-frequently
1212
and rarely needs to be updated.
1313
* <<cold-tier, Cold tier>> nodes hold time series data that is accessed occasionally and not normally updated.
1414

15-
When you index documents directly to a specific index, they remain on content tier nodes indefinitely.
15+
When you index documents directly to a specific index, they remain on content tier nodes indefinitely.
1616

17-
When you index documents to a data stream, they initially reside on hot tier nodes.
17+
When you index documents to a data stream, they initially reside on hot tier nodes.
1818
You can configure <<index-lifecycle-management, {ilm}>> ({ilm-init}) policies
19-
to automatically transition your time series data through the hot, warm, and cold tiers
20-
according to your performance, resiliency and data retention requirements.
19+
to automatically transition your time series data through the hot, warm, and cold tiers
20+
according to your performance, resiliency and data retention requirements.
2121

22-
A node's <<data-node, data role>> is configured in `elasticsearch.yml`.
22+
A node's <<data-node, data role>> is configured in `elasticsearch.yml`.
2323
For example, the highest-performance nodes in a cluster might be assigned to both the hot and content tiers:
2424

2525
[source,yaml]
@@ -33,9 +33,9 @@ node.roles: ["data_hot", "data_content"]
3333

3434
Data stored in the content tier is generally a collection of items such as a product catalog or article archive.
3535
Unlike time series data, the value of the content remains relatively constant over time,
36-
so it doesn't make sense to move it to a tier with different performance characteristics as it ages.
37-
Content data typically has long data retention requirements, and you want to be able to retrieve
38-
items quickly regardless of how old they are.
36+
so it doesn't make sense to move it to a tier with different performance characteristics as it ages.
37+
Content data typically has long data retention requirements, and you want to be able to retrieve
38+
items quickly regardless of how old they are.
3939

4040
Content tier nodes are usually optimized for query performance--they prioritize processing power over IO throughput
4141
so they can process complex searches and aggregations and return results quickly.
@@ -49,10 +49,10 @@ New indices are automatically allocated to the <<content-tier>> unless they are
4949
[[hot-tier]]
5050
=== Hot tier
5151

52-
The hot tier is the {es} entry point for time series data and holds your most-recent,
53-
most-frequently-searched time series data.
54-
Nodes in the hot tier need to be fast for both reads and writes,
55-
which requires more hardware resources and faster storage (SSDs).
52+
The hot tier is the {es} entry point for time series data and holds your most-recent,
53+
most-frequently-searched time series data.
54+
Nodes in the hot tier need to be fast for both reads and writes,
55+
which requires more hardware resources and faster storage (SSDs).
5656
For resiliency, indices in the hot tier should be configured to use one or more replicas.
5757

5858
New indices that are part of a <<data-streams, data stream>> are automatically allocated to the
@@ -62,51 +62,51 @@ hot tier.
6262
[[warm-tier]]
6363
=== Warm tier
6464

65-
Time series data can move to the warm tier once it is being queried less frequently
66-
than the recently-indexed data in the hot tier.
67-
The warm tier typically holds data from recent weeks.
65+
Time series data can move to the warm tier once it is being queried less frequently
66+
than the recently-indexed data in the hot tier.
67+
The warm tier typically holds data from recent weeks.
6868
Updates are still allowed, but likely infrequent.
69-
Nodes in the warm tier generally don't need to be as fast as those in the hot tier.
69+
Nodes in the warm tier generally don't need to be as fast as those in the hot tier.
7070
For resiliency, indices in the warm tier should be configured to use one or more replicas.
7171

7272
[discrete]
7373
[[cold-tier]]
7474
=== Cold tier
7575

76-
Once data in the warm tier is no longer being updated, it can move to the cold tier.
77-
The cold tier typically holds the data from recent months or years.
76+
Once data is no longer being updated, it can move from the warm tier to the cold tier where it
77+
stays for the rest of its life.
7878
The cold tier is still a responsive query tier, but data in the cold tier is not normally updated.
7979
As data transitions into the cold tier it can be compressed and shrunken.
80-
For resiliency, indices in the cold tier can rely on
81-
<<ilm-searchable-snapshot, searchable snapshots>>, eliminating the need for replicas.
80+
For resiliency, indices in the cold tier can rely on
81+
<<ilm-searchable-snapshot, searchable snapshots>>, eliminating the need for replicas.
8282

8383
[discrete]
8484
[[data-tier-allocation]]
8585
=== Data tier index allocation
8686

87-
When you create an index, by default {es} sets
87+
When you create an index, by default {es} sets
8888
<<tier-preference-allocation-filter, `index.routing.allocation.include._tier_preference`>>
8989
to `data_content` to automatically allocate the index shards to the content tier.
9090

91-
When {es} creates an index as part of a <<data-streams, data stream>>,
92-
by default {es} sets
91+
When {es} creates an index as part of a <<data-streams, data stream>>,
92+
by default {es} sets
9393
<<tier-preference-allocation-filter, `index.routing.allocation.include._tier_preference`>>
9494
to `data_hot` to automatically allocate the index shards to the hot tier.
9595

96-
You can override the automatic tier-based allocation by specifying
96+
You can override the automatic tier-based allocation by specifying
9797
<<shard-allocation-filtering, shard allocation filtering>>
9898
settings in the create index request or index template that matches the new index.
9999

100-
You can also explicitly set `index.routing.allocation.include._tier_preference`
101-
to opt out of the default tier-based allocation.
100+
You can also explicitly set `index.routing.allocation.include._tier_preference`
101+
to opt out of the default tier-based allocation.
102102
If you set the tier preference to `null`, {es} ignores the data tier roles during allocation.
103103

104104
[discrete]
105105
[[data-tier-migration]]
106106
=== Automatic data tier migration
107107

108108
{ilm-init} automatically transitions managed
109-
indices through the available data tiers using the <<ilm-migrate, migrate>> action.
110-
By default, this action is automatically injected in every phase.
111-
You can explicitly specify the migrate action to override the default behavior,
109+
indices through the available data tiers using the <<ilm-migrate, migrate>> action.
110+
By default, this action is automatically injected in every phase.
111+
You can explicitly specify the migrate action to override the default behavior,
112112
or use the <<ilm-allocate, allocate action>> to manually specify allocation rules.

0 commit comments

Comments
 (0)