[Failure Store] Conceptually introduce the failure store lifecycle #125258

gmarouli · 2025-03-19T20:26:42Z

This PR introduces the concept of the failure lifecycle in the DataStream data structure. For now, the failure store lifecycle and the data stream lifecycle use exactly the same configuration, but this PR adds most of the wiring needed in the DataStreamLifecycleService to support the feature.

The changes include:

- Introduction of getters for the two different lifecycles in the DataStream. We also split the retrieval of backing and failure indices past retention, which gives us the flexibility to expand on failure retention as needed in the future.
- Usage of the failure lifecycle getter in the DataStreamLifecycleService during the rollover and deletion steps.
- DataStreamTests.java underwent a lot of changes because of the change in how data and failure indices past retention are retrieved. We also merged the tests for data retention and effective retention because of the extensive overlap in their set-up.
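
As a rough illustration of the getter change, the sketch below shows a data stream exposing separate getters for the data lifecycle and the failure lifecycle while both still return the same configuration. All names here (LifecycleGetterSketch, SketchDataStream, Lifecycle, and both getter names) are hypothetical stand-ins chosen for the example, not the actual Elasticsearch classes or signatures.

```java
import java.time.Duration;

// Hypothetical, simplified stand-ins; these are not the real Elasticsearch classes.
final class LifecycleGetterSketch {

    // A lifecycle reduced to the one setting that matters here: retention.
    record Lifecycle(Duration dataRetention) {}

    // The data stream holds a single lifecycle configuration in its state.
    record SketchDataStream(String name, Lifecycle lifecycle) {

        // Lifecycle that applies to the backing (data) indices.
        Lifecycle getDataLifecycle() {
            return lifecycle;
        }

        // Lifecycle that applies to the failure indices. For now it returns the
        // same configuration, but callers already ask for it separately, so it
        // can diverge later without touching the call sites.
        Lifecycle getFailuresLifecycle() {
            return lifecycle;
        }
    }
}
```

The point of the split is that a caller such as a lifecycle service can already choose the getter that matches the index component it is processing, even though both currently yield the same value.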

elasticsearchmachine · 2025-03-19T21:22:41Z

Pinging @elastic/es-data-management (Team:Data Management)

jbaiera

LGTM, left one small comment

jbaiera · 2025-03-21T19:11:53Z

server/src/main/java/org/elasticsearch/cluster/metadata/DataStream.java

+     * NOTE that this specifically does not return the write index of the data stream as usually retention
+     * is treated differently for the write index (i.e. they first need to be rolled over)
+     */
+    public List<Index> getBackingIndicesPastRetention(


This method and the one below seem to share most of their logic. Would it make sense to pass in the lifecycle and the indices as arguments and deduplicate the logic?

I have been going back and forth on this. If we do that we will change the intention of the method.

Right now I think it encapsulates a lot of the logic. That is, we ask the data stream to figure out which of its backing indices and which of its failure indices are past retention, based on the lifecycle configuration it holds in its internal state.

If we change this to pass the lifecycle and the indices as arguments, we break the encapsulation a bit and it becomes more of a helper method, because a caller could provide an arbitrary list of indices and an arbitrary retention. This is not necessarily an issue, considering it is only used in one place.

I thought of an intermediate approach: we keep a getBackingIndicesPastRetention method and pass it a boolean that selects whether to look at the failure store, plus the actual retention. This still ensures that the indices belong to the data stream, but it gives us the freedom to choose the desired retention and the index component. It also allowed me to unify the tests, which was a nice plus.

I will include it in the follow-up PR, because then we can see whether it works nicely with the separate retentions.
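
A minimal sketch of that intermediate approach, under simplifying assumptions: the caller chooses the component with a boolean and supplies the retention, while the candidate indices always come from the data stream itself. The names PastRetentionSketch, SketchDataStream, SketchIndex, and getIndicesPastRetention are hypothetical, as is the convention that the last element of each list is the write index; the real method in DataStream.java may differ in signature and details.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-ins; these are not the real Elasticsearch classes.
final class PastRetentionSketch {

    // An index together with the time its retention clock starts (assumed here).
    record SketchIndex(String name, Instant generationTime) {}

    // Indices are ordered oldest first; the last entry of each list is the write index.
    record SketchDataStream(List<SketchIndex> backingIndices, List<SketchIndex> failureIndices) {

        // The caller picks the component and the retention, but the candidate
        // indices always come from this data stream's own state.
        List<SketchIndex> getIndicesPastRetention(boolean failureStore, Duration retention, Instant now) {
            List<SketchIndex> candidates = failureStore ? failureIndices : backingIndices;
            List<SketchIndex> pastRetention = new ArrayList<>();
            if (retention == null) {
                return pastRetention; // no retention configured, so nothing expires
            }
            // Skip the write index (last element): it must be rolled over first.
            for (int i = 0; i < candidates.size() - 1; i++) {
                SketchIndex index = candidates.get(i);
                if (index.generationTime().plus(retention).isBefore(now)) {
                    pastRetention.add(index);
                }
            }
            return pastRetention;
        }
    }
}
```

Because the method never accepts an index list from outside, it cannot be asked about indices that do not belong to the data stream, which is the encapsulation concern raised above; the boolean and the retention are the only degrees of freedom given to the caller.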

See b87eebf

elasticsearchmachine · 2025-03-26T11:23:04Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 125258

gmarouli · 2025-03-26T11:58:13Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

…lastic#125258)
* Specify index component when retrieving lifecycle
* Add getters for the failure lifecycle
* Conceptually introduce the failure store lifecycle (even for now it's the same)
(cherry picked from commit 6503c1b)
# Conflicts:
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/DataStreamLifecycleService.java
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/action/TransportExplainDataStreamLifecycleAction.java
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/action/TransportGetDataStreamLifecycleStatsAction.java
# server/src/main/java/org/elasticsearch/cluster/metadata/ProjectMetadata.java
# server/src/test/java/org/elasticsearch/cluster/metadata/DataStreamTests.java
# server/src/test/java/org/elasticsearch/cluster/metadata/MetadataCreateDataStreamServiceTests.java

…125258) (#125657)
* Specify index component when retrieving lifecycle
* Add getters for the failure lifecycle
* Conceptually introduce the failure store lifecycle (even for now it's the same)
(cherry picked from commit 6503c1b)
# Conflicts:
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/DataStreamLifecycleService.java
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/action/TransportExplainDataStreamLifecycleAction.java
# modules/data-streams/src/main/java/org/elasticsearch/datastreams/lifecycle/action/TransportGetDataStreamLifecycleStatsAction.java
# server/src/main/java/org/elasticsearch/cluster/metadata/ProjectMetadata.java
# server/src/test/java/org/elasticsearch/cluster/metadata/DataStreamTests.java
# server/src/test/java/org/elasticsearch/cluster/metadata/MetadataCreateDataStreamServiceTests.java

…lastic#125258)
* Specify index component when retrieving lifecycle
* Add getters for the failure lifecycle
* Conceptually introduce the failure store lifecycle (even for now it's the same)

gmarouli added 3 commits March 19, 2025 14:14

Specify index component when retrieving lifecycle

e24ff93

Add getters for the failure lifecycle

1036ff8

Conceptually introduce the failure store lifecycle (even for now it's the same)

804f2fe

elasticsearchmachine added the v9.1.0 label Mar 19, 2025

gmarouli added >non-issue :Data Management/Data streams Data streams and their lifecycles v8.19.0 labels Mar 19, 2025

Merge branch 'main' into failure-store/introduce-failure-storelifecycle-getter

fb45ad6

gmarouli requested a review from jbaiera March 19, 2025 21:22

gmarouli marked this pull request as ready for review March 19, 2025 21:22

elasticsearchmachine added the Team:Data Management Meta label for data/management team label Mar 19, 2025

gmarouli added 3 commits March 21, 2025 10:44

Merge with main

27dce96

Merge branch 'main' into failure-store/introduce-failure-storelifecycle-getter

39058c5

Merge branch 'main' into failure-store/introduce-failure-storelifecycle-getter

4e936b8

jbaiera approved these changes Mar 26, 2025

gmarouli added the auto-backport Automatically create backport pull requests when merged label Mar 26, 2025

gmarouli merged commit 6503c1b into elastic:main Mar 26, 2025
17 checks passed

gmarouli deleted the failure-store/introduce-failure-storelifecycle-getter branch March 26, 2025 11:21

elasticsearchmachine added the backport pending label Mar 26, 2025

gmarouli mentioned this pull request Mar 26, 2025

[8.x] [Failure Store] Conceptually introduce the failure store lifecycle (#125258) #125657

Merged

gmarouli removed the backport pending label Mar 26, 2025
