Skip to content

Conversation

@tlrx
Copy link
Member

@tlrx tlrx commented Jan 30, 2025

When adding support for upgrading closed indices in N-2 version, I randomized the Translog.Durability setting of the closed index with the aim to test the 2 phases closing process.

This caused at least 1 test failure on Windows with the index being closed and the cluster upgraded before the synchronization of the translog had a chance to be executed. I think this cause the engine to be reset on the replica that is promoted as a primary, causing the loss of the operations that were not yet persisted.

Closes #121257

@tlrx tlrx added >test Issues or PRs that are addressing/adding tests :Distributed Indexing/Recovery Anything around constructing a new shard, either from a local or a remote source. v9.0.0 labels Jan 30, 2025
@elasticsearchmachine elasticsearchmachine added the Team:Distributed Indexing Meta label for Distributed Indexing team label Jan 30, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-distributed-indexing (Team:Distributed Indexing)

@tlrx tlrx force-pushed the 2025/01/30/fix-121257 branch from a4e9b1f to 95772a5 Compare January 31, 2025 11:05
Copy link
Contributor

@fcofdez fcofdez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@tlrx tlrx added the auto-backport Automatically create backport pull requests when merged label Feb 3, 2025
@tlrx tlrx merged commit 106b666 into elastic:main Feb 3, 2025
17 checks passed
@tlrx tlrx deleted the 2025/01/30/fix-121257 branch February 3, 2025 11:05
@tlrx
Copy link
Member Author

tlrx commented Feb 3, 2025

Thanks Francisco

@elasticsearchmachine
Copy link
Collaborator

💔 Backport failed

Status Branch Result
9.0 Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 121278

tlrx added a commit to tlrx/elasticsearch that referenced this pull request Feb 3, 2025
…elastic#121278)

When adding support for upgrading closed indices in
N-2 version, I randomized the Translog.Durability setting
of the closed index with the aim to test the 2 phases
closing process.

This caused at least 1 test failure on Windows with the
index being closed and the cluster upgraded before the
synchronization of the translog had a chance to be
executed. I think this cause the engine to be reset on
the replica that is promoted as a primary, causing the
loss of the operations that were not yet persisted.

Closes elastic#121257
Closes elastic#121365
Closes elastic#121423
elasticsearchmachine pushed a commit that referenced this pull request Feb 3, 2025
…#121278) (#121520)

* [9.0][Test] Remove ASYNC translog durability in N-2 bwc upgrade tests (#121278)

When adding support for upgrading closed indices in
N-2 version, I randomized the Translog.Durability setting
of the closed index with the aim to test the 2 phases
closing process.

This caused at least 1 test failure on Windows with the
index being closed and the cluster upgraded before the
synchronization of the translog had a chance to be
executed. I think this cause the engine to be reset on
the replica that is promoted as a primary, causing the
loss of the operations that were not yet persisted.

Closes #121257
Closes #121365
Closes #121423

* unmute
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

auto-backport Automatically create backport pull requests when merged backport pending :Distributed Indexing/Recovery Anything around constructing a new shard, either from a local or a remote source. Team:Distributed Indexing Meta label for Distributed Indexing team >test Issues or PRs that are addressing/adding tests v9.0.1 v9.1.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[CI] FullClusterRestartLuceneIndexCompatibilityIT class failing

3 participants