Conversation

@fcofdez (Contributor) commented Apr 24, 2025

Relates ES-10339

@fcofdez fcofdez added >enhancement :Distributed Indexing/Recovery Anything around constructing a new shard, either from a local or a remote source. Team:Distributed Indexing Meta label for Distributed Indexing team labels Apr 24, 2025
@elasticsearchmachine (Collaborator):

Pinging @elastic/es-distributed-indexing (Team:Distributed Indexing)

@elasticsearchmachine (Collaborator):

Hi @fcofdez, I've created a changelog YAML for you.

@elasticsearchmachine elasticsearchmachine added the serverless-linked Added by automation, don't add manually label Apr 24, 2025
@fcofdez fcofdez requested a review from henningandersen April 24, 2025 14:18
@henningandersen (Contributor) left a comment:

Looks good though I have a comment on the cleanup.

onGoingRecoveries.markRecoveryAsDone(recoveryId);
return null;
}), indexShard::preRecovery);
try (onCompletion) {
Contributor:

I would think this releases the recovery monitor and the recovery-ref too soon? My intuition would be that it should only be done when the action completes?

Contributor Author (@fcofdez):

My understanding is that the RecoveryTarget would be retained until the recovery is marked as done (since the initial refCount=1 from AbstractRefCounted corresponds to that decRef). But just to be on the safe side, I've reverted to the previous behaviour of releasing the RecoveryRef once the action returns.
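To make the ref-counting argument above easier to follow, here is a minimal, self-contained sketch in plain Java (illustrative names only; this is not the actual RecoveryTarget or AbstractRefCounted code). The object starts with one reference that stands for the in-progress recovery; short-lived references taken around individual actions do not trigger cleanup on their own, and only releasing that initial reference (the markRecoveryAsDone step in the discussion above) runs the final cleanup.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Illustrative sketch of the ref-counting lifecycle discussed above
// (hypothetical class; not the real RecoveryTarget / AbstractRefCounted).
class RecoveryTargetSketch {
    // Starts at 1: this initial reference stands for "recovery in progress"
    // and is only released when the recovery is marked as done.
    private final AtomicInteger refCount = new AtomicInteger(1);

    void incRef() {
        refCount.incrementAndGet();
    }

    void decRef() {
        if (refCount.decrementAndGet() == 0) {
            closeInternal(); // runs only after the initial ref AND all short-lived refs are released
        }
    }

    private void closeInternal() {
        System.out.println("recovery target cleaned up");
    }

    public static void main(String[] args) {
        RecoveryTargetSketch target = new RecoveryTargetSketch();

        // RecoveryRef-style usage: hold a reference for the duration of one action.
        target.incRef();
        try {
            // ... perform an action against the recovery target ...
        } finally {
            target.decRef(); // releasing this reference does not clean up yet
        }

        // markRecoveryAsDone-equivalent: drop the initial reference; closeInternal() fires here.
        target.decRef();
    }
}
```

In this simplified picture, releasing the per-action reference when the action returns (the reverted behaviour mentioned above) corresponds to the try/finally block, while the final cleanup still waits for the initial reference to be dropped.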

@henningandersen (Contributor) left a comment:

LGTM (though I'd like Iraklis to have a look at RecoveriesCollection if possible).

throw new IndexShardClosedException(shardId);
}
assert recoveryRef.target().shardId().equals(shardId);
assert recoveryRef.target().indexShard().routingEntry().isPromotableToPrimary();
Contributor:

Looks like this was added here. I'm also not sure I understand why; perhaps @kingherc remembers and can confirm that the assertion is not significant?

Contributor:

Not off the top of my head. But going back to the code, I see we made a special branch in PeerRecoveryTargetService#doRecovery() with if (indexShard.routingEntry().isPromotableToPrimary() == false) { for unpromotables that basically quick-skips all recovery stages and closes the RecoveryRef as well. So the point of the assertion at the time was that there should be no other coordination needed for unpromotables to justify getting the RecoveryRef.

Seeing, though, that this PR now introduces some coordination between unpromotables, it probably makes sense to remove the assertion.
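For readers unfamiliar with the branch described above, here is a simplified, self-contained sketch of its shape (hypothetical types and method names; not the actual PeerRecoveryTargetService code). Unpromotable shards take a fast path that skips the recovery stages and releases the RecoveryRef immediately, which is what the assertion relied on before this PR introduced coordination between unpromotables.

```java
// Simplified sketch of the fast path described above
// (hypothetical types; not the real PeerRecoveryTargetService#doRecovery).
class DoRecoverySketch {

    interface RecoveryRef extends AutoCloseable {
        long recoveryId();

        @Override
        void close(); // narrowed: no checked exception, so try-with-resources needs no catch
    }

    static void doRecovery(boolean promotableToPrimary, RecoveryRef recoveryRef) {
        if (promotableToPrimary == false) {
            // Fast path for unpromotable shards: skip all recovery stages
            // and release the RecoveryRef right away.
            try (recoveryRef) {
                markRecoveryAsDone(recoveryRef.recoveryId());
            }
            return;
        }
        // Regular peer-recovery path for promotable shards.
        runRecoveryStages(recoveryRef);
    }

    static void markRecoveryAsDone(long recoveryId) {
        System.out.println("recovery " + recoveryId + " marked as done");
    }

    static void runRecoveryStages(RecoveryRef recoveryRef) {
        try (recoveryRef) {
            System.out.println("running full recovery stages for recovery " + recoveryRef.recoveryId());
        }
    }

    public static void main(String[] args) {
        RecoveryRef ref = new RecoveryRef() {
            @Override
            public long recoveryId() {
                return 42L;
            }

            @Override
            public void close() {
                System.out.println("RecoveryRef released");
            }
        };
        doRecovery(false, ref); // unpromotable: fast path, no recovery stages
    }
}
```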

Contributor:

(I did not fully review this PR, but feel free to tell me if I should.)

@fcofdez fcofdez merged commit c5c3615 into elastic:main May 1, 2025
16 checks passed

Labels

:Distributed Indexing/Recovery (Anything around constructing a new shard, either from a local or a remote source)
>enhancement
serverless-linked (Added by automation, don't add manually)
Team:Distributed Indexing (Meta label for Distributed Indexing team)
v9.1.0
