[Test] Wait on master node for shard started #131172

ywangd · 2025-07-14T02:07:51Z

The shard started may not be visible on the master node if the wait is on a data node. In that case, the DiskThreshold monitor may use stale cluster state for releasing read-only blocks. This PR fixes it by waiting on the master node, which is the behaviour before #129872.

Resolves: #131146

The shard started may not be visible on the master node if the wait is on a data node. In that case, the DiskThreshold monitor may use stale cluster state for releasing read-only blocks. This PR fixes it by waiting on the master node, which is the behaviour before elastic#129872. Resolves: elastic#131146

elasticsearchmachine · 2025-07-14T02:08:16Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

DaveCTurner

LGTM

...nalClusterTest/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorIT.java

…routing/allocation/DiskThresholdMonitorIT.java Co-authored-by: David Turner <[email protected]>

ywangd · 2025-07-14T07:34:53Z

@elasticmachine update branch

ywangd · 2025-07-14T22:06:14Z

@elasticmachine update branch

The shard started may not be visible on the master node if the wait is on a data node. In that case, the DiskThreshold monitor may use stale cluster state for releasing read-only blocks. This PR fixes it by waiting on the master node, which is the behaviour before elastic#129872. Resolves: elastic#131146

ywangd requested review from DaveCTurner and pxsalehi July 14, 2025 02:07

ywangd added >test Issues or PRs that are addressing/adding tests :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v9.2.0 labels Jul 14, 2025

elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Jul 14, 2025

ywangd mentioned this pull request Jul 14, 2025

[CI] DiskThresholdMonitorIT testFloodStageExceeded failing #131146

Closed

DaveCTurner approved these changes Jul 14, 2025

View reviewed changes

...nalClusterTest/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorIT.java Outdated Show resolved Hide resolved

Update server/src/internalClusterTest/java/org/elasticsearch/cluster/…

acd8969

…routing/allocation/DiskThresholdMonitorIT.java Co-authored-by: David Turner <[email protected]>

ywangd added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jul 14, 2025

Merge branch 'main' into es-131146-fix

77e1cc9

pxsalehi approved these changes Jul 14, 2025

View reviewed changes

elasticmachine and others added 3 commits July 14, 2025 18:06

Merge branch 'main' into es-131146-fix

eacbff7

Merge remote-tracking branch 'origin/main' into es-131146-fix

ef90ac0

unmute

b713053

elasticsearchmachine merged commit 072e6c7 into elastic:main Jul 15, 2025
33 checks passed

ywangd deleted the es-131146-fix branch July 15, 2025 01:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Test] Wait on master node for shard started #131172

[Test] Wait on master node for shard started #131172

Uh oh!

ywangd commented Jul 14, 2025

Uh oh!

elasticsearchmachine commented Jul 14, 2025

Uh oh!

DaveCTurner left a comment

Uh oh!

Uh oh!

ywangd commented Jul 14, 2025

Uh oh!

ywangd commented Jul 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

[Test] Wait on master node for shard started #131172

[Test] Wait on master node for shard started #131172

Uh oh!

Conversation

ywangd commented Jul 14, 2025

Uh oh!

elasticsearchmachine commented Jul 14, 2025

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ywangd commented Jul 14, 2025

Uh oh!

ywangd commented Jul 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants