Skip to content

Conversation

@ywangd
Copy link
Member

@ywangd ywangd commented Jul 14, 2025

The shard started may not be visible on the master node if the wait is on a data node. In that case, the DiskThreshold monitor may use stale cluster state for releasing read-only blocks. This PR fixes it by waiting on the master node, which is the behaviour before #129872.

Resolves: #131146

The shard started may not be visible on the master node if the wait is
on a data node. In that case, the DiskThreshold monitor may use stale
cluster state for releasing read-only blocks. This PR fixes it by
waiting on the master node, which is the behaviour before elastic#129872.

Resolves: elastic#131146
@ywangd ywangd requested review from DaveCTurner and pxsalehi July 14, 2025 02:07
@ywangd ywangd added >test Issues or PRs that are addressing/adding tests :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v9.2.0 labels Jul 14, 2025
@elasticsearchmachine elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Jul 14, 2025
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

Copy link
Contributor

@DaveCTurner DaveCTurner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

…routing/allocation/DiskThresholdMonitorIT.java

Co-authored-by: David Turner <[email protected]>
@ywangd ywangd added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jul 14, 2025
@ywangd
Copy link
Member Author

ywangd commented Jul 14, 2025

@elasticmachine update branch

@ywangd
Copy link
Member Author

ywangd commented Jul 14, 2025

@elasticmachine update branch

@elasticsearchmachine elasticsearchmachine merged commit 072e6c7 into elastic:main Jul 15, 2025
33 checks passed
@ywangd ywangd deleted the es-131146-fix branch July 15, 2025 01:25
mridula-s109 pushed a commit to mridula-s109/elasticsearch that referenced this pull request Jul 17, 2025
The shard started may not be visible on the master node if the wait is
on a data node. In that case, the DiskThreshold monitor may use stale
cluster state for releasing read-only blocks. This PR fixes it by
waiting on the master node, which is the behaviour before elastic#129872.

Resolves: elastic#131146
mridula-s109 pushed a commit to mridula-s109/elasticsearch that referenced this pull request Jul 17, 2025
The shard started may not be visible on the master node if the wait is
on a data node. In that case, the DiskThreshold monitor may use stale
cluster state for releasing read-only blocks. This PR fixes it by
waiting on the master node, which is the behaviour before elastic#129872.

Resolves: elastic#131146
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) Team:Distributed Coordination Meta label for Distributed Coordination team >test Issues or PRs that are addressing/adding tests v9.2.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[CI] DiskThresholdMonitorIT testFloodStageExceeded failing

5 participants