Conversation

@ywangd
Member

@ywangd ywangd commented Oct 31, 2025

Windows can sometimes be slow at handling the large number of cluster state updates.

Resolves: #137384

@ywangd ywangd requested a review from DaveCTurner October 31, 2025 00:29
@ywangd ywangd added >test Issues or PRs that are addressing/adding tests :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) auto-backport Automatically create backport pull requests when merged v9.2.1 v9.3.0 v8.19.7 v9.1.7 labels Oct 31, 2025
@elasticsearchmachine elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Oct 31, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

}

public void testDelayWithALargeAmountOfShards() throws Exception {
assumeFalse("Windows can be too slow at handling a large number of cluster state updates", Constants.WINDOWS);
Member Author

As commented on the issue, one alternative is to extend the timeout on Windows. But it's already 2 minutes and I'd rather not increase it further. It also doesn't feel all that useful to keep the test on Windows.

Contributor

@DaveCTurner DaveCTurner left a comment

Hmm I think I'd rather wait longer in this case. 2 minutes seems like a long time but it's still kinda arbitrary to fail the test if the worker is even slower than that, and equally arbitrary to exclude a whole platform from some tests just for this reason.

@ywangd
Member Author

ywangd commented Nov 5, 2025

Sure, I pushed 77d30ee to double the waiting time on Windows.
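As a rough illustration of doubling the wait on one platform only, here is a minimal sketch. It is not the code from 77d30ee: the class `PlatformTimeout`, the helper names, and the OS-name check are hypothetical stand-ins for a `Constants.WINDOWS`-style flag, and the 2-minute base value is taken from the discussion above.

```java
import java.time.Duration;
import java.util.Locale;

final class PlatformTimeout {
    // Base wait assumed from the thread ("it's already 2min").
    static final Duration BASE_TIMEOUT = Duration.ofMinutes(2);

    // Hypothetical stand-in for a Constants.WINDOWS-style flag,
    // derived here from an OS name string such as System.getProperty("os.name").
    static boolean isWindows(String osName) {
        return osName.toLowerCase(Locale.ROOT).startsWith("windows");
    }

    // Doubles the base timeout on Windows and leaves it unchanged elsewhere.
    static Duration timeoutFor(String osName) {
        return isWindows(osName) ? BASE_TIMEOUT.multipliedBy(2) : BASE_TIMEOUT;
    }

    public static void main(String[] args) {
        String osName = System.getProperty("os.name");
        System.out.println(osName + " -> " + timeoutFor(osName).toSeconds() + "s");
    }
}
```

Keeping the platform check in one helper means the test body only asks for a timeout and does not branch on the OS itself.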

@ywangd ywangd requested a review from DaveCTurner November 5, 2025 04:56
Contributor

@DaveCTurner DaveCTurner left a comment

LGTM tho an indefinite wait (on all platforms) would also be fine.

@ywangd
Member Author

ywangd commented Nov 5, 2025

tho an indefinite wait (on all platforms) would also be fine.

Noted. Will go for this option if it ends up failing again.
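The difference between the bounded wait and the indefinite wait mentioned above can be sketched with a plain `CountDownLatch`. This is only an illustration under assumptions: the class `WaitStyles` and its methods are hypothetical, and the actual test may wait on cluster state in a different way.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

final class WaitStyles {
    // Bounded wait: returns false if the latch is not released in time,
    // which a test would typically turn into a failure.
    static boolean boundedWait(CountDownLatch done, long timeout, TimeUnit unit) throws InterruptedException {
        return done.await(timeout, unit);
    }

    // Indefinite wait: blocks until the latch is released. A slow worker can
    // no longer trip an arbitrary threshold, but a genuinely stuck run is only
    // stopped by whatever outer limit the surrounding test runner imposes.
    static void indefiniteWait(CountDownLatch done) throws InterruptedException {
        done.await();
    }

    public static void main(String[] args) throws Exception {
        CountDownLatch done = new CountDownLatch(1);
        // Simulated slow work that finishes after roughly 200 ms.
        Thread worker = new Thread(() -> {
            try {
                Thread.sleep(200);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            done.countDown();
        });
        worker.start();

        boolean inTime = boundedWait(done, 50, TimeUnit.MILLISECONDS);
        System.out.println("finished within 50 ms: " + inTime);

        indefiniteWait(done);
        System.out.println("finished eventually: " + (done.getCount() == 0));
        worker.join();
    }
}
```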

@ywangd ywangd added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Nov 5, 2025
@elasticsearchmachine elasticsearchmachine merged commit 7a23516 into elastic:main Nov 5, 2025
34 checks passed
@ywangd ywangd deleted the es-137384-fix branch November 5, 2025 10:22
ywangd added a commit to ywangd/elasticsearch that referenced this pull request Nov 5, 2025
)

Windows can sometimes be slow at handling the large number of cluster
state updates.

Resolves: elastic#137384
@elasticsearchmachine
Collaborator

💚 Backport successful

Branches: 9.2, 8.19, 9.1

elasticsearchmachine pushed a commit that referenced this pull request Nov 5, 2025
…137614)

Windows can sometimes be slow at handling the large number of cluster
state updates.

Resolves: #137384
elasticsearchmachine pushed a commit that referenced this pull request Nov 5, 2025
…137615)

Windows can sometimes be slow at handling the large number of cluster
state updates.

Resolves: #137384
elasticsearchmachine pushed a commit that referenced this pull request Nov 5, 2025
…137613)

Windows can sometimes be slow at handling the large number of cluster
state updates.

Resolves: #137384
Kubik42 pushed a commit to Kubik42/elasticsearch that referenced this pull request Nov 10, 2025
)

Windows can sometimes be slow at handling the large number of cluster
state updates.

Resolves: elastic#137384
Labels

auto-backport Automatically create backport pull requests when merged auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) Team:Distributed Coordination Meta label for Distributed Coordination team >test Issues or PRs that are addressing/adding tests v8.19.7 v9.1.7 v9.2.1 v9.3.0

Development

Successfully merging this pull request may close these issues.

[CI] ClusterRerouteIT testDelayWithALargeAmountOfShards failing

3 participants