Test that `ClusterInfo` is refreshed when a new node is added #134101

nicktindall · 2025-09-04T01:41:25Z

As an alternative to adding code that models shard movements to/from nodes with no `NodeUsageStatsForThreadPools` in the current `ClusterInfo`, just add a test to confirm that this situation is only very brief, because we eagerly fetch a new `ClusterInfo` when a node is added.

There didn't seem to be a test that covered this behaviour specifically (it dates back to the dark ages). Interestingly, many of the other scenarios (become master 1, fail master 1, become master 2, fail master 2) don't seem to assert that the `InternalClusterInfoService` actually did anything. You can comment out all the behaviour in `org.elasticsearch.cluster.InternalClusterInfoService#clusterChanged` and the only part that fails is the interval polling.

I can add assertions to those if we think it's worthwhile?

elasticsearchmachine · 2025-09-04T01:42:06Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

nicktindall · 2025-09-04T03:21:28Z

server/src/test/java/org/elasticsearch/cluster/InternalClusterInfoServiceSchedulingTests.java

+            );
+            // Don't use runUntilFlag because we don't want the scheduled task to run
+            deterministicTaskQueue.runAllRunnableTasks();
+            assertTrue(nodeJoined.get());


This may be why the existing tests don't assert anything. It feels a little hacky, but this seemed like the right test class to add to. It could also be implemented as an IT.
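To illustrate the concern in the code comment above, here is a minimal, self-contained sketch of a deterministic task queue. `ToyTaskQueue`, `ToyQueueDemo`, and `scheduleLater` are hypothetical names invented for this example; this is not the Elasticsearch `DeterministicTaskQueue`, only an assumed simplification of the same idea. It shows that draining only the currently runnable tasks leaves a deferred (scheduled) refresh untouched, so an assertion made at that point observes only the eager refresh.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical stand-in for a deterministic task queue; not the Elasticsearch class.
class ToyTaskQueue {
    private final Deque<Runnable> runnable = new ArrayDeque<>();
    private final List<Runnable> deferred = new ArrayList<>();

    // Enqueue a task that is ready to run immediately.
    void scheduleNow(Runnable task) {
        runnable.add(task);
    }

    // Enqueue a task that only becomes runnable once time is advanced
    // (for example, a periodic refresh).
    void scheduleLater(Runnable task) {
        deferred.add(task);
    }

    // Run everything that is ready now, including tasks enqueued while draining.
    // Deferred tasks are left alone.
    void runAllRunnableTasks() {
        while (!runnable.isEmpty()) {
            runnable.poll().run();
        }
    }

    // Simulate the passage of time: deferred tasks become runnable.
    void advanceTime() {
        runnable.addAll(deferred);
        deferred.clear();
    }
}

class ToyQueueDemo {
    // Returns {refreshes seen after draining runnable tasks only,
    //          refreshes seen after also advancing time}.
    static int[] run() {
        ToyTaskQueue queue = new ToyTaskQueue();
        int[] refreshes = {0};
        queue.scheduleLater(() -> refreshes[0]++); // periodic (scheduled) refresh
        queue.scheduleNow(() -> refreshes[0]++);   // eager refresh, e.g. after a node join
        queue.runAllRunnableTasks();
        int eagerOnly = refreshes[0];
        queue.advanceTime();
        queue.runAllRunnableTasks();
        return new int[] { eagerOnly, refreshes[0] };
    }
}
```

If the test instead advanced time before asserting, the scheduled refresh would also run, and the assertion could pass even with the eager refresh removed.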

Copilot

Pull Request Overview

This PR adds a test to verify that the ClusterInfo is refreshed when a new node joins the cluster, addressing a gap in test coverage for this critical behavior. The test ensures that when nodes are added to a cluster, the InternalClusterInfoService eagerly fetches updated cluster information, which is important for proper shard allocation decisions.

Key changes:

- Added test coverage for node join and leave scenarios in cluster info service scheduling
- Verified that cluster info refresh is triggered when nodes join but not when they leave
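The join-versus-leave behaviour described above can be sketched with a toy model. `ToyClusterInfoService` and its methods are hypothetical and exist only for this illustration; the real `InternalClusterInfoService#clusterChanged` works on cluster-changed events and may apply further conditions that are not modelled here.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical model of the behaviour under test; not an Elasticsearch API.
class ToyClusterInfoService {
    private Set<String> knownNodes = new HashSet<>();
    private int refreshCount = 0;

    // Called with the full set of node IDs in each new cluster state.
    void clusterChanged(Set<String> currentNodes) {
        // A node was added if the new state contains an ID we have not seen.
        boolean nodeAdded = !knownNodes.containsAll(currentNodes);
        knownNodes = new HashSet<>(currentNodes);
        if (nodeAdded) {
            // Eager refresh, so the new node is without stats only briefly.
            refreshCount++;
        }
        // A node leaving does not trigger a refresh in this model.
    }

    int refreshCount() {
        return refreshCount;
    }
}
```

A test against this model would apply a state with one node, then a state with two nodes, then a state with one node again, and check that the refresh count increases on the first two transitions but not on the last.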

ywangd

LGTM

Test that ClusterInfo is refreshed when a new node is added

2c6e4c5

nicktindall added the >test and :Distributed Coordination/Allocation labels Sep 4, 2025

elasticsearchmachine added the Team:Distributed Coordination and v9.2.0 labels Sep 4, 2025

nicktindall mentioned this pull request Sep 4, 2025

Model movements to nodes with no existing node stats #133901

Closed

nicktindall added 3 commits September 4, 2025 11:52

Merge branch 'main' into test_clusterinfo_refresh_on_node_join

2388760

Prevent scheduled task interfering with assertions

11d2b3b

Comments

48c2dc9

nicktindall commented Sep 4, 2025

nicktindall requested a review from Copilot September 4, 2025 03:22

Copilot AI reviewed Sep 4, 2025

server/src/test/java/org/elasticsearch/cluster/InternalClusterInfoServiceSchedulingTests.java (outdated, resolved)

Update server/src/test/java/org/elasticsearch/cluster/InternalCluster…

e57a5d0

…InfoServiceSchedulingTests.java Co-authored-by: Copilot <[email protected]>

nicktindall mentioned this pull request Sep 4, 2025

Use the last good NodeUsageStatsForThreadPools when a node returns an error #133896

Merged

nicktindall requested review from DiannaHohensee, mhl-b and ywangd September 4, 2025 03:28

ywangd approved these changes Sep 4, 2025

nicktindall merged commit dc743a9 into elastic:main Sep 4, 2025
33 checks passed

nicktindall deleted the test_clusterinfo_refresh_on_node_join branch September 4, 2025 23:43

jbaiera pushed a commit to jbaiera/elasticsearch that referenced this pull request Sep 5, 2025

Test that ClusterInfo is refreshed when a new node is added (elasti…

5214f2b

…c#134101)
