Conversation


@ywangd ywangd commented Oct 10, 2025

A flamegraph shows that Balancer instantiation takes a considerable amount of time in an allocate call. More than a quarter of the instantiation time is spent computing disk-related stats, which is wasteful when the disk weight factor is zero. This PR skips these computations in that case.

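The optimization described above can be illustrated with a minimal, self-contained sketch. All type and method names here are hypothetical stand-ins rather than the actual Elasticsearch classes; the point is only the guard shape: when the disk weight factor is zero, the expensive average-disk-usage computation is skipped and 0 is used as a placeholder.

```java
import java.util.List;

class DiskUsageGuardSketch {

    // Stand-in for the expensive computation (in the real code,
    // WeightFunction.avgDiskUsageInBytesPerNode walks cluster-wide disk stats).
    static long expensiveAvgDiskUsageInBytesPerNode(List<Long> nodeDiskUsages) {
        long total = 0;
        for (long usage : nodeDiskUsages) {
            total += usage;
        }
        return nodeDiskUsages.isEmpty() ? 0 : total / nodeDiskUsages.size();
    }

    // Mirrors the guarded assignment in the diff: skip the computation
    // entirely when the disk weight factor is zero.
    static long avgDiskUsageInBytesPerNode(float diskWeightFactor, List<Long> nodeDiskUsages) {
        boolean skipDiskUsageCalculation = diskWeightFactor == 0.0f;
        return skipDiskUsageCalculation
            ? 0
            : expensiveAvgDiskUsageInBytesPerNode(nodeDiskUsages);
    }
}
```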
@ywangd ywangd requested a review from nicktindall October 10, 2025 05:19
@ywangd ywangd added >non-issue :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v9.3.0 labels Oct 10, 2025
Comment on lines 331 to 334
avgDiskUsageInBytesPerNode = skipDiskUsageCalculation
? 0
: WeightFunction.avgDiskUsageInBytesPerNode(allocation.clusterInfo(), metadata, routingNodes);
nodes = Collections.unmodifiableMap(buildModelFromAssigned(skipDiskUsageCalculation));
Member Author

The cost saving is realistic. My main question is whether the approach is considered hacky.

Contributor

What if, rather than having the additional flag, we passed the weighting around, and the expensive parts performed the calculation only if the weighting was non-zero?

I'm not sure if that's better, but it's a thought.

Contributor

We could refer to this.balancingWeights perhaps?

Member Author

See the flamegraph below (from the many-shards benchmark), which shows the time spent on disk-related computation (purple) inside allocate calls.
[screenshot: flamegraph, 2025-10-10]

Member Author

We synced offline and agreed to change the boolean flag to be a method on BalancingWeights.
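The agreed approach, a method on BalancingWeights instead of a threaded-through boolean, could look roughly like the following sketch. Only the name diskUsageIgnored() appears in the later diff; everything else here (the interface shape, diskWeightFactor(), FixedWeights) is an assumption for illustration.

```java
// Hypothetical sketch: callers query diskUsageIgnored() on the weights
// abstraction instead of having a separate boolean passed around.
interface BalancingWeightsSketch {
    float diskWeightFactor();

    // Derived from the weight factor, so the skip condition lives in one place.
    default boolean diskUsageIgnored() {
        return diskWeightFactor() == 0.0f;
    }
}

class FixedWeights implements BalancingWeightsSketch {
    private final float diskFactor;

    FixedWeights(float diskFactor) {
        this.diskFactor = diskFactor;
    }

    @Override
    public float diskWeightFactor() {
        return diskFactor;
    }
}
```

One advantage of this shape is that the skip condition cannot drift out of sync with the weight factor, since it is computed from it rather than stored separately.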

@elasticsearchmachine elasticsearchmachine added the serverless-linked Added by automation, don't add manually label Oct 10, 2025
@ywangd ywangd marked this pull request as ready for review October 13, 2025 04:50
@elasticsearchmachine elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Oct 13, 2025
@elasticsearchmachine (Collaborator)

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

Contributor

@nicktindall nicktindall left a comment

LGTM. Some comments, but nothing worth holding the change up for.

Map.of(),
Map.of(),
Map.of()
);
Contributor

Nit: could use ClusterInfo.builder().shardSizes(...).build()?

Member Author

I forgot there is a builder. Pushed b7e22fc; it looks much nicer! Thanks!

private float maxShardSizeBytes(ProjectIndex index) {
if (balancingWeights.diskUsageIgnored()) {
return 0;
}
Contributor

This bit I have minor apprehensions about. But it seems like it's not an easy one to skip on the caller side. And we do have that information available to us here via the balancingWeights.

Member Author

I can remove this change if you prefer. It does not really show up in the flamegraph, so I could be over-zealous here.

Contributor

Yeah, maybe that would be nicer. The behaviour is potentially a little surprising. It would seem safer for the caller to skip the call than for the callee to just return zero.

Member Author

It turns out that we can check it at the call site. Not sure why I initially thought it was not feasible ... Pushed 0affb99
Let me know if this works for you. Thanks!
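The caller-side guard that was settled on can be sketched as follows. All names here are hypothetical stand-ins (the real method takes a ProjectIndex); the sketch only contrasts the two shapes discussed: the callee keeps one well-defined meaning, and the caller decides whether the disk term is worth computing.

```java
class CallerSideSkipSketch {

    // Stand-in for BalancingWeights, reduced to the one query used here.
    interface Weights {
        boolean diskUsageIgnored();
    }

    // The callee always computes its answer; it never silently returns a
    // placeholder based on configuration it should not need to know about.
    static float maxShardSizeBytes(float[] shardSizes) {
        float max = 0;
        for (float size : shardSizes) {
            max = Math.max(max, size);
        }
        return max;
    }

    // Caller-side guard: the lookup only happens when the disk weight
    // actually contributes to the overall node weight.
    static float diskWeightTerm(Weights weights, float[] shardSizes) {
        return weights.diskUsageIgnored() ? 0 : maxShardSizeBytes(shardSizes);
    }
}
```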

Contributor

@nicktindall nicktindall left a comment

Looks great, ship it!

@ywangd ywangd merged commit c9d59a1 into elastic:main Oct 14, 2025
34 checks passed