Deduplicate allocation stats calls #123246

DaveCTurner · 2025-02-24T09:35:07Z

These things can be quite expensive and there's no need to recompute
them in parallel across all management threads as done today. This
commit adds a deduplicator to avoid redundant work.

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work.

elasticsearchmachine · 2025-02-24T09:35:33Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

elasticsearchmachine · 2025-02-24T09:35:34Z

Hi @DaveCTurner, I've created a changelog YAML for you.

original-brownbear

Looks just fine except I think we need to change where we fork still?

original-brownbear · 2025-02-24T11:09:43Z

...ava/org/elasticsearch/action/admin/cluster/allocation/TransportGetAllocationStatsAction.java

            actionFilters,
            TransportGetAllocationStatsAction.Request::new,
            TransportGetAllocationStatsAction.Response::new,
            threadPool.executor(ThreadPool.Names.MANAGEMENT)


You wouldn't want to fork anymore with the deduplicator would you? Only fork in case you actually do the computation?

Technically yes that's right of course. I'm always in two minds about adding more non-forking actions: they increase the risk of serious pain in future (for nontechnical reasons) at the cost of a little more latency right now. I'll let you have this one without prejudice tho 😉

You'd think forking lowers the risk of pain the future, but under load already are at the other end of this.
Forking is far more expensive than just the fork in the real world. You also need to account for the fact that you'll allocate a buffer off of the channel's thread and release that buffer which comes with contention very quickly unfortunately. Just to illustrate this a little :)

original-brownbear

LGTM Thanks!

original-brownbear · 2025-02-24T12:01:06Z

...rg/elasticsearch/action/admin/cluster/allocation/TransportGetAllocationStatsActionTests.java

+        final var startBarrier = new CyclicBarrier(threads.length);
+        for (int i = 0; i < threads.length; i++) {
+            threads[i] = new Thread(() -> {
+                safeAwait(startBarrier);


NIT: could use org.elasticsearch.test.ESTestCase#startInParallel here ? :)

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work.

elasticsearchmachine · 2025-02-24T13:22:53Z

💔 Backport failed

Status	Branch	Result
❌	8.18	Commit could not be cherrypicked due to conflicts
❌	8.x	Commit could not be cherrypicked due to conflicts
✅	9.0
❌	8.16	Commit could not be cherrypicked due to conflicts
❌	8.17	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 123246

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work. Backport of elastic#123246 to `8.x`

DaveCTurner · 2025-02-24T14:20:28Z

8.x-and-earlier backports are #123267

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work.

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work. Backport of #123246 to `8.x`

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work. Backport of elastic#123246 to `8.x`

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work. Backport of #123246 to `8.x`

Deduplicate allocation stats calls

99c0385

These things can be quite expensive and there's no need to recompute them in parallel across all management threads as done today. This commit adds a deduplicator to avoid redundant work.

DaveCTurner added >bug :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) v8.18.1 v8.19.0 v9.0.1 v9.1.0 v8.17.3 labels Feb 24, 2025

DaveCTurner requested a review from original-brownbear February 24, 2025 09:35

elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Feb 24, 2025

Update docs/changelog/123246.yaml

c234cfb

original-brownbear reviewed Feb 24, 2025

View reviewed changes

DaveCTurner added 2 commits February 24, 2025 11:57

Merge branch 'main' into 2025/02/24/deduplicate-allocation-stats

16277c5

Fork less

1694deb

original-brownbear approved these changes Feb 24, 2025

View reviewed changes

original-brownbear mentioned this pull request Feb 24, 2025

Only fork TransportGetAllocationStatsAction if heavy allocation stats are requested #123225

Closed

DaveCTurner added auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) auto-backport Automatically create backport pull requests when merged v8.16.5 labels Feb 24, 2025

elasticsearchmachine merged commit 187b192 into elastic:main Feb 24, 2025
17 checks passed

DaveCTurner deleted the 2025/02/24/deduplicate-allocation-stats branch February 24, 2025 13:21

DaveCTurner mentioned this pull request Feb 24, 2025

[9.0] Deduplicate allocation stats calls (#123246) #123263

Merged

elasticsearchmachine added the backport pending label Feb 24, 2025

DaveCTurner mentioned this pull request Feb 24, 2025

Deduplicate allocation stats calls #123267

Merged

DaveCTurner removed the backport pending label Feb 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Deduplicate allocation stats calls #123246

Deduplicate allocation stats calls #123246

Uh oh!

DaveCTurner commented Feb 24, 2025

Uh oh!

elasticsearchmachine commented Feb 24, 2025

Uh oh!

elasticsearchmachine commented Feb 24, 2025

Uh oh!

original-brownbear left a comment

Uh oh!

original-brownbear Feb 24, 2025

Uh oh!

DaveCTurner Feb 24, 2025

Uh oh!

original-brownbear Feb 24, 2025

Uh oh!

original-brownbear left a comment

Uh oh!

original-brownbear Feb 24, 2025

Uh oh!

DaveCTurner Feb 24, 2025

Uh oh!

Uh oh!

elasticsearchmachine commented Feb 24, 2025

Uh oh!

DaveCTurner commented Feb 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Deduplicate allocation stats calls #123246

Deduplicate allocation stats calls #123246

Uh oh!

Conversation

DaveCTurner commented Feb 24, 2025

Uh oh!

elasticsearchmachine commented Feb 24, 2025

Uh oh!

elasticsearchmachine commented Feb 24, 2025

Uh oh!

original-brownbear left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

DaveCTurner Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

original-brownbear Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

original-brownbear left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

DaveCTurner Feb 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

elasticsearchmachine commented Feb 24, 2025

💔 Backport failed

Uh oh!

DaveCTurner commented Feb 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants