
Conversation

piergm (Member) commented Feb 11, 2025

With this PR we introduce a way to track the EWMA and total time spent executing tasks for each index in the search thread-pool.
We extended TaskExecutionTimeTrackingEsThreadPoolExecutor, which already has logic to track the EWMA and time spent executing tasks in the search thread pool globally (not per-index), into TaskExecutionTimeTrackingPerIndexEsThreadPoolExecutor, which tracks on a per-index basis. We chose to extend this logic rather than duplicate the time-tracking code.
New indices start being tracked via the computeIfAbsent inside the trackExecutionTime method, and deleted indices are removed by the cluster state listener SearchIndexTimeTrackingCleanupService.
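
For illustration, a minimal sketch of this registration/cleanup pattern, assuming a hypothetical tracker class (only the trackExecutionTime name and the computeIfAbsent/cleanup behavior come from the description above; everything else is illustrative):

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Illustrative sketch, not the PR's implementation: one accumulator per index,
// registered lazily and removed when the index is deleted.
class PerIndexExecutionTimeTracker {
    private final ConcurrentHashMap<String, LongAdder> executionTimeByIndex = new ConcurrentHashMap<>();

    // computeIfAbsent starts tracking a new index the first time a task for it completes.
    void trackExecutionTime(String indexName, long tookInNanos) {
        executionTimeByIndex.computeIfAbsent(indexName, name -> new LongAdder()).add(tookInNanos);
    }

    // Invoked by a cluster state listener (SearchIndexTimeTrackingCleanupService in the PR)
    // when an index is deleted, so the map does not grow without bound.
    void removeIndex(String indexName) {
        executionTimeByIndex.remove(indexName);
    }

    long totalTimeInNanos(String indexName) {
        LongAdder adder = executionTimeByIndex.get(indexName);
        return adder == null ? 0L : adder.sum();
    }
}
```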

@drempapis drempapis requested a review from a team as a code owner March 7, 2025 13:35
drempapis (Contributor) commented Mar 7, 2025

> Do we really need this when we already have org.elasticsearch.search.SearchService.SearchOperationListenerExecutor#SearchOperationListenerExecutor(org.elasticsearch.search.internal.SearchContext), which seems to do the exact same thing as all of this code does for reporting to APM? Shouldn't we move both things to the same codebase?
>
> I mentioned this to Luca a couple of minutes ago: we're already running into massive I-cache miss rates for this logic, and this PR will make that situation even worse. Since we already do time tracking per shard in one way, we should be able to piggyback on that?

The code has been refactored based on Armin's suggestion to register a SearchOperationListener that tracks the execution time per index. To verify the functionality, the following test was performed locally: a serverless cluster (master/search/ML) was deployed, three empty indices (index_1, index_2, index_3) were added, and Locust was used to benchmark the deployment by sending match_all requests to the indices with a load split of 60% / 30% / 10%.
For reference, the Locust code is the following:

```python
import json
import random

from locust import HttpUser, task


# Enclosing user class and imports added for completeness; the original
# snippet shows only the two methods.
class SearchLoadUser(HttpUser):
    @task
    def benchmark_search_load(self):
        headers = {"Content-Type": "application/json"}
        query = {
            "query": {
                "match_all": {}
            }
        }
        index_request = "/{}/_search".format(self.weighted_random_index())

        with self.client.post(index_request, data=json.dumps(query), headers=headers, catch_response=True) as response:
            if response.status_code == 200:
                response.success()
            else:
                response.failure(f"Failed! Status Code: {response.status_code}")

    def weighted_random_index(self):
        probabilities = [0.6, 0.3, 0.1]  # 60%, 30%, 10%
        indices = ['index_1', 'index_2', 'index_3']
        return random.choices(indices, weights=probabilities, k=1)[0]
```

After running for a while, the benchmark looks like the following:

[screenshot: Locust benchmark dashboard]

On the ES side, we get a periodic print of the recorded execution time per index (in nanos):

[screenshot: per-index execution times logged by ES]

We also normalize the per-index execution time to [0, 1] with a total sum of 1 and, as expected, get a load that converges to the request percentage per index: 0.6 / 0.3 / 0.1.
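
As a sketch, the normalization step amounts to dividing each per-index total by the overall sum (names here are illustrative, not from the PR):

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative: normalizes per-index execution times (in nanos) to [0, 1] so they sum to 1.
final class LoadNormalization {
    static Map<String, Double> normalize(Map<String, Long> executionTimeByIndex) {
        long total = executionTimeByIndex.values().stream().mapToLong(Long::longValue).sum();
        Map<String, Double> normalized = new HashMap<>();
        executionTimeByIndex.forEach(
            (index, nanos) -> normalized.put(index, total == 0 ? 0.0 : (double) nanos / total)
        );
        return normalized;
    }
}
```

With the 60% / 30% / 10% workload above, these values converge to roughly 0.6 / 0.3 / 0.1.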

drempapis (Contributor) commented Mar 7, 2025

I'm not sure about all this history appearing. To work on this issue, I did the following:

```sh
git remote add matteo-fork git@github.com:piergm/elasticsearch.git
git fetch matteo-fork
git checkout -b mp_search_load-per-index matteo-fork/mp_search_load-per-index
git checkout -b mp_searfh_load-per-index_update
git merge main mp_searfh_load-per-index_update
git merge mp_searfh_load-per-index_update mp_search_load-per-index
git push matteo-fork HEAD:mp_search_load-per-index
```

```xml
<method v="2" />
</configuration>
</component>
</component>
```
Contributor:

I tried to revert this, but the newline persists there.

piergm (Member, PR author):

It's IntelliJ's fault; try removing it from vim/nano (with IJ closed) or another editor, then commit the change and it should work 😄

```java
     * @param tookInNanos the number of nanoseconds the query execution took
     */
    default void onFailedQueryPhase(SearchContext searchContext) {}
    default void onFailedQueryPhase(SearchContext searchContext, long tookInNanos) {}
```
Contributor:

We also need to track the execution time when a phase fails.
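
For illustration, a minimal sketch of a listener that records time for both outcomes, assuming the new tookInNanos overload shown in the diff above (the tracker type is the hypothetical one sketched earlier):

```java
import org.elasticsearch.index.shard.SearchOperationListener;
import org.elasticsearch.search.internal.SearchContext;

// Sketch only: records execution time for successful and failed query phases alike,
// since failed phases consume CPU and should count toward the index load.
class PerIndexTimeTrackingListener implements SearchOperationListener {
    private final PerIndexExecutionTimeTracker tracker;

    PerIndexTimeTrackingListener(PerIndexExecutionTimeTracker tracker) {
        this.tracker = tracker;
    }

    @Override
    public void onQueryPhase(SearchContext searchContext, long tookInNanos) {
        tracker.trackExecutionTime(searchContext.indexShard().shardId().getIndexName(), tookInNanos);
    }

    @Override
    public void onFailedQueryPhase(SearchContext searchContext, long tookInNanos) {
        tracker.trackExecutionTime(searchContext.indexShard().shardId().getIndexName(), tookInNanos);
    }
}
```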

```java
     * @param indexName the name of the index
     * @return the EWMA of the execution time for the index
     */
    public double getLoadEMWAPerIndex(String indexName) {
```
Contributor:

Whether we need to calculate and export the EWMA is still under consideration.

piergm (Member, PR author):

Maybe some more context here:
We are calculating load on a per-index basis; the loads are then collected and summed up (TODO) on the master node. With this information we will need to determine which of the indices were under the most load and act based on that. The idea is to then normalize the "global" index load and act on the normalized values.
That said, the per-node EWMA is not really suitable for summing across nodes, in our opinion.
That's why this comment.
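
As a sketch of that aggregation step (all names hypothetical; this is the TODO above, not code from the PR), raw nanosecond totals sum cleanly across nodes, whereas per-node EWMAs would not:

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative master-side aggregation: sum per-node totals per index, then normalize.
final class GlobalIndexLoad {
    static Map<String, Long> merge(List<Map<String, Long>> perNodeExecutionTimes) {
        Map<String, Long> global = new HashMap<>();
        for (Map<String, Long> nodeLoad : perNodeExecutionTimes) {
            nodeLoad.forEach((index, nanos) -> global.merge(index, nanos, Long::sum));
        }
        return global; // feed into the normalization sketch above
    }
}
```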


```java
    private static final Logger logger = LogManager.getLogger(ShardSearchPerIndexTimeTrackingMetrics.class);

    private final ConcurrentHashMap<String, Tuple<LongAdder, ExponentiallyWeightedMovingAverage>> indexExecutionTime;
```
Contributor:

I'm sorry for saying it in such a straightforward manner, but do we really want to add more logic based on this class?

Especially on a per-shard basis, using the math in ExponentiallyWeightedMovingAverage seems questionable.

We calculate `newValue = alpha * lastValue + (1 - alpha) * currentValue`. In a large number of use-cases you may see the fetch and query times be an order of magnitude apart. So now, assuming the query always matches, won't we essentially flap between two values constantly for a shard?

Why not just use the existing metrics we have in org.elasticsearch.index.search.stats.ShardSearchStats? EWMA makes no sense here; if anything, isn't total query time and its derivative what we care about?
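
For illustration, a standalone numeric sketch of the flapping described above, feeding alternating query (~10 ms) and fetch (~1 ms) samples into the quoted formula (alpha and the timings are made up):

```java
// Demonstrates the oscillation: with samples an order of magnitude apart, the
// EWMA bounces between two values instead of converging to a meaningful load.
public class EwmaFlappingDemo {
    public static void main(String[] args) {
        double alpha = 0.3;
        double ewma = 10_000_000;        // start at the query time, in nanos
        long queryNanos = 10_000_000;    // ~10 ms query phase
        long fetchNanos = 1_000_000;     // ~1 ms fetch phase
        for (int i = 1; i <= 10; i++) {
            long current = (i % 2 == 0) ? queryNanos : fetchNanos;
            // newValue = alpha * lastValue + (1 - alpha) * currentValue
            ewma = alpha * ewma + (1 - alpha) * current;
            System.out.printf("sample=%,d ewma=%,.0f%n", current, ewma);
        }
        // Settles into a two-cycle around ~7.9e6 and ~3.1e6 ns rather than one value.
    }
}
```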

piergm (Member, PR author):

Good point, we left it in because we were not sure about it, and I asked Dimi to leave a comment on the PR about this.

drempapis (Contributor) commented Mar 7, 2025:

All right, let's check how it can be adapted into ShardSearchStats.

original-brownbear (Contributor) left a comment:

I don't think we should do this given that we have ShardSearchStats already. If we're missing any metric, we should add it there, shouldn't we?

@drempapis drempapis closed this May 21, 2025
@javanna javanna removed the v9.1.0 label May 26, 2025

Labels

>feature · :Search Foundations/Search (catch all for Search Foundations) · Team:Search Foundations (meta label for the Search Foundations team in Elasticsearch)