Performance Regression for every CS update from ILM's org.elasticsearch.cluster.metadata.Metadata#isIndexManagedByILM #98992

While going over the many-shards benchmark bootstrapping, I noticed that it has slowed down quite a bit recently.

It turns out that a big contributor to this is `org.elasticsearch.cluster.metadata.Metadata#isIndexManagedByILM`, which is called from
`org.elasticsearch.xpack.ilm.IndexLifecycleService#triggerPolicies` on every cluster state update and costs O(N) in the number of indices.

<img width="1728" alt="image" src="https://github.com/elastic/elasticsearch/assets/6490959/52f8dfed-1f5a-4bac-b4bc-09b262992cce">

This could be made more efficient in various ways. At a minimum, we should:

* remove the setting read in the hot loop
* stop using `Metadata.getIndicesLookup`, which is extremely expensive on the applier thread

A first quick fix would be to check whether any data streams use DLM at all; if none do, the whole logic can be skipped. The `isIndexManagedByILM` call currently adds about 5% overhead to every CS update (relative to work like index creation and shard allocation in the many-shards benchmark) at 25k indices in a cluster, and the overhead grows as O(number_of_indices).
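
To illustrate the quick fix, here is a rough, self-contained sketch. It is not the actual Elasticsearch code: `DataStream`, `Index`, `isManagedByIlmSlow`, and `countIlmManaged` are hypothetical stand-ins, and the slow path only approximates my reading of what `isIndexManagedByILM` does (an index with an ILM policy counts as ILM-managed unless its parent data stream uses DLM and the index does not prefer ILM). The idea is to answer "does any data stream use DLM?" once per cluster state update, which is O(number of data streams), and to fall back to the per-index lookup only when the answer is yes.

```java
import java.util.List;
import java.util.Set;

public class DlmShortCircuit {

    // Hypothetical, simplified stand-ins for the real metadata classes.
    public record DataStream(String name, Set<String> backingIndices, boolean usesDlm) {}
    public record Index(String name, String ilmPolicy, boolean preferIlm) {}

    // Counts how often the expensive per-index path runs (for illustration only).
    public static int slowPathCalls = 0;

    // Cheap pre-check: O(number of data streams), independent of the index count.
    public static boolean anyDataStreamUsesDlm(List<DataStream> dataStreams) {
        return dataStreams.stream().anyMatch(DataStream::usesDlm);
    }

    private static boolean hasIlmPolicy(Index index) {
        return index.ilmPolicy() != null && !index.ilmPolicy().isBlank();
    }

    // Assumed shape of the expensive check: resolve the parent data stream,
    // then consult the per-index "prefer ILM" flag if that stream uses DLM.
    public static boolean isManagedByIlmSlow(Index index, List<DataStream> dataStreams) {
        slowPathCalls++;
        if (!hasIlmPolicy(index)) {
            return false;
        }
        for (DataStream ds : dataStreams) {
            if (ds.usesDlm() && ds.backingIndices().contains(index.name())) {
                return index.preferIlm();
            }
        }
        return true;
    }

    // Per cluster state update: decide once whether the slow path is needed at all.
    public static long countIlmManaged(List<Index> indices, List<DataStream> dataStreams) {
        boolean dlmInUse = anyDataStreamUsesDlm(dataStreams);
        long managed = 0;
        for (Index index : indices) {
            boolean isManaged = dlmInUse
                ? isManagedByIlmSlow(index, dataStreams)
                : hasIlmPolicy(index); // no DLM anywhere: the policy name alone decides
            if (isManaged) {
                managed++;
            }
        }
        return managed;
    }

    public static void main(String[] args) {
        List<Index> indices = List.of(
            new Index("logs-000001", "my-policy", false),
            new Index("plain-index", null, true));
        List<DataStream> dataStreams = List.of(
            new DataStream("logs", Set.of("logs-000001"), false));
        System.out.println("ILM-managed indices: " + countIlmManaged(indices, dataStreams));
        System.out.println("slow path calls: " + slowPathCalls);
    }
}
```

With no DLM-managed data streams in the cluster, the loop never touches the per-index lookup. This does not address the cost once DLM is in use, so the two items listed above would still be needed.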
