You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Set adaptive resources to Low to allow ML to scale down to 0 # of allocations when there are no active inference requests
When using the inference API for Elasticsearch or ELSER, enable adaptive_allocations which will allow ML to scale down the models to 0 # of allocations when there are no active inference requests