[Serverless]: Provide information on how users can manage ML VCU costs

### Serverless Docs

Welcome to Elastic Serverless

### Description

It can be helpful to add another bullet point under this section https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs that talks about the two ways to control the ML VCU costs:

- Set [adaptive resources to Low](https://www.elastic.co/guide/en/machine-learning/current/ml-nlp-auto-scale.html#_adaptive_resources_enabled) to allow ML to scale down to 0 # of allocations when there are no active inference requests
- When using the [inference API](https://www.elastic.co/guide/en/elasticsearch/reference/current/put-inference-api.html) for Elasticsearch or ELSER, enable `adaptive_allocations` which will allow ML to scale down the models to 0 # of allocations when there are no active inference requests



### Resources and additional context

https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Serverless]: Provide information on how users can manage ML VCU costs #229

Serverless Docs

Description

Resources and additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Serverless]: Provide information on how users can manage ML VCU costs #229

Description

Serverless Docs

Description

Resources and additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions