Skip to content

Commit c1de2fb

Browse files
authored
Machine learning trained model autoscaling recommendations for cost reduction (#236)
1 parent 8e893d2 commit c1de2fb

File tree

1 file changed

+6
-4
lines changed

1 file changed

+6
-4
lines changed

serverless/pages/pricing.asciidoc

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,8 @@ The number of VCUs you need is determined by:
1717
* Search Power setting
1818
* Machine learning usage
1919

20+
For detailed {es-serverless} project rates, see the https://www.elastic.co/pricing/serverless-search[{es-serverless} pricing page].
21+
2022
[discrete]
2123
[[elasticsearch-billing-information-about-the-vcu-types-search-ingest-and-ml]]
2224
== VCU types: Search, Indexing, and ML
@@ -39,13 +41,13 @@ queries per second (QPS) you require.
3941
[[elasticsearch-billing-managing-elasticsearch-costs]]
4042
== Managing {es} costs
4143

42-
You can control costs by using a lower Search Power setting or reducing the amount
43-
of retained data.
44+
You can control costs using the following strategies:
4445

4546
* **Search Power setting:** <<elasticsearch-manage-project-search-power-settings,Search Power>> controls the speed of searches against your data. With Search Power, you can
4647
improve search performance by adding more resources for querying, or you can reduce provisioned
4748
resources to cut costs.
4849
* **Time series data retention:** By limiting the number of days of <<elasticsearch-ingest-time-series-data,time series data>> that are available for caching,
4950
you can reduce the number of search VCUs required.
50-
51-
For detailed {es-serverless} project rates, see the https://www.elastic.co/pricing/serverless-search[{es-serverless} pricing page].
51+
* **Machine learning trained model autoscaling:** Configure your trained model deployment to allow it to scale down to zero allocations when there are no active inference requests:
52+
** When starting or updating a trained model deployment, <<enabling-autoscaling-in-kibana-adaptive-resources,Enable adaptive resources>> and set the VCU usage level to *Low*.
53+
** When using the inference API for Elasticsearch or ELSER, <<enabling-autoscaling-through-apis-adaptive-allocations,enable `adaptive_allocations`>>.

0 commit comments

Comments
 (0)