Update service-elser.asciidoc

georgewallace · web-flow · commit 02466aade8e8 · 2024-11-12T16:14:42.000-07:00
diff --git a/docs/reference/inference/service-elser.asciidoc b/docs/reference/inference/service-elser.asciidoc
@@ -98,12 +98,12 @@ Must be a power of 2. Max allowed value is 32.
 
 [discrete]
 [[inference-example-elser-adaptive-allocation]]
-==== Setting adaptive allocations for the ELSER service
+==== ELSER service example
 
 NOTE: For more information on how to optimize your ELSER endpoints, refer to {ml-docs}/ml-nlp-elser.html#elser-recommendations[the ELSER recommendations] section in the model documentation.
 To learn more about model autoscaling, refer to the {ml-docs}/ml-nlp-auto-scale.html[trained model autoscaling] page.
 
-The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type and configure adaptive allocations.
+The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type and configure adaptive allocations (recommended).
 
 The request below will automatically download the ELSER model if it isn't already downloaded and then deploy the model.
 
@@ -126,11 +126,13 @@ PUT _inference/sparse_embedding/my-elser-model
 
 [discrete]
 [[inference-example-elser]]
-==== ELSER service example
+==== Creating an ELSER service endpoint without adaptive allocations
 
 The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type.
 Refer to the {ml-docs}/ml-nlp-elser.html[ELSER model documentation] for more info.
 
+The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type when adaptive allocations aren't required or {ml-docs}/ml-nlp-auto-scale.html[trained model autoscaling] isn't available.
+
 NOTE: If you want to optimize your ELSER endpoint for ingest, set the number of threads to `1` (`"num_threads": 1`).
 If you want to optimize your ELSER endpoint for search, set the number of threads to greater than `1`.
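
Reviewer sketch (not part of the diff): the two sections touched here differ only in how allocations are configured, so a small helper can make the contrast concrete. The Python below is a minimal illustration that builds the request body for either variant. The helper names are hypothetical, and the field names (`service`, `service_settings`, `adaptive_allocations`, `min_number_of_allocations`, `max_number_of_allocations`, `num_allocations`) are assumptions based on the ELSER service page rather than anything shown in these hunks; only `num_threads` appears in the diff itself. The power-of-2 and maximum-of-32 check is assumed to apply to `num_threads`, since the hunk header quotes that rule without naming the parameter.

```python
import json


def is_valid_num_threads(n: int) -> bool:
    """Return True if n is a power of 2 no greater than 32.

    Assumption: the "power of 2, max 32" rule quoted in the hunk header
    applies to num_threads.
    """
    return 1 <= n <= 32 and (n & (n - 1)) == 0


def elser_endpoint_body(
    num_threads: int = 1,
    adaptive: bool = True,
    min_allocations: int = 1,
    max_allocations: int = 10,
    num_allocations: int = 1,
) -> dict:
    """Build a request body for PUT _inference/sparse_embedding/<endpoint-id>.

    Field names are assumed from the ELSER service documentation; verify
    them against the page before relying on this.
    """
    if not is_valid_num_threads(num_threads):
        raise ValueError("num_threads must be a power of 2 and at most 32")

    settings: dict = {"num_threads": num_threads}
    if adaptive:
        # Recommended variant: let the deployment scale with load.
        settings["adaptive_allocations"] = {
            "enabled": True,
            "min_number_of_allocations": min_allocations,
            "max_number_of_allocations": max_allocations,
        }
    else:
        # Fixed variant: for when autoscaling isn't required or available.
        settings["num_allocations"] = num_allocations

    return {"service": "elser", "service_settings": settings}


if __name__ == "__main__":
    print("PUT _inference/sparse_embedding/my-elser-model")
    print(json.dumps(elser_endpoint_body(), indent=2))
    print(json.dumps(elser_endpoint_body(adaptive=False), indent=2))
```

Keeping `num_threads` at `1` matches the ingest-oriented setting in the NOTE above; passing a larger power of 2 corresponds to the search-oriented setting.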