Skip to content

Commit 02466aa

Browse files
Update service-elser.asciidoc
1 parent 4abef8e commit 02466aa

File tree

1 file changed

+5
-3
lines changed

1 file changed

+5
-3
lines changed

docs/reference/inference/service-elser.asciidoc

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -98,12 +98,12 @@ Must be a power of 2. Max allowed value is 32.
9898

9999
[discrete]
100100
[[inference-example-elser-adaptive-allocation]]
101-
==== Setting adaptive allocations for the ELSER service
101+
==== ELSER service example
102102

103103
NOTE: For more information on how to optimize your ELSER endpoints, refer to {ml-docs}/ml-nlp-elser.html#elser-recommendations[the ELSER recommendations] section in the model documentation.
104104
To learn more about model autoscaling, refer to the {ml-docs}/ml-nlp-auto-scale.html[trained model autoscaling] page.
105105

106-
The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type and configure adaptive allocations.
106+
The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type and configure adaptive allocations (recommended).
107107

108108
The request below will automatically download the ELSER model if it isn't already downloaded and then deploy the model.
109109

@@ -126,11 +126,13 @@ PUT _inference/sparse_embedding/my-elser-model
126126

127127
[discrete]
128128
[[inference-example-elser]]
129-
==== ELSER service example
129+
==== Creating ELSER service without adaptive allocations
130130

131131
The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type.
132132
Refer to the {ml-docs}/ml-nlp-elser.html[ELSER model documentation] for more info.
133133

134+
The following example shows how to create an {infer} endpoint called `my-elser-model` to perform a `sparse_embedding` task type when adaptive allocations isn't required or {ml-docs}/ml-nlp-auto-scale.html[trained model autoscaling] isn't available.
135+
134136
NOTE: If you want to optimize your ELSER endpoint for ingest, set the number of threads to `1` (`"num_threads": 1`).
135137
If you want to optimize your ELSER endpoint for search, set the number of threads to greater than `1`.
136138

0 commit comments

Comments
 (0)