Skip to content

Commit 36d6816

Browse files
committed
Adds default inference andpoints information
1 parent d745315 commit 36d6816

File tree

1 file changed

+12
-12
lines changed

1 file changed

+12
-12
lines changed

docs/reference/inference/inference-apis.asciidoc

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -48,21 +48,21 @@ When adaptive allocations are enabled:
4848

4949
For more information about adaptive allocations and resources, refer to the {ml-docs}/ml-nlp-auto-scale.html[trained model autoscaling] documentation.
5050

51-
//[discrete]
52-
//[[default-enpoints]]
53-
//=== Default {infer} endpoints
51+
[discrete]
52+
[[default-enpoints]]
53+
=== Default {infer} endpoints
5454

55-
//Your {es} deployment contains some preconfigured {infer} endpoints that makes it easier for you to use them when defining `semantic_text` fields or {infer} processors.
56-
//The following list contains the default {infer} endpoints listed by `inference_id`:
55+
Your {es} deployment contains some preconfigured {infer} endpoints that makes it easier for you to use them when defining `semantic_text` fields or {infer} processors.
56+
The following list contains the default {infer} endpoints listed by `inference_id`:
5757

58-
//* `.elser-2-elasticsearch`: uses the {ml-docs}/ml-nlp-elser.html[ELSER] built-in trained model for `sparse_embedding` tasks (recommended for English language texts)
59-
//* `.multilingual-e5-small-elasticsearch`: uses the {ml-docs}/ml-nlp-e5.html[E5] built-in trained model for `text_embedding` tasks (recommended for non-English language texts)
58+
* `.elser-2-elasticsearch`: uses the {ml-docs}/ml-nlp-elser.html[ELSER] built-in trained model for `sparse_embedding` tasks (recommended for English language texts)
59+
* `.multilingual-e5-small-elasticsearch`: uses the {ml-docs}/ml-nlp-e5.html[E5] built-in trained model for `text_embedding` tasks (recommended for non-English language texts)
6060

61-
//Use the `inference_id` of the endpoint in a <<semantic-text,`semantic_text`>> field definition or when creating an <<inference-processor,{infer} processor>>.
62-
//The API call will automatically download and deploy the model which might take a couple of minutes.
63-
//Default {infer} enpoints have {ml-docs}/ml-nlp-auto-scale.html#nlp-model-adaptive-allocations[adaptive allocations] enabled.
64-
//For these models, the minimum number of allocations is `0`.
65-
//If there is no {infer} activity that uses the endpoint, the number of allocations will scale down to `0` automatically after 15 minutes.
61+
Use the `inference_id` of the endpoint in a <<semantic-text,`semantic_text`>> field definition or when creating an <<inference-processor,{infer} processor>>.
62+
The API call will automatically download and deploy the model which might take a couple of minutes.
63+
Default {infer} enpoints have {ml-docs}/ml-nlp-auto-scale.html#nlp-model-adaptive-allocations[adaptive allocations] enabled.
64+
For these models, the minimum number of allocations is `0`.
65+
If there is no {infer} activity that uses the endpoint, the number of allocations will scale down to `0` automatically after 15 minutes.
6666

6767

6868
[discrete]

0 commit comments

Comments
 (0)