You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/reference/inference/inference-apis.asciidoc
+4-8Lines changed: 4 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,10 +16,8 @@ models or if you want to use non-NLP models, use the
16
16
For the most up-to-date API details, refer to {api-es}/group/endpoint-inference[{infer-cap} APIs].
17
17
--
18
18
19
-
The {infer} APIs enable you to create {infer} endpoints and use {ml} models of
20
-
different providers - such as Amazon Bedrock, Anthropic, Azure AI Studio,
21
-
Cohere, Google AI, Mistral, OpenAI, or HuggingFace - as a service. Use
22
-
the following APIs to manage {infer} models and perform {infer}:
19
+
The {infer} APIs enable you to create {infer} endpoints and integrate with {ml} models of different services - such as Amazon Bedrock, Anthropic, Azure AI Studio, Cohere, Google AI, Mistral, OpenAI, or HuggingFace.
20
+
Use the following APIs to manage {infer} models and perform {infer}:
23
21
24
22
* <<delete-inference-api>>
25
23
* <<get-inference-api>>
@@ -37,10 +35,8 @@ An {infer} endpoint enables you to use the corresponding {ml} model without
37
35
manual deployment and apply it to your data at ingestion time through
38
36
<<semantic-search-semantic-text, semantic text>>.
39
37
40
-
Choose a model from your provider or use ELSER – a retrieval model trained by
41
-
Elastic –, then create an {infer} endpoint by the <<put-inference-api>>.
42
-
Now use <<semantic-search-semantic-text, semantic text>> to perform
43
-
<<semantic-search, semantic search>> on your data.
38
+
Choose a model from your service or use ELSER – a retrieval model trained by Elastic –, then create an {infer} endpoint by the <<put-inference-api>>.
39
+
Now use <<semantic-search-semantic-text, semantic text>> to perform <<semantic-search, semantic search>> on your data.
Refer to the service list in the <<put-inference-api-desc,API description section>> for the available task types.
45
+
Refer to the integration list in the <<put-inference-api-desc,API description section>> for the available task types.
46
46
--
47
47
48
48
@@ -54,15 +54,15 @@ The create {infer} API enables you to create an {infer} endpoint and configure a
54
54
55
55
[IMPORTANT]
56
56
====
57
-
* When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not already running.
57
+
* When creating an {infer} endpoint, the associated {ml} model is automatically deployed if it is not already running.
58
58
* After creating the endpoint, wait for the model deployment to complete before using it. You can verify the deployment status by using the <<get-trained-models-stats, Get trained model statistics>> API. In the response, look for `"state": "fully_allocated"` and ensure the `"allocation_count"` matches the `"target_allocation_count"`.
59
59
* Avoid creating multiple endpoints for the same model unless required, as each endpoint consumes significant resources.
60
60
====
61
61
62
62
63
-
The following services are available through the {infer} API.
64
-
You can find the available task types next to the service name.
65
-
Click the links to review the configuration details of the services:
63
+
The following integrations are available through the {infer} API.
64
+
You can find the available task types next to the integration name.
65
+
Click the links to review the configuration details of the integrations:
66
66
67
67
* <<infer-service-alibabacloud-ai-search,AlibabaCloud AI Search>> (`completion`, `rerank`, `sparse_embedding`, `text_embedding`)
0 commit comments