@@ -31,27 +31,46 @@ use the <<ml-df-trained-models-apis>>.
* Requires the `manage_inference` <<privileges-list-cluster,cluster privilege>>
(the built-in `inference_admin` role grants this privilege)

+ [discrete]
+ [[put-inference-api-path-params]]
+ ==== {api-path-parms-title}
+
+ `<inference_id>`::
+ (Required, string)
+ include::inference-shared.asciidoc[tag=inference-id]
+
+ `<task_type>`::
+ (Required, string)
+ include::inference-shared.asciidoc[tag=task-type]
+ +
+ --
+ Refer to the service list in the <<put-inference-api-desc,API description section>> for the available task types.
+ --
+
+
[discrete]
[[put-inference-api-desc]]
==== {api-description-title}

The create {infer} API enables you to create an {infer} endpoint and configure a {ml} model to perform a specific {infer} task.

- The following services are available through the {infer} API, click the links to review the configuration details of the services:
-
- * <<infer-service-alibabacloud-ai-search,AlibabaCloud AI Search>>
- * <<infer-service-amazon-bedrock,Amazon Bedrock>>
- * <<infer-service-anthropic,Anthropic>>
- * <<infer-service-azure-ai-studio,Azure AI Studio>>
- * <<infer-service-azure-openai,Azure OpenAI>>
- * <<infer-service-cohere,Cohere>>
- * <<infer-service-elasticsearch,Elasticsearch>> (for built-in models and models uploaded through Eland)
- * <<infer-service-elser,ELSER>>
- * <<infer-service-google-ai-studio,Google AI Studio>>
- * <<infer-service-google-vertex-ai,Google Vertex AI>>
- * <<infer-service-hugging-face,Hugging Face>>
- * <<infer-service-mistral,Mistral>>
- * <<infer-service-openai,OpenAI>>
+ The following services are available through the {infer} API.
+ You can find the available task types next to the service name.
+ Click the links to review the configuration details of the services:
+
+ * <<infer-service-alibabacloud-ai-search,AlibabaCloud AI Search>> (`rerank`, `sparse_embedding`, `text_embedding`)
+ * <<infer-service-amazon-bedrock,Amazon Bedrock>> (`completion`, `text_embedding`)
+ * <<infer-service-anthropic,Anthropic>> (`completion`)
+ * <<infer-service-azure-ai-studio,Azure AI Studio>> (`completion`, `text_embedding`)
+ * <<infer-service-azure-openai,Azure OpenAI>> (`completion`, `text_embedding`)
+ * <<infer-service-cohere,Cohere>> (`completion`, `rerank`, `text_embedding`)
+ * <<infer-service-elasticsearch,Elasticsearch>> (`rerank`, `sparse_embedding`, `text_embedding` - this service is for built-in models and models uploaded through Eland)
+ * <<infer-service-elser,ELSER>> (`sparse_embedding`)
+ * <<infer-service-google-ai-studio,Google AI Studio>> (`completion`, `text_embedding`)
+ * <<infer-service-google-vertex-ai,Google Vertex AI>> (`rerank`, `text_embedding`)
+ * <<infer-service-hugging-face,Hugging Face>> (`text_embedding`)
+ * <<infer-service-mistral,Mistral>> (`text_embedding`)
+ * <<infer-service-openai,OpenAI>> (`completion`, `text_embedding`)

The {es} and ELSER services run on a {ml} node in your {es} cluster. The rest of
the services connect to external providers.
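
The two path parameters and the per-service task types can be tied together in a short Python sketch. It is illustrative only, not part of the change. The `/_inference/<task_type>/<inference_id>` path layout is an assumption, since this section does not show it. The dictionary keys are the display names from the list, not the service identifiers the API expects, and `inference_path` is a hypothetical helper.

```python
from urllib.parse import quote

# Task types per service, transcribed from the service list in this section.
# Keys are display names only; the real service identifiers are not shown here.
SERVICE_TASK_TYPES = {
    "AlibabaCloud AI Search": {"rerank", "sparse_embedding", "text_embedding"},
    "Amazon Bedrock": {"completion", "text_embedding"},
    "Anthropic": {"completion"},
    "Azure AI Studio": {"completion", "text_embedding"},
    "Azure OpenAI": {"completion", "text_embedding"},
    "Cohere": {"completion", "rerank", "text_embedding"},
    "Elasticsearch": {"rerank", "sparse_embedding", "text_embedding"},
    "ELSER": {"sparse_embedding"},
    "Google AI Studio": {"completion", "text_embedding"},
    "Google Vertex AI": {"rerank", "text_embedding"},
    "Hugging Face": {"text_embedding"},
    "Mistral": {"text_embedding"},
    "OpenAI": {"completion", "text_embedding"},
}


def inference_path(service: str, task_type: str, inference_id: str) -> str:
    """Build the request path for creating an endpoint (hypothetical helper).

    Rejects task types that the listed service does not support.
    The path layout is assumed, not taken from this section.
    """
    supported = SERVICE_TASK_TYPES.get(service)
    if supported is None:
        raise ValueError(f"unknown service: {service!r}")
    if task_type not in supported:
        raise ValueError(
            f"{service} does not support {task_type!r}; "
            f"expected one of {sorted(supported)}"
        )
    # Percent-encode both segments so they are safe inside a URL path.
    safe_task = quote(task_type, safe="")
    safe_id = quote(inference_id, safe="")
    return f"/_inference/{safe_task}/{safe_id}"
```

For example, `inference_path("Cohere", "rerank", "my-rerank")` returns a path for a rerank endpoint, while `inference_path("Mistral", "completion", "x")` raises `ValueError` because the list gives only `text_embedding` for Mistral.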