diff --git a/docs/reference/inference/inference-apis.asciidoc b/docs/reference/inference/inference-apis.asciidoc index 4f27409973ca2..6c97f388788f7 100644 --- a/docs/reference/inference/inference-apis.asciidoc +++ b/docs/reference/inference/inference-apis.asciidoc @@ -16,10 +16,8 @@ models or if you want to use non-NLP models, use the For the most up-to-date API details, refer to {api-es}/group/endpoint-inference[{infer-cap} APIs]. -- -The {infer} APIs enable you to create {infer} endpoints and use {ml} models of -different providers - such as Amazon Bedrock, Anthropic, Azure AI Studio, -Cohere, Google AI, Mistral, OpenAI, or HuggingFace - as a service. Use -the following APIs to manage {infer} models and perform {infer}: +The {infer} APIs enable you to create {infer} endpoints and integrate with {ml} models of different services - such as Amazon Bedrock, Anthropic, Azure AI Studio, Cohere, Google AI, Mistral, OpenAI, or HuggingFace. +Use the following APIs to manage {infer} models and perform {infer}: * <> * <> @@ -37,10 +35,8 @@ An {infer} endpoint enables you to use the corresponding {ml} model without manual deployment and apply it to your data at ingestion time through <>. -Choose a model from your provider or use ELSER – a retrieval model trained by -Elastic –, then create an {infer} endpoint by the <>. -Now use <> to perform -<> on your data. +Choose a model from your service or use ELSER – a retrieval model trained by Elastic –, then create an {infer} endpoint by the <>. +Now use <> to perform <> on your data. [discrete] [[adaptive-allocations]] diff --git a/docs/reference/inference/put-inference.asciidoc b/docs/reference/inference/put-inference.asciidoc index da07d1d3e7d84..4e149667d6298 100644 --- a/docs/reference/inference/put-inference.asciidoc +++ b/docs/reference/inference/put-inference.asciidoc @@ -42,7 +42,7 @@ include::inference-shared.asciidoc[tag=inference-id] include::inference-shared.asciidoc[tag=task-type] + -- -Refer to the service list in the <> for the available task types. +Refer to the integration list in the <> for the available task types. -- @@ -54,15 +54,15 @@ The create {infer} API enables you to create an {infer} endpoint and configure a [IMPORTANT] ==== -* When creating an inference endpoint, the associated machine learning model is automatically deployed if it is not already running. +* When creating an {infer} endpoint, the associated {ml} model is automatically deployed if it is not already running. * After creating the endpoint, wait for the model deployment to complete before using it. You can verify the deployment status by using the <> API. In the response, look for `"state": "fully_allocated"` and ensure the `"allocation_count"` matches the `"target_allocation_count"`. * Avoid creating multiple endpoints for the same model unless required, as each endpoint consumes significant resources. ==== -The following services are available through the {infer} API. -You can find the available task types next to the service name. -Click the links to review the configuration details of the services: +The following integrations are available through the {infer} API. +You can find the available task types next to the integration name. +Click the links to review the configuration details of the integrations: * <> (`completion`, `rerank`, `sparse_embedding`, `text_embedding`) * <> (`completion`, `text_embedding`) @@ -80,14 +80,14 @@ Click the links to review the configuration details of the services: * <> (`text_embedding`) * <> (`text_embedding`, `rerank`) -The {es} and ELSER services run on a {ml} node in your {es} cluster. The rest of -the services connect to external providers. +The {es} and ELSER services run on a {ml} node in your {es} cluster. +The rest of the integrations connect to external services. [discrete] [[adaptive-allocations-put-inference]] ==== Adaptive allocations -Adaptive allocations allow inference services to dynamically adjust the number of model allocations based on the current load. +Adaptive allocations allow inference endpoints to dynamically adjust the number of model allocations based on the current load. When adaptive allocations are enabled: diff --git a/docs/reference/inference/service-alibabacloud-ai-search.asciidoc b/docs/reference/inference/service-alibabacloud-ai-search.asciidoc index 5a11190549ce6..eea0e094dce5a 100644 --- a/docs/reference/inference/service-alibabacloud-ai-search.asciidoc +++ b/docs/reference/inference/service-alibabacloud-ai-search.asciidoc @@ -1,5 +1,5 @@ [[infer-service-alibabacloud-ai-search]] -=== AlibabaCloud AI Search {infer} service +=== AlibabaCloud AI Search {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-amazon-bedrock.asciidoc b/docs/reference/inference/service-amazon-bedrock.asciidoc index ed25ce0d515b5..d4ae3895b7c76 100644 --- a/docs/reference/inference/service-amazon-bedrock.asciidoc +++ b/docs/reference/inference/service-amazon-bedrock.asciidoc @@ -1,5 +1,5 @@ [[infer-service-amazon-bedrock]] -=== Amazon Bedrock {infer} service +=== Amazon Bedrock {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-anthropic.asciidoc b/docs/reference/inference/service-anthropic.asciidoc index 4ce76dc1d57bd..08d8ca43daea8 100644 --- a/docs/reference/inference/service-anthropic.asciidoc +++ b/docs/reference/inference/service-anthropic.asciidoc @@ -1,5 +1,5 @@ [[infer-service-anthropic]] -=== Anthropic {infer} service +=== Anthropic {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-azure-ai-studio.asciidoc b/docs/reference/inference/service-azure-ai-studio.asciidoc index 7ada8df1ecdaa..b179a87de1594 100644 --- a/docs/reference/inference/service-azure-ai-studio.asciidoc +++ b/docs/reference/inference/service-azure-ai-studio.asciidoc @@ -1,5 +1,5 @@ [[infer-service-azure-ai-studio]] -=== Azure AI studio {infer} service +=== Azure AI studio {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-azure-openai.asciidoc b/docs/reference/inference/service-azure-openai.asciidoc index 170c0939166f7..eded44b7ab0b0 100644 --- a/docs/reference/inference/service-azure-openai.asciidoc +++ b/docs/reference/inference/service-azure-openai.asciidoc @@ -1,5 +1,5 @@ [[infer-service-azure-openai]] -=== Azure OpenAI {infer} service +=== Azure OpenAI {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-cohere.asciidoc b/docs/reference/inference/service-cohere.asciidoc index 70e311c810cdd..e95f0810fd29d 100644 --- a/docs/reference/inference/service-cohere.asciidoc +++ b/docs/reference/inference/service-cohere.asciidoc @@ -1,5 +1,5 @@ [[infer-service-cohere]] -=== Cohere {infer} service +=== Cohere {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-elasticsearch.asciidoc b/docs/reference/inference/service-elasticsearch.asciidoc index 8870fbed357a6..745b14904dd6d 100644 --- a/docs/reference/inference/service-elasticsearch.asciidoc +++ b/docs/reference/inference/service-elasticsearch.asciidoc @@ -1,5 +1,5 @@ [[infer-service-elasticsearch]] -=== Elasticsearch {infer} service +=== Elasticsearch {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-elser.asciidoc b/docs/reference/inference/service-elser.asciidoc index 47aaa58814602..6a509ec850903 100644 --- a/docs/reference/inference/service-elser.asciidoc +++ b/docs/reference/inference/service-elser.asciidoc @@ -1,5 +1,5 @@ [[infer-service-elser]] -=== ELSER {infer} service +=== ELSER {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-google-ai-studio.asciidoc b/docs/reference/inference/service-google-ai-studio.asciidoc index 5b30292fb9beb..a6f7d914decfa 100644 --- a/docs/reference/inference/service-google-ai-studio.asciidoc +++ b/docs/reference/inference/service-google-ai-studio.asciidoc @@ -1,5 +1,5 @@ [[infer-service-google-ai-studio]] -=== Google AI Studio {infer} service +=== Google AI Studio {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-google-vertex-ai.asciidoc b/docs/reference/inference/service-google-vertex-ai.asciidoc index 28fa65b6e5fcc..f9499de7e5602 100644 --- a/docs/reference/inference/service-google-vertex-ai.asciidoc +++ b/docs/reference/inference/service-google-vertex-ai.asciidoc @@ -1,5 +1,5 @@ [[infer-service-google-vertex-ai]] -=== Google Vertex AI {infer} service +=== Google Vertex AI {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-hugging-face.asciidoc b/docs/reference/inference/service-hugging-face.asciidoc index 862914c141740..40fb2002975dd 100644 --- a/docs/reference/inference/service-hugging-face.asciidoc +++ b/docs/reference/inference/service-hugging-face.asciidoc @@ -1,5 +1,5 @@ [[infer-service-hugging-face]] -=== HuggingFace {infer} service +=== HuggingFace {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-jinaai.asciidoc b/docs/reference/inference/service-jinaai.asciidoc index 7c5aebe5bcf8e..1470c58315430 100644 --- a/docs/reference/inference/service-jinaai.asciidoc +++ b/docs/reference/inference/service-jinaai.asciidoc @@ -1,5 +1,5 @@ [[infer-service-jinaai]] -=== JinaAI {infer} service +=== JinaAI {infer} integration Creates an {infer} endpoint to perform an {infer} task with the `jinaai` service. diff --git a/docs/reference/inference/service-mistral.asciidoc b/docs/reference/inference/service-mistral.asciidoc index 326e8458be767..20e1133e8a83c 100644 --- a/docs/reference/inference/service-mistral.asciidoc +++ b/docs/reference/inference/service-mistral.asciidoc @@ -1,5 +1,5 @@ [[infer-service-mistral]] -=== Mistral {infer} service +=== Mistral {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-openai.asciidoc b/docs/reference/inference/service-openai.asciidoc index 590f280b1c494..8d7c6c937333d 100644 --- a/docs/reference/inference/service-openai.asciidoc +++ b/docs/reference/inference/service-openai.asciidoc @@ -1,5 +1,5 @@ [[infer-service-openai]] -=== OpenAI {infer} service +=== OpenAI {infer} integration .New API reference [sidebar] diff --git a/docs/reference/inference/service-watsonx-ai.asciidoc b/docs/reference/inference/service-watsonx-ai.asciidoc index e7bba7b4e9a97..31d246a36d350 100644 --- a/docs/reference/inference/service-watsonx-ai.asciidoc +++ b/docs/reference/inference/service-watsonx-ai.asciidoc @@ -1,5 +1,5 @@ [[infer-service-watsonx-ai]] -=== Watsonx {infer} service +=== Watsonx {infer} integration .New API reference [sidebar]