Skip to content

Commit eab7596

Browse files
authored
Fix references to _unified inference API (#676)
1 parent 49cef65 commit eab7596

File tree

4 files changed

+7
-7
lines changed

4 files changed

+7
-7
lines changed

solutions/search/inference-api/chat-completion-inference-api.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,9 +23,9 @@ The {{infer}} APIs enable you to use certain services, such as built-in {{ml}} m
2323

2424
## {{api-request-title}} [chat-completion-inference-api-request]
2525

26-
`POST /_inference/<inference_id>/_unified`
26+
`POST /_inference/<inference_id>/_stream`
2727

28-
`POST /_inference/chat_completion/<inference_id>/_unified`
28+
`POST /_inference/chat_completion/<inference_id>/_stream`
2929

3030

3131
## {{api-prereq-title}} [chat-completion-inference-api-prereqs]
@@ -38,8 +38,8 @@ The {{infer}} APIs enable you to use certain services, such as built-in {{ml}} m
3838

3939
The chat completion {{infer}} API enables real-time responses for chat completion tasks by delivering answers incrementally, reducing response times during computation. It only works with the `chat_completion` task type for `openai` and `elastic` {{infer}} services.
4040

41-
::::{note}
42-
* The `chat_completion` task type is only available within the _unified API and only supports streaming.
41+
::::{note}
42+
* The `chat_completion` task type is only available within the `_stream` API and only supports streaming.
4343
* The Chat completion {{infer}} API and the Stream {{infer}} API differ in their response structure and capabilities. The Chat completion {{infer}} API provides more comprehensive customization options through more fields and function calling support. If you use the `openai` service or the `elastic` service, use the Chat completion {{infer}} API.
4444

4545
::::

solutions/search/inference-api/elastic-inference-service-eis.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ Creates an {{infer}} endpoint to perform an {{infer}} task with the `elastic` se
3636

3737

3838
::::{note}
39-
The `chat_completion` task type only supports streaming and only through the `_unified` API.
39+
The `chat_completion` task type only supports streaming and only through the `_stream` API.
4040

4141
For more information on how to use the `chat_completion` task type, please refer to the [chat completion documentation](/solutions/search/inference-api/chat-completion-inference-api.md).
4242

solutions/search/inference-api/openai-inference-integration.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ Creates an {{infer}} endpoint to perform an {{infer}} task with the `openai` ser
3737

3838

3939
::::{note}
40-
The `chat_completion` task type only supports streaming and only through the `_unified` API.
40+
The `chat_completion` task type only supports streaming and only through the `_stream` API.
4141

4242
For more information on how to use the `chat_completion` task type, please refer to the [chat completion documentation](https://www.elastic.co/guide/en/elasticsearch/reference/current/chat-completion-inference-api.html).
4343

solutions/search/semantic-search/cohere-es.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -258,7 +258,7 @@ Rerank the results using the new {{infer}} endpoint.
258258

259259
```py
260260
# Pass the query and the search results to the service
261-
response = client.inference.inference(
261+
response = client.inference.rerank(
262262
inference_id="cohere_rerank",
263263
body={
264264
"query": query,

0 commit comments

Comments
 (0)