Skip to content

Commit d98a134

Browse files
committed
[DOCS] Adds note about differences between chat completion and stream API.
1 parent 22c0264 commit d98a134

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

docs/reference/inference/stream-inference.asciidoc

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -40,6 +40,8 @@ However, if you do not plan to use the {infer} APIs to use these models or if yo
4040
The stream {infer} API enables real-time responses for completion tasks by delivering answers incrementally, reducing response times during computation.
4141
It only works with the `completion` and `chat_completion` task types.
4242

43+
The Chat completion {infer} API and the Stream {infer} API differ in their response structure. If you use the `openai` service or the `elastic` service, use the Chat completion {infer} API.
44+
4345
[NOTE]
4446
====
4547
include::inference-shared.asciidoc[tag=chat-completion-docs]

0 commit comments

Comments
 (0)