
Stream responses #29

Merged
sonic182 merged 14 commits into master from feature/stream_responses
Jul 31, 2025

Conversation

@sonic182
Member

sonic182 commented Jul 29, 2025

Stream responses from providers

Requires the Finch adapter; other Tesla adapters may also work.

Supported providers for now (a usage sketch follows the list):

  • OpenAI
  • OpenRouter
  • Ollama
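
A hypothetical usage sketch of what streaming looks like from the caller's side (the `MyApp.LLM` module, the `:stream` option, and the chunk shape are all illustrative placeholders, not this library's actual API):

```elixir
# Hypothetical sketch -- MyApp.LLM, the :stream option, and the chunk shape
# are illustrative placeholders, not this library's actual API.
# Assumes the client is configured with the Finch-backed Tesla adapter.
{:ok, stream} =
  MyApp.LLM.chat(
    provider: :openai,
    messages: [%{role: "user", content: "Hello!"}],
    stream: true
  )

# Print each chunk as it arrives; enumeration halts on the first error.
stream
|> Stream.each(&IO.write/1)
|> Stream.run()
```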

Contributor

@dmoralesl left a comment


The implementation looks clean and solid, but I have a couple of questions about how this works:

  • Are the stream option and the structured_output feature compatible? How can they work together? If they are not, how does this feature help the AIAssistant provide real-time outputs?
  • What happens when a batch of the streaming response fails? Will the response have a missing part, or will the whole request fail? Is the case I'm describing even possible, or do the providers (Ollama, OpenRouter...) already handle it for us?

@sonic182
Member Author

  • Are the stream option and the structured_output feature compatible? How can they work together? If they are not, how does this feature help the AIAssistant provide real-time outputs?

Yes, it is compatible. It is a bit tricky to handle, but it is allowed (see https://openrouter.ai/docs/features/structured-outputs#streaming-with-structured-outputs).
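
A minimal sketch of why it is tricky (the chunk values are a stand-in for a provider's streamed deltas; Jason is an assumed JSON dependency): with structured outputs, each delta is a fragment of one JSON document, so nothing can be decoded until the stream completes.

```elixir
# The streamed deltas are fragments of one JSON document, so decoding can
# only happen after the stream finishes. Jason is an assumed dependency.
chunks = ["{\"answ", "er\": 4", "2}"]   # stand-in for the provider's deltas

decoded =
  chunks
  |> Enum.join()
  |> Jason.decode!()

IO.inspect(decoded)                      # => %{"answer" => 42}
```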

  • What happens when a batch of the streaming response fails? Will the response have a missing part, or will the whole request fail? Is the case I'm describing even possible, or do the providers (Ollama, OpenRouter...) already handle it for us?

I don't understand "batch of the streaming response". Do you mean that the connection may get lost while receiving data? If so, the chat will keep the text it managed to read.

@dmoralesl
Contributor

I don't understand "batch of the streaming response". Do you mean that the connection may get lost while receiving data? If so, the chat will keep the text it managed to read.

I mean, each "batch", "iteration", "part" (call it what you like) of the stream contains a piece of the response. Let's imagine the response is delivered in 3 iterations, ["This is", "a full", "response"], and the second one fails for some reason. Will the whole stream fail, or will the final response be "This is response"?
Maybe it is a dumb question; in that case just ignore this.

@sonic182
Member Author

I don't understand "batch of the streaming response". Do you mean that the connection may get lost while receiving data? If so, the chat will keep the text it managed to read.

I mean, each "batch", "iteration", "part" (call it what you like) of the stream contains a piece of the response. Let's imagine the response is delivered in 3 iterations, ["This is", "a full", "response"], and the second one fails for some reason. Will the whole stream fail, or will the final response be "This is response"? Maybe it is a dumb question; in that case just ignore this.

Whenever it fails, it will stop; the case you describe is not possible.
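
In plain Elixir terms, a toy illustration of that semantics (not this library's code): a failure mid-enumeration aborts the whole pipeline, so the consumer keeps whatever arrived before the failing chunk and can never end up with a hole in the middle of the text.

```elixir
# Toy illustration: the second "chunk" raises, enumeration stops there, and
# only the text accumulated before the failure survives.
chunks = ["This is ", :boom, "a full response"]

result =
  try do
    Enum.reduce(chunks, "", fn
      chunk, acc when is_binary(chunk) -> acc <> chunk
      :boom, _acc -> raise "chunk failed"
    end)
  rescue
    _ -> :aborted
  end

IO.inspect(result) # => :aborted -- "This is response" can never be produced
```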

dmoralesl self-requested a review July 29, 2025 15:12
Contributor

@mmacia left a comment


You're going to need to upgrade the Tesla dependency because of this PR I submitted to Tesla: elixir-tesla/tesla#767

@sonic182
Member Author

sonic182 commented Jul 29, 2025

You're going to need to upgrade the Tesla dependency because of this PR I submitted to Tesla: elixir-tesla/tesla#767

This only happens if the provider is on HTTP/2; OK, that's interesting to know.

Library users could enforce HTTP/1.1 (https://hexdocs.pm/finch/Finch.html#start_link/1-pool-configuration-options) or just pin a newer Tesla release with your fix whenever it arrives.
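
Per the Finch docs linked above, enforcing HTTP/1 looks roughly like this (the pool name `MyApp.Finch` is illustrative, and the option spelling depends on the Finch version: recent releases use `:protocols`, older ones `:protocol`):

```elixir
# Start Finch with HTTP/1-only pools so provider connections avoid the
# HTTP/2 streaming path. Option name varies by Finch version
# (:protocols in recent releases, :protocol in older ones).
children = [
  {Finch,
   name: MyApp.Finch,
   pools: %{
     :default => [protocols: [:http1]]
   }}
]

Supervisor.start_link(children, strategy: :one_for_one)
```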

@sonic182
Member Author

You're going to need to upgrade the Tesla dependency because of this PR I submitted to Tesla: elixir-tesla/tesla#767

I updated the minimum Tesla version in mix.exs.
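
For reference, the shape of that change in mix.exs (the version constraints below are placeholders, not the actual minimums pinned by this PR):

```elixir
defp deps do
  [
    # Placeholder constraints -- check this PR's mix.exs diff for the real
    # minimums; the point is to pull in a Tesla release containing the fix.
    {:tesla, "~> 1.14"},
    {:finch, "~> 0.19"}
  ]
end
```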

mmacia self-requested a review July 29, 2025 15:35
Contributor

@hectorperez left a comment


Awesome, Johanderson!

Two minor suggestions below and 🚀

sonic182 and others added 2 commits July 31, 2025 09:36
Co-authored-by: Hector Perez <hecpeare@gmail.com>
Co-authored-by: Hector Perez <hecpeare@gmail.com>
sonic182 merged commit 2c219e0 into master Jul 31, 2025
6 checks passed
sonic182 deleted the feature/stream_responses branch July 31, 2025 07:42
