> Looks like this landed three days ago: > - https://github.com/ggml-org/llama.cpp/pull/12379 > > So streaming tools might be possible pretty soon! _Originally posted by @simonw in [#3](https://github.com/simonw/llm-llama-server/issues/3#issuecomment-2915201037)_