
Conversation

@martin-purplefish (Contributor) commented Jan 4, 2026

Summary

Fixes a bug in StreamPacerWrapper where calling end_input() did not immediately send remaining buffered sentences to TTS, causing multi-second delays in agent responses.

The Bug

When end_input() is called (indicating the user has finished speaking), the pacer continues to wait based on the remaining_audio timer calculation instead of immediately sending all remaining text:

  1. end_input() only woke the send task conditionally - _wakeup_event.set() was called only when the audio emitter's destination channel was already closed, not while it was still open
  2. The send condition didn't account for input ending - the send loop only sent text when it was the first sentence, or when generation had stopped and remaining audio was low (see the sketch after this list)
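
To make the flow concrete, here is a minimal sketch of the buggy pattern described above. It is illustrative only: the class and the `_dest_closed` flag are assumed names, not the actual StreamPacerWrapper code; only `_wakeup_event`, `_sentences`, and `_input_ended` come from the description above.

```python
import asyncio

# Minimal sketch of the buggy flow, for illustration only; the real
# StreamPacerWrapper is more involved and `_dest_closed` is an assumed name.
class BuggyPacerSketch:
    def __init__(self) -> None:
        self._wakeup_event = asyncio.Event()
        self._sentences: list[str] = []
        self._input_ended = False
        self._dest_closed = False  # audio emitter's destination channel state

    def end_input(self) -> None:
        self._input_ended = True
        # Bug 1: the send task is woken only when the destination channel is
        # already closed; with an open emitter it keeps sleeping on its timer.
        if self._dest_closed:
            self._wakeup_event.set()

    def _should_send(self, is_first: bool, gen_stopped: bool,
                     remaining_audio: float, min_remaining_audio: float) -> bool:
        # Bug 2: no branch for "input ended with sentences still buffered".
        return is_first or (gen_stopped and remaining_audio <= min_remaining_audio)
```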

Example of the Problem

With min_remaining_audio = 5.0s:

  • t=0.0s: First sentence sent; TTS produces 10s audio
  • t=0.5s: Two more sentences queued
  • t=0.6s: end_input() called while audio emitter is still open
    • _input_ended = True, but no wakeup occurs
    • Send task sleeps on timer: remaining_audio - min_remaining_audio = 10 - 5 = 5s
  • t=5.5s: Next send finally happens

Result: ~5 second delay after user finishes speaking before remaining sentences are synthesized.

Changes

  1. Always wake the send task on end_input() - moved _wakeup_event.set() outside the conditional
  2. Added a send condition for ended input - (self._input_ended and self._sentences) triggers immediate sending (see the sketch below)
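
For contrast, a sketch of the same two spots after the change; again, everything beyond the attribute names quoted above is hypothetical, not the actual implementation.

```python
import asyncio

# Sketch of the two changes; illustrative only, not the actual StreamPacerWrapper.
class FixedPacerSketch:
    def __init__(self) -> None:
        self._wakeup_event = asyncio.Event()
        self._sentences: list[str] = []
        self._input_ended = False

    def end_input(self) -> None:
        self._input_ended = True
        # Change 1: wake the send task unconditionally.
        self._wakeup_event.set()

    def _should_send(self, is_first: bool, gen_stopped: bool,
                     remaining_audio: float, min_remaining_audio: float) -> bool:
        # Change 2: once input has ended, any buffered sentences are sent
        # immediately instead of waiting on the remaining_audio timer.
        if self._input_ended and self._sentences:
            return True
        return is_first or (gen_stopped and remaining_audio <= min_remaining_audio)
```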

Why this is correct

The purpose of pacing is to:

  1. Reduce waste from interruptions - not relevant once input ends; we're committed to this response
  2. Send larger chunks for better speech quality - still respected via max_text_length batching

Once input has ended, we know exactly what text needs to be synthesized and there's no benefit to delaying. The max_text_length batching is still respected, so we're not bypassing quality optimizations - just the waiting.

Test plan

  • Verify that when end_input() is called with pending sentences, they are sent immediately (within ~1 event loop iteration)
  • Verify that max_text_length batching is still respected when input ends
  • Verify normal pacing behavior is unchanged when input has not ended (a minimal test sketch follows)
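
A minimal test sketch for the first and third bullets, written against the simplified send condition from the sketches above rather than the real StreamPacerWrapper API; `should_send` and its parameters are hypothetical.

```python
# Hypothetical helper mirroring the simplified send condition; not the real API.
def should_send(input_ended: bool, pending: int, is_first: bool,
                gen_stopped: bool, remaining_audio: float,
                min_remaining_audio: float) -> bool:
    if input_ended and pending > 0:
        return True
    return is_first or (gen_stopped and remaining_audio <= min_remaining_audio)

def test_end_input_flushes_pending_sentences() -> None:
    # Input ended with sentences queued: must not wait on the remaining_audio timer.
    assert should_send(input_ended=True, pending=2, is_first=False,
                       gen_stopped=False, remaining_audio=10.0,
                       min_remaining_audio=5.0)

def test_pacing_unchanged_before_end_input() -> None:
    # Input still open and plenty of audio buffered: the pacer keeps waiting.
    assert not should_send(input_ended=False, pending=2, is_first=False,
                           gen_stopped=False, remaining_audio=10.0,
                           min_remaining_audio=5.0)
```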

🤖 Generated with Claude Code

@martin-purplefish (Contributor, Author) commented Jan 4, 2026

Hunting down random delays in TTS - this seems promising. It feels like an obvious bug - though I'm not convinced about sending/flushing all of the sentences versus dropping them. Tested this out locally.

@longcw (Contributor) commented Jan 4, 2026

> Hunting down random delays in TTS

do you have text_pacing enabled? it's disabled by default and it's used to slow down the TTS generation after the first sentence to save TTS usage in case the speech is interrupted.

> When end_input() is called (indicating the user has finished speaking), the pacer continues to wait based on the remaining_audio timer calculation instead of immediately sending all remaining text

this is the intended behavior. end_input is called after LLM generation, which is usually faster than audio playout, so we don't want to start the rest of the TTS generation immediately; it should wait for the audio playout as usual even after end_input is called.

> With min_remaining_audio = 5.0s:
>
> t=0.0s: First sentence sent; TTS produces 10s audio
> t=0.5s: Two more sentences queued
> t=0.6s: end_input() called while audio emitter is still open
> _input_ended = True, but no wakeup occurs
> Send task sleeps on timer: remaining_audio - min_remaining_audio = 10 - 5 = 5s
> t=5.5s: Next send finally happens
> Result: ~5 second delay after user finishes speaking before remaining sentences are synthesized.

this is the expected result with tts text pacing enabled; I guess the AI tool raised this as a bug because it didn't know we are playing the audio in real time instead of trying to generate audio as fast as possible.
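
A rough back-of-envelope using the numbers from the quoted example, to make the real-time playout point concrete (illustrative only, not library code):

```python
# With real-time playout, the next sentence only needs to reach TTS shortly
# before the buffered audio drops below the pacing threshold.
audio_buffered = 10.0       # seconds of audio produced from the first sentence
min_remaining_audio = 5.0   # pacing threshold from the example
next_send_at = audio_buffered - min_remaining_audio  # ~5.0s of wall-clock time
print(f"next sentence can safely wait until ~t={next_send_at:.1f}s of playout")
```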

@longcw (Contributor) commented Jan 4, 2026

here is an example of using tts text pacing: https://github.com/livekit/agents/blob/[email protected]/examples/voice_agents/tts_text_pacing.py, where you can see logs like the following indicating the sentences were sent to the TTS while the remaining audio was about to finish playing:

22:23:28.321 DEBUG livekit.agents sent text to tts
{"text": " She was known for ...", "remaining_audio": 4.859836101531982, "pid": 2875548, "job_id": "AJ_Btwoh8EoeLjY", "room_id": "RM_R6ZuwrSQUVY5"}

@martin-purplefish (Contributor, Author) commented
Oh got it, so this is expected then. No, we don't have text_pacing enabled. Wouldn't the behavior still happen though? Will close!

@longcw
Copy link
Contributor

longcw commented Jan 4, 2026

it shouldn't happen if text pacing is not enabled explicitly, as in the example.
