perf: use deque for async response chunk iteration by giulio-leone · Pull Request #1362 · simonw/llm

giulio-leone · 2026-02-28T05:04:04Z

Problem

AsyncResponse.__anext__() replays cached chunks when the response is already done via list.pop(0), which is O(n) per removal. For long responses with many chunks, replaying becomes O(n²).

Solution

Switch _iter_chunks from list to collections.deque, replacing .pop(0) with .popleft() for O(1) front removal.

Changes

llm/models.py:
- Import deque from collections
- _iter_chunks = deque(self._chunks) instead of list(...)
- .pop(0) → .popleft()

Testing

Syntax verified via ast.parse()

AsyncResponse.__anext__() replays cached chunks via list.pop(0) when the response is already done, which is O(n) per removal. Switch to collections.deque with popleft() for O(1).

giulio-leone · 2026-02-28T21:33:27Z

Friendly ping — CI is green and this is ready for review. Happy to address any feedback. Thanks!

perf: use deque for async response chunk iteration

3f80a2a

AsyncResponse.__anext__() replays cached chunks via list.pop(0) when the response is already done, which is O(n) per removal. Switch to collections.deque with popleft() for O(1).

giulio-leone force-pushed the fix/async-iter-deque-performance branch from 553ccca to 3f80a2a Compare February 28, 2026 14:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: use deque for async response chunk iteration#1362

perf: use deque for async response chunk iteration#1362
giulio-leone wants to merge 1 commit intosimonw:mainfrom
giulio-leone:fix/async-iter-deque-performance

giulio-leone commented Feb 28, 2026

Uh oh!

giulio-leone commented Feb 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

giulio-leone commented Feb 28, 2026

Problem

Solution

Changes

Testing

Uh oh!

giulio-leone commented Feb 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant