
Conversation

@jakelorocco
Contributor

Discussion: #103

Currently, the draft PR adds the capability only to ollama, to highlight what changes were necessary.

Changes

  • m.act and m.validate had to be changed to support async calls
  • generate_from_context now returns a model output thunk that is ready for generation
    • the model output thunk gets functions for generating and processing the output
    • generate returns a model output thunk with awaitable values but is not itself an async function
    • this also sets up support for lazy computation
  • by default, validation will run asynchronously
  • sampling strategies had to be changed to support async generation

Note: I'm not happy with where/how some functions are defined (mostly the processing functions); I'm planning on moving those. These changes also set us up for simplifying backends: a generic backend could define the control flow of generating and prepping a model output thunk while calling the processing methods implemented by a specific backend. A sketch of this thunk-based flow is below.
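
As a rough illustration of that flow, here is a minimal sketch in Python. All names here (ModelOutputThunk, avalue, generate_from_context) are assumptions based on the description above, not the actual mellea implementation:

```python
import asyncio
from typing import Awaitable, Callable

# Hypothetical sketch only: the names and structure are illustrative and
# may not match the actual mellea implementation.
class ModelOutputThunk:
    def __init__(self, generate_fn: Callable[[], Awaitable[str]]):
        self._generate_fn = generate_fn  # attached by the backend
        self._value: str | None = None

    async def avalue(self) -> str:
        # Lazy: generation runs only when the value is first awaited.
        if self._value is None:
            self._value = await self._generate_fn()
        return self._value

def generate_from_context(prompt: str) -> ModelOutputThunk:
    # Synchronous entry point: returns a thunk that is ready for
    # generation but has not generated anything yet.
    async def _generate() -> str:
        await asyncio.sleep(0)  # stand-in for the real model call
        return f"response to: {prompt}"
    return ModelOutputThunk(_generate)

async def main() -> None:
    thunk = generate_from_context("hello")  # no generation yet
    print(await thunk.avalue())             # generation happens here

asyncio.run(main())
```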

@mergify

mergify bot commented Sep 11, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?:
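
For a quick local check, a title can be tested against this rule with Python's re module (a sketch; the pattern is copied verbatim from the rule above):

```python
import re

# Pattern copied from the merge protection rule above.
PATTERN = re.compile(
    r"^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?:"
)

for title in ("feat: add async and streaming", "feat(ollama): add streaming", "add streaming"):
    print(f"{title!r} -> {bool(PATTERN.match(title))}")
# 'feat: add async and streaming' -> True
# 'feat(ollama): add streaming' -> True
# 'add streaming' -> False
```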

Contributor

@nrfulton left a comment


The overall approach looks good to me. I do wonder about everything becoming async but I think it's okay.

@jakelorocco
Contributor Author

> The overall approach looks good to me. I do wonder about everything becoming async but I think it's okay.

@nrfulton, if we want to keep synchronous versions of functions around, we definitely can. generate_from_context would just need a parameter that selects the correct generate function depending on what's desired. It just makes higher-level abstractions like sampling strategies and result validation a bit more complicated, since they have to juggle the async and sync versions.
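
A minimal sketch of what that could look like, assuming a use_async flag (the parameter name and return shape are illustrative, not the actual mellea API):

```python
import asyncio
from typing import Coroutine, Union

# Assumed-for-illustration signature: a flag on generate_from_context picks
# the sync or async generate function; not the actual mellea API.
def generate_from_context(prompt: str, use_async: bool = True) -> Union[str, Coroutine]:
    async def agenerate() -> str:
        await asyncio.sleep(0)  # placeholder for an async model call
        return f"async response to: {prompt}"

    def generate() -> str:
        return f"sync response to: {prompt}"  # placeholder for a blocking call

    if use_async:
        return agenerate()  # caller must await the result
    return generate()       # plain value, no event loop required

# Callers such as sampling strategies now have to branch on which flavor
# they received, which is the juggling mentioned above.
sync_result = generate_from_context("hello", use_async=False)
async_result = asyncio.run(generate_from_context("hello"))
print(sync_result, async_result)
```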

Contributor

@HendrikStrobelt left a comment


I think I understand the logic... but currently, from the outside, .chat and .instruct act the same for streaming and non-streaming, right?

@jakelorocco
Contributor Author

> I think I understand the logic... but currently, from the outside, .chat and .instruct act the same for streaming and non-streaming, right?

Yes; once we introduce partial requirement checking / logging, you will be able to notice differences with streaming, but there are no real functional differences between the two.
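
For illustration, a hypothetical caller's view (the stream parameter and .value attribute are assumptions, not confirmed API; the session setup follows mellea's usual pattern):

```python
# Hypothetical illustration; the stream parameter and .value attribute are
# assumptions, not the confirmed mellea API.
import mellea

m = mellea.start_session()

non_streamed = m.chat("tell me a joke", stream=False)
streamed = m.chat("tell me a joke", stream=True)

# From the outside, the caller reads the completed output the same way in
# both cases; internally, the streaming call would consume partial chunks
# (e.g., for future partial requirement checking / logging) before completion.
print(non_streamed.value)
print(streamed.value)
```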

@jakelorocco changed the title from "feat: add async and streaming (wip; only for ollama)" to "feat: add async and streaming" on Sep 17, 2025
@jakelorocco
Contributor Author

I still need to make a few tweaks and push my changes to fix aloras / logging.

@jakelorocco
Contributor Author

@nrfulton @HendrikStrobelt
I was unable to test the vllm/openai aloras because there is some issue with our vllm script (I will open an issue for that as well). All the other tests passed locally for me (but we'll see what the GitHub Actions say).

Still need to finish up and push the documentation, but the code is ready for review. I'll monitor for test failures; I believe I fixed all of the things that were failing due to merging main back into my branch.

@jakelorocco marked this pull request as ready for review on September 22, 2025 at 20:22
@jakelorocco
Contributor Author

I am trying to debug why the tests are failing; it's some out-of-space issue, so I'm not sure if I just happened to be the unlucky one who pushed us over the edge or if some change I made is actually causing the issue.

@jakelorocco
Contributor Author

Tested that the new code works in Colab by installing my specific branch of mellea. Didn't see any issues in the notebook.

@jakelorocco merged commit 4ee56a9 into main on Sep 23, 2025
4 checks passed
@jakelorocco deleted the jal/async-streaming branch on September 23, 2025 at 20:37