
Conversation

@derekhiggins
Contributor

- Introduces vLLM provider support to the record/replay testing framework.
- Enables both recording and replay of vLLM API interactions alongside the existing Ollama support.

The changes enable testing of vLLM functionality. vLLM tests focus on
inference capabilities, while Ollama continues to exercise the full API surface
including vision features.
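
A minimal sketch of the record/replay pattern described above; the environment variable name, cache layout, and helper are illustrative assumptions, not the project's actual API:

```python
import json
import os
from pathlib import Path


def load_or_record(cache_path: Path, make_request):
    """Replay a stored API response if present, otherwise record a live one."""
    mode = os.environ.get("INFERENCE_TEST_MODE", "replay")
    if mode == "replay" and cache_path.exists():
        # Replay: serve the stored response; no live vLLM/Ollama server needed.
        return json.loads(cache_path.read_text())
    # Record: hit the live endpoint and persist the response for later replay.
    response = make_request()
    cache_path.write_text(json.dumps(response))
    return response
```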

--
This is an alternative to #3128, using Qwen3 instead of Llama 3.2 1B; Qwen3 appears to be more capable at structured output and tool calls.

@@ -168,6 +168,11 @@ class Setup(BaseModel):
roots=base_roots,
default_setup="ollama",
),
"base-vllm-subset": Suite(
Collaborator

is this needed anymore?

Contributor Author

My intent here was to add this job with only the tests in "tests/integration/inference" and then, once we're happy we haven't caused any major disruption, we could expand to the entire suite.
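
For illustration, a hedged sketch of what such a subset suite could look like; the Suite stand-in and field values below are assumptions based on the diff and the comment above, not the repo's exact code:

```python
from pydantic import BaseModel


class Suite(BaseModel):
    # Stand-in definition for illustration; field names mirror the diff above.
    roots: list[str]
    default_setup: str


SUITES = {
    "base-vllm-subset": Suite(
        # Start with only the inference tests, per the intent stated above;
        # the suite can grow to the full set once it proves non-disruptive.
        roots=["tests/integration/inference"],
        default_setup="vllm",
    ),
}
```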

Contributor

@derekhiggins you feel like this is ready?

Contributor Author

Yes, I believe so, although the ollama record job is broken; I've had to rebase with this commit #3898 in order to get new ollama recordings.

@derekhiggins force-pushed the vllm-ci-qwen branch 8 times, most recently from 1c3ea50 to 0f0d986 on October 15, 2025 09:58
It performs better in tool calling and structured output tests.

Signed-off-by: Derek Higgins <[email protected]>
Add vLLM provider support to integration test CI workflows alongside
existing Ollama support. Configure provider-specific test execution
where vLLM runs only inference-specific tests (excluding vision tests) while
Ollama continues to run the full test suite.

This enables comprehensive CI testing of both inference providers while
keeping the vLLM footprint small; it can be expanded later if it proves
not to be too disruptive.

Also updated test skips that were marked with "inline::vllm"; this
should be "remote::vllm". This causes some failing logprobs tests
to be skipped and should be revisited.

Signed-off-by: Derek Higgins <[email protected]>
Signed-off-by: Derek Higgins <[email protected]>
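
A hedged sketch of the corrected skip described in the commit message above; the helper name and skip message are illustrative, not the repo's exact code:

```python
import pytest


def maybe_skip_logprobs(provider_type: str) -> None:
    # "inline::vllm" never matched the CI provider, so the skip was dead code;
    # the remote provider registers as "remote::vllm", which makes it fire.
    if provider_type == "remote::vllm":
        pytest.skip("logprobs tests currently fail on remote::vllm; revisit")
```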
derekhiggins and others added 2 commits October 24, 2025 00:31
The vector_provider_wrapper was only limiting providers to faiss/sqlite-vec
for replay mode, but CI tests also run in record mode with the same limited
set of providers. This caused test failures when trying to test against
milvus, chromadb, pgvector, weaviate, and qdrant, which aren't configured
in the record job.
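
A hedged reconstruction of the fix this commit describes; the function and constant names are assumptions for illustration:

```python
ALLOWED_VECTOR_PROVIDERS = {"faiss", "sqlite-vec"}


def filter_vector_providers(providers: list[str], mode: str) -> list[str]:
    # Before the fix the filter applied only in "replay" mode; CI also runs
    # "record" jobs against the same limited set, so filter in both modes to
    # avoid selecting milvus, chromadb, pgvector, weaviate, or qdrant, which
    # aren't configured in the record job.
    if mode in ("record", "replay"):
        return [p for p in providers if p in ALLOWED_VECTOR_PROVIDERS]
    return providers
```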