[Test] Add e2e tests for Qwen3-TTS speech endpoint by linyueqian · Pull Request #1206 · vllm-project/vllm-omni

linyueqian · 2026-02-04T22:06:50Z

Purpose

Add e2e tests for Qwen3-TTS /v1/audio/speech endpoint. The existing unit tests used mocks that didn't match real behavior, allowing bugs like #1159 to slip through undetected.

Also add documentation for the TTS speech API.

Test Plan

pytest tests/e2e/online_serving/test_qwen3_tts.py -v -s

Test Result

Verified the test correctly catches the multimodal_output bug:

Without fix (main branch):
FAILED tests/e2e/online_serving/test_qwen3_tts.py::TestQwen3TTSCustomVoice::test_speech_english_basic
Server returns {"error":{"message":"TTS model did not produce audio output."}} instead of WAV audio.
With fix:
PASSED - Valid WAV audio returned (87KB, 24000Hz mono)
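The pass/fail distinction above comes down to whether the response body is a WAV container or a JSON error. A minimal sketch of such a check, using only the standard library, might look like the following. The helper names `describe_wav` and `make_silent_wav` are hypothetical and are not taken from the test file in this PR.

```python
import io
import wave


def describe_wav(body: bytes) -> dict:
    """Parse response bytes as WAV and return basic properties.

    Raises ValueError when the body is not a RIFF/WAVE container, which is
    the case when the server answers with a JSON error instead of audio.
    """
    if body[:4] != b"RIFF" or body[8:12] != b"WAVE":
        raise ValueError("response is not WAV audio")
    with wave.open(io.BytesIO(body), "rb") as wav:
        return {
            "sample_rate": wav.getframerate(),
            "channels": wav.getnchannels(),
            "frames": wav.getnframes(),
        }


def make_silent_wav(sample_rate: int = 24000, seconds: float = 0.1) -> bytes:
    """Build a small mono 16-bit WAV in memory (stand-in for a server response)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * int(sample_rate * seconds))
    return buf.getvalue()
```

In an e2e test, `describe_wav` would be applied to the raw bytes returned by the endpoint, and the assertions would compare `sample_rate` and `channels` against the expected 24000 Hz mono output.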

Signed-off-by: linyueqian <linyueqian@outlook.com>

hsliuustc0106 · 2026-02-05T02:06:22Z

could you please also add the api server endpoint in doc?

Copilot

Pull request overview

This PR adds comprehensive e2e tests for the Qwen3-TTS model's /v1/audio/speech endpoint. The tests were created in response to bug #1159, where unit tests with mocks didn't catch real behavior issues. The new tests verify actual model inference without mocks.

Changes:

Adds e2e tests for Qwen3-TTS CustomVoice and VoiceDesign models
Tests /v1/audio/speech, /v1/audio/voices, and /v1/models endpoints
Includes regression test for multimodal_output bug that caused "TTS model did not produce audio output" error
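For orientation, a request to the speech endpoint could be assembled roughly as below. This is a sketch under assumptions: the payload fields `model`, `input`, and `voice` follow the OpenAI-style speech API and are not confirmed by this thread, the port 8091 is the one mentioned in a later documentation commit, and `build_speech_request` is a hypothetical helper.

```python
import json
import urllib.request

# Assumption: the server listens locally on port 8091.
BASE_URL = "http://localhost:8091"


def build_speech_request(text, voice, model, base_url=BASE_URL):
    """Build (but do not send) a POST request for /v1/audio/speech.

    The payload field names are assumed, not taken from the PR.
    """
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` against a running server would send the request; the response body is then expected to be audio bytes rather than a JSON error.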

tests/e2e/online_serving/test_qwen3_tts.py

linyueqian · 2026-02-05T02:26:20Z

> could you please also add the api server endpoint in doc?

OK. I have added that doc.

docs/serving/speech_api.md

Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>

congw729

Please add markers as suggested; more usage could be checked at #577

linyueqian · 2026-02-06T20:44:27Z

> Please add markers as suggested; more usage could be checked at #577

Thank you! I have added them.

hsliuustc0106 · 2026-02-06T23:30:23Z

fix ci please

linyueqian · 2026-02-07T01:03:01Z

> fix ci please

Need to merge #1203 first. The test catches that bug so it's expected to fail without it.

hsliuustc0106 · 2026-02-08T06:04:45Z

> > fix ci please
>
> Need to merge #1203 first. The test catches that bug so it's expected to fail without it.

merged, fix ci please

linyueqian · 2026-02-08T23:55:32Z

@hsliuustc0106 Should be fixed now.

zhumingjue138 · 2026-02-09T01:38:47Z

We plan to set the running time (timeout_in_minutes) of the daily CI to 10 minutes. Could we keep only the core code changes in this PR to reduce CI runtime, and move the remaining changes to the nightly CI (.buildkite/test-nightly.yaml)?

linyueqian · 2026-02-09T04:14:47Z

> We plan to set the running time (timeout_in_minutes) of the daily CI to 10 minutes. Could we keep only the core code changes in this PR to reduce CI runtime, and move the remaining changes to the nightly CI (.buildkite/test-nightly.yaml)?

The full test suite only takes ~6 min so it should fit within the 10 min limit. I've reduced the timeout accordingly. Happy to split if you still think it's needed though.

congw729 · 2026-02-09T09:43:42Z

Hi, I noticed that you removed the markers to get the CI test to pass. That error is already fixed now that #577 is merged. You can add the markers back and try again : )

linyueqian · 2026-02-10T00:12:18Z

@congw729 Added back @hardware_test, @pytest.mark.core_model and @pytest.mark.omni on all tests.

@hsliuustc0106 CI is passing now. One thing I ran into: record_audio_generated_frames added in the recently merged #891 crashes when the audio output tensor is 0-dim (scalar), since shape[0] raises IndexError. I wrapped it in try/except so metrics errors don't break speech generation.
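The failure mode described above can be sketched without the real metrics code. In the example below, `count_audio_frames` and `record_frames_safely` are hypothetical stand-ins (the actual signature of `record_audio_generated_frames` is not shown in this thread), and a `SimpleNamespace` with an empty `shape` tuple imitates a 0-dim tensor.

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)


def count_audio_frames(audio) -> int:
    """Return the size of the first dimension; fails on 0-dim input."""
    return audio.shape[0]  # raises IndexError when shape == ()


def record_frames_safely(audio, record) -> bool:
    """Call record(frame_count) but never let a metrics error propagate.

    Returns True if the metric was recorded, False if it was skipped.
    """
    try:
        record(count_audio_frames(audio))
        return True
    except Exception:
        # Metrics are best-effort: log and continue so that speech
        # generation is not interrupted.
        logger.warning("skipping audio frame metric", exc_info=True)
        return False


# Stand-ins for a 1-D audio tensor and a 0-dim (scalar) tensor.
normal_audio = SimpleNamespace(shape=(2400,))
scalar_audio = SimpleNamespace(shape=())
```

The broad `except Exception` trades precision for robustness; handling the 0-dim case explicitly before indexing would be a narrower alternative.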

linyueqian · 2026-02-10T00:16:29Z

Also cc #1289 which fixes the same metrics bug. Would be good to merge this test PR so future updates like this get caught in CI.

Gaohan123

LGTM. Thanks!

Add e2e tests for Qwen3-TTS to prevent regression in audio output

27db78d

Signed-off-by: linyueqian <linyueqian@outlook.com>

hsliuustc0106 requested a review from Copilot February 5, 2026 02:06

Copilot started reviewing on behalf of hsliuustc0106 February 5, 2026 02:06

Copilot AI reviewed Feb 5, 2026

tests/e2e/online_serving/test_qwen3_tts.py (outdated, resolved)

tests/e2e/online_serving/test_qwen3_tts.py (outdated, resolved)

linyueqian requested a review from hsliuustc0106 as a code owner February 5, 2026 02:24

Add Speech API documentation and fix e2e test fixtures

79677e2

Signed-off-by: linyueqian <linyueqian@outlook.com>

linyueqian force-pushed the test/qwen3-tts-e2e branch from a3d7d9b to 79677e2 on February 5, 2026 02:25

linyueqian mentioned this pull request Feb 5, 2026

feat(tts): add voice upload API for Qwen3-TTS #1201

Open

hsliuustc0106 reviewed Feb 5, 2026

docs/serving/speech_api.md (resolved)

hsliuustc0106 reviewed Feb 5, 2026

docs/serving/speech_api.md (outdated, resolved)

tests/e2e/online_serving/test_qwen3_tts.py (resolved)

linyueqian and others added 3 commits February 5, 2026 08:14

Merge branch 'main' into test/qwen3-tts-e2e

e31a8e8

Update port from 8000 to 8091 in Speech API documentation

7207e5b

Signed-off-by: linyueqian <linyueqian@outlook.com>

Add Qwen3-TTS e2e test to Buildkite pipeline

555342f

Signed-off-by: linyueqian <linyueqian@outlook.com>

This was referenced Feb 5, 2026

[RFC]: Qwen3-TTS Production Ready - February Milestone #938

Open

[Doc] Update Qwen3-TTS docs for consistency with Omni examples #1226

Merged

hsliuustc0106 approved these changes Feb 5, 2026

tests/e2e/online_serving/test_qwen3_tts.py (resolved)

Merge branch 'main' into test/qwen3-tts-e2e

a92c32b

Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>

hsliuustc0106 added the ready label to trigger buildkite CI label Feb 5, 2026

Format server commands with one argument per line

72a9c36

Signed-off-by: linyueqian <linyueqian@outlook.com>