[Test] Add e2e tests for Qwen3-TTS speech endpoint#1206
Gaohan123 merged 20 commits into vllm-project:main
Signed-off-by: linyueqian <linyueqian@outlook.com>
Could you please also add the API server endpoint to the docs?
Pull request overview
This PR adds comprehensive e2e tests for the Qwen3-TTS model's /v1/audio/speech endpoint. The tests were created in response to bug #1159, where unit tests with mocks didn't catch real behavior issues. The new tests verify actual model inference without mocks.
Changes:
- Adds e2e tests for Qwen3-TTS CustomVoice and VoiceDesign models
- Tests /v1/audio/speech, /v1/audio/voices, and /v1/models endpoints
- Includes regression test for multimodal_output bug that caused "TTS model did not produce audio output" error
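To make the "no mocks" idea concrete, here is a minimal sketch of how a test client might call the speech endpoint on a running server. The payload field names (`model`, `input`, `voice`, `response_format`) follow the OpenAI-style speech API and are assumptions, as are the helper names `build_speech_request` and `fetch_speech`; none of this is taken from the PR's actual test file.

```python
import json
import urllib.request


def build_speech_request(base_url: str, model: str, text: str, voice: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for /v1/audio/speech."""
    # Assumption: OpenAI-style field names; the real schema may differ.
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": "wav",
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_speech(request: urllib.request.Request, timeout: float = 120.0) -> bytes:
    """Send the request to a live server and return the raw response body."""
    # urlopen raises urllib.error.HTTPError for non-2xx status codes.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Keeping request construction separate from the network call lets the payload be checked without a server, while `fetch_speech` is the part that exercises real inference.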
Signed-off-by: linyueqian <linyueqian@outlook.com>
a3d7d9b to 79677e2
OK. I have added that doc.
Signed-off-by: linyueqian <linyueqian@outlook.com>
Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>
Signed-off-by: linyueqian <linyueqian@outlook.com>
Thank you! I have added them.
fix ci please
Need to merge #1203 first. The test catches that bug, so it's expected to fail without it.
merged, fix ci please
Signed-off-by: linyueqian <linyueqian@outlook.com>
@hsliuustc0106 Should be fixed now.
We plan to set the running time (timeout_in_minutes) of the daily CI to 10 minutes. Could we keep only the core code changes in this PR to reduce CI runtime, and move the remaining changes to the nightly CI (.buildkite/test-nightly.yaml)?
Signed-off-by: linyueqian <linyueqian@outlook.com>
The full test suite only takes ~6 min, so it should fit within the 10 min limit. I've reduced the timeout accordingly. Happy to split if you still think it's needed, though.
Hi, I noticed that you removed the markers to pass the CI test. The error was already fixed when #577 was merged. You can add those markers back and try again : )
Signed-off-by: linyueqian <linyueqian@outlook.com>
@congw729 Added back. @hsliuustc0106 CI is passing now. One thing I ran into:
Also cc #1289, which fixes the same metrics bug. It would be good to merge this test PR so future updates like this get caught in CI.
Signed-off-by: linyueqian <linyueqian@outlook.com> Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>
Purpose
Add e2e tests for the Qwen3-TTS /v1/audio/speech endpoint. The existing unit tests used mocks that didn't match real behavior, allowing bugs like #1159 to slip through undetected. Also add documentation for TTS.
Test Plan
Test Result
Verified the test correctly catches the multimodal_output bug:
Without fix (main branch):
FAILED tests/e2e/online_serving/test_qwen3_tts.py::TestQwen3TTSCustomVoice::test_speech_english_basic
Server returns {"error":{"message":"TTS model did not produce audio output."}} instead of WAV audio.
With fix:
PASSED - Valid WAV audio returned (87KB, 24000Hz mono)
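The two outcomes above (a JSON error body versus valid WAV audio) can be told apart with a small check on the response bytes. The following is a rough sketch using only the standard library; the function name `describe_speech_response` and the returned dictionary shape are hypothetical and are not taken from the PR's test file.

```python
import io
import json
import wave


def describe_speech_response(body: bytes) -> dict:
    """Classify a response body as WAV audio, a JSON error, or unknown."""
    # A WAV file starts with a RIFF header whose form type is WAVE.
    if body[:4] == b"RIFF" and body[8:12] == b"WAVE":
        with wave.open(io.BytesIO(body), "rb") as wav:
            return {
                "kind": "audio",
                "sample_rate": wav.getframerate(),
                "channels": wav.getnchannels(),
                "frames": wav.getnframes(),
            }
    try:
        parsed = json.loads(body.decode("utf-8"))
    except (UnicodeDecodeError, json.JSONDecodeError):
        return {"kind": "unknown"}
    # Error shape as quoted above: {"error": {"message": "..."}}
    if isinstance(parsed, dict) and isinstance(parsed.get("error"), dict):
        return {"kind": "error", "message": parsed["error"].get("message")}
    return {"kind": "unknown"}
```

A regression test along these lines would assert that `kind` is `"audio"` with a sample rate of 24000 and one channel, so the "TTS model did not produce audio output." case fails loudly instead of passing on a mocked response.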