You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This directory contains an offline demo for running Qwen3 TTS models with vLLM Omni. It builds task-specific inputs and generates WAV files locally.
7
+
8
+
## Model Overview
9
+
10
+
Qwen3 TTS provides multiple task variants for speech generation:
11
+
12
+
-**CustomVoice**: Generate speech with a known speaker identity (speaker ID) and optional instruction.
13
+
-**VoiceDesign**: Generate speech from text plus a descriptive instruction that designs a new voice.
14
+
-**Base**: Voice cloning using a reference audio + reference transcript, with optional mode selection.
15
+
16
+
## Setup
17
+
Please refer to the [stage configuration documentation](https://docs.vllm.ai/projects/vllm-omni/en/latest/configuration/stage_configs/) to configure memory allocation appropriately for your hardware setup.
18
+
19
+
## Quick Start
20
+
21
+
Run a single sample for a task:
22
+
23
+
```
24
+
python end2end.py --query-type CustomVoice
25
+
```
26
+
27
+
Generated audio files are saved to `output_audio/` by default.
This directory contains an offline demo for running Qwen3 TTS models with vLLM Omni. It builds task-specific inputs and generates WAV files locally.
4
+
5
+
## Model Overview
6
+
7
+
Qwen3 TTS provides multiple task variants for speech generation:
8
+
9
+
-**CustomVoice**: Generate speech with a known speaker identity (speaker ID) and optional instruction.
10
+
-**VoiceDesign**: Generate speech from text plus a descriptive instruction that designs a new voice.
11
+
-**Base**: Voice cloning using a reference audio + reference transcript, with optional mode selection.
12
+
13
+
## Setup
14
+
Please refer to the [stage configuration documentation](https://docs.vllm.ai/projects/vllm-omni/en/latest/configuration/stage_configs/) to configure memory allocation appropriately for your hardware setup.
15
+
16
+
## Quick Start
17
+
18
+
Run a single sample for a task:
19
+
20
+
```
21
+
python end2end.py --query-type CustomVoice
22
+
```
23
+
24
+
Generated audio files are saved to `output_audio/` by default.
0 commit comments