Description
Motivation.
vllm-omni does not yet support the index-tts2 model. Community users have asked for this integration, so this RFC is submitted to define its scope and clarify the timeline.
Proposed Change.
Integrate the index-tts2 model into the vllm-omni framework so that users can run text-to-speech (TTS) inference natively within vllm-omni and combine it with existing large language model (LLM) inference workflows.
Expected Feature Scope
- Support batched text input for parallel TTS inference.
- Provide configurable parameters (e.g., speech speed, pitch, speaker voice) for customization.
- Match the inference performance of the standalone index-tts2 model (low latency, high throughput).
- Add usage examples and API documentation to the official vllm-omni documentation.
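
To make the first two scope items concrete, here is a minimal sketch of what a batched request with configurable parameters could look like. Everything in it is an assumption for discussion: `TTSParams`, `build_batch`, the field names, and the value ranges are hypothetical and are not part of vllm-omni or index-tts2.

```python
from dataclasses import asdict, dataclass
from typing import List


@dataclass(frozen=True)
class TTSParams:
    """Hypothetical per-request options; names and ranges are assumptions."""

    speed: float = 1.0        # speech-rate multiplier, 1.0 = unchanged
    pitch: float = 0.0        # pitch shift in semitones, 0.0 = unchanged
    speaker: str = "default"  # speaker voice identifier

    def __post_init__(self) -> None:
        # Reject out-of-range values early, before any inference is scheduled.
        if not 0.5 <= self.speed <= 2.0:
            raise ValueError(f"speed must be in [0.5, 2.0], got {self.speed}")
        if not -12.0 <= self.pitch <= 12.0:
            raise ValueError(f"pitch must be in [-12, 12], got {self.pitch}")


def build_batch(texts: List[str], params: TTSParams) -> List[dict]:
    """Pair each non-empty text with the shared parameters, one dict per request."""
    if not texts:
        raise ValueError("texts must not be empty")
    batch = []
    for index, text in enumerate(texts):
        if not text.strip():
            raise ValueError(f"text at index {index} is empty")
        batch.append({"request_id": index, "text": text, **asdict(params)})
    return batch


if __name__ == "__main__":
    requests = build_batch(
        ["Hello from vllm-omni.", "This is a second sentence."],
        TTSParams(speed=1.2, speaker="alice"),
    )
    for request in requests:
        print(request)
```

Validating parameters when the request is built, rather than inside the model runner, would keep malformed inputs out of a batch that is otherwise ready to run. Whether parameters should be shared across a batch or set per text is an open design question for this RFC.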
Feedback Period.
No response
CC List.
No response
Any Other Things.
No response