-
Notifications
You must be signed in to change notification settings - Fork 45
Open
Description
everything is working like a charm, great job! Except, when I use xtts_v2 for Chinese voice synthesis, it's slower than with English. I've installed DeepSpeed, but I'm not sure if it's working cause it's still slow. Also, I saw in the docs that "activating DeepSpeed replaces the TTS class with a custom one, FakeTTSWithRawXTTS2. It's like a simple wrapper around the raw XTTS v2.0 with the same API, and it's supposed to enable free voice cloning. ". How do I set that up? Could you maybe write up some instructions for it? By the way, setting streaming_enabled : True didn't seem to help.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels