Skip to content

Commit c1e5841

Browse files
committed
support for voice cloning is done (+2 squashed commit)
Squashed commit: [e730162] support for voice cloning is done [1653c57] wip adding voice cloning
1 parent a66d0f7 commit c1e5841

27 files changed

+7031
-8
lines changed

examples/outetts/readme.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,23 @@
1+
## KoboldCpp now supports OuteTTS Voice Cloning
2+
3+
However, it can be slightly challenging to set it up.
4+
- The easiest way to get started is to use already pre-cloned voices, which you can find within the [speakers directory](speakers).
5+
- Simply download the .json speaker file, open KoboldCpp with an OuteTTS model and WavTokenizer loaded.
6+
- Paste the voice JSON into Kobold Lite UI's Settings > Media > TTS > voiceclone option.
7+
8+
![Preview](vc1.png)
9+
![Preview](vc2.png)
10+
11+
- If you don't know how to get the OuteTTS and WavTokenizer models, [please read the wiki](https://github.com/LostRuins/koboldcpp/wiki#getting-an-ai-model-file).
12+
- Once configured, you are good to go. Narrations will be done with that selected cloned speaker.
13+
14+
### Cloning your own speaker
15+
You can also create your own cloned speaker voice.
16+
To do that, you will need Python 3.10+ with the `outetts` package installed. You can install it with this commands:
17+
```
18+
pip install outetts --no-deps
19+
pip install scipy einops pyyaml huggingface-hub encodec matplotlib transformers pytorch-lightning tensorboardX soundfile numpy jsonargparse torchcrepe librosa inflect loguru polars natsort tqdm sounddevice mecab-python3 unidic-lite uroman openai-whisper pygame
20+
```
21+
Then after that, obtain a wav file containing a sample of the voice you want to clone.
22+
Modify the [voice_cloning.py](voice_cloning.py) script with your input audio file, and let it generate the JSON.
23+
Finally, use the JSON file in the same way as the above with the precloned voices.

0 commit comments

Comments
 (0)