Skip to content

Commit 2bfcae4

Browse files
committed
update pre-trained models with audio demos
1 parent 9fa5c4e commit 2bfcae4

8 files changed

+7
-5
lines changed

README.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -24,11 +24,11 @@ pip3 install -r requirements.txt
2424

2525
## Inference
2626

27-
You have to download the [pretrained models]() and put them in ``output/ckpt/LJSpeech/``.
27+
You have to download the [pretrained models](https://drive.google.com/drive/folders/1BBuaoSlInwFoUt1PKLxo0Sjl5qWCq945?usp=sharing) and put them in ``output/ckpt/LJSpeech/``.
2828

2929
For English single-speaker TTS, run
3030
```
31-
python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 900000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
31+
python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 160000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
3232
```
3333
The generated utterances will be put in ``output/result/``.
3434

@@ -37,7 +37,7 @@ The generated utterances will be put in ``output/result/``.
3737
Batch inference is also supported, try
3838

3939
```
40-
python3 synthesize.py --source preprocessed_data/LJSpeech/val.txt --restore_step 900000 --mode batch -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
40+
python3 synthesize.py --source preprocessed_data/LJSpeech/val.txt --restore_step 160000 --mode batch -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
4141
```
4242
to synthesize all utterances in ``preprocessed_data/LJSpeech/val.txt``
4343

@@ -46,7 +46,7 @@ The pitch/volume/speaking rate of the synthesized utterances can be controlled b
4646
For example, one can increase the speaking rate by 20 % and decrease the volume by 20 % by
4747

4848
```
49-
python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 900000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml --duration_control 0.8 --energy_control 0.8
49+
python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 160000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml --duration_control 0.8 --energy_control 0.8
5050
```
5151

5252
# Training
@@ -107,7 +107,9 @@ tensorboard --logdir output/log/LJSpeech
107107
to serve TensorBoard on your localhost.
108108
The loss curves, synthesized mel-spectrograms, and audios are shown.
109109

110-
110+
![](./img/tensorboard_loss.png)
111+
![](./img/tensorboard_spec.png)
112+
![](./img/tensorboard_audio.png)
111113

112114
# Implementation Issues
113115

169 KB
Loading
Loading

img/tensorboard_audio.png

385 KB
Loading

img/tensorboard_loss.png

458 KB
Loading

img/tensorboard_spec.png

1.85 MB
Loading

0 commit comments

Comments
 (0)