Skip to content

Commit 0202b8c

Browse files
authored
Merge pull request #1452 from yt605155624/add_vc0_readme
[doc]update readme for aishell3_vc0
2 parents 2bf3b3a + b5a7c2d commit 0202b8c

File tree

3 files changed

+24
-2
lines changed

3 files changed

+24
-2
lines changed

docs/source/released_model.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,7 @@ WaveRNN | CSMSC |[WaveRNN-csmsc](https://github.com/PaddlePaddle/PaddleSpeech/tr
6161
Model Type | Dataset| Example Link | Pretrained Models
6262
:-------------:| :------------:| :-----: | :-----:
6363
GE2E| AISHELL-3, etc. |[ge2e](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/other/ge2e)|[ge2e_ckpt_0.3.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/ge2e/ge2e_ckpt_0.3.zip)
64-
GE2E + Tactron2| AISHELL-3 |[ge2e-tactron2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc0)|[tacotron2_aishell3_ckpt_0.3.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_aishell3_ckpt_0.3.zip)
64+
GE2E + Tactron2| AISHELL-3 |[ge2e-tactron2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc0)|[tacotron2_aishell3_ckpt_vc0_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_aishell3_ckpt_vc0_0.2.0.zip)
6565
GE2E + FastSpeech2 | AISHELL-3 |[ge2e-fastspeech2-aishell3](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc1)|[fastspeech2_nosil_aishell3_vc1_ckpt_0.5.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/fastspeech2/fastspeech2_nosil_aishell3_vc1_ckpt_0.5.zip)
6666

6767

examples/aishell3/vc0/README.md

Lines changed: 22 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -116,3 +116,25 @@ ref_audio
116116
```bash
117117
CUDA_VISIBLE_DEVICES=${gpus} ./local/voice_cloning.sh ${conf_path} ${train_output_path} ${ckpt_name} ${ge2e_params_path} ${ref_audio_dir}
118118
```
119+
120+
## Pretrained Model
121+
[tacotron2_aishell3_ckpt_vc0_0.2.0.zip](https://paddlespeech.bj.bcebos.com/Parakeet/released_models/tacotron2/tacotron2_aishell3_ckpt_vc0_0.2.0.zip)
122+
123+
124+
Model | Step | eval/loss | eval/l1_loss | eval/mse_loss | eval/bce_loss| eval/attn_loss
125+
:-------------:| :------------:| :-----: | :-----: | :--------: |:--------:|:---------:
126+
default| 2(gpu) x 37596|0.58704|0.39623|0.15073|0.039|1.9981e-04|
127+
128+
Tacotron2 checkpoint contains files listed below.
129+
(There is no need for `speaker_id_map.txt` here )
130+
131+
```text
132+
tacotron2_aishell3_ckpt_vc0_0.2.0
133+
├── default.yaml # default config used to train tacotron2
134+
├── phone_id_map.txt # phone vocabulary file when training tacotron2
135+
├── snapshot_iter_37596.pdz # model parameters and optimizer states
136+
└── speech_stats.npy # statistics used to normalize spectrogram when training tacotron2
137+
```
138+
139+
## More
140+
We strongly recommend that you use [FastSpeech2 + AISHELL-3 Voice Cloning](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/aishell3/vc1) which works better.

examples/aishell3/vc0/conf/default.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,7 @@ optimizer:
7777
###########################################################
7878
# TRAINING SETTING #
7979
###########################################################
80-
max_epoch: 200
80+
max_epoch: 100
8181
num_snapshots: 5
8282

8383
###########################################################

0 commit comments

Comments
 (0)