Skip to content

Commit 43d5667

Browse files
committed
update readme
1 parent 0664ec0 commit 43d5667

File tree

1 file changed

+23
-0
lines changed

1 file changed

+23
-0
lines changed

README.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,6 +184,29 @@ After preprocessing, the structure of the project folder should be:
184184

185185
We use suffix (`ids`, `raw-feats`, `raw-energy`, `raw-f0`, `norm-feats` and `wave`) for each type of input.
186186

187+
### Preprocessing Chinese Dataset
188+
please download the open dataset from [Data-Baker](https://weixinxcxdb.oss-cn-beijing.aliyuncs.com/gwYinPinKu/BZNSYP.rar), and extract data like this:
189+
```
190+
.
191+
├── PhoneLabeling
192+
│ ├── 000001.interval
193+
│ ├── ...
194+
│ └── 010000.interval
195+
├── ProsodyLabeling
196+
│ └── 000001-010000.txt
197+
└── Wave
198+
├── 000001.wav
199+
├── ...
200+
└── 010000.wav
201+
```
202+
203+
after install tensorflowtts, you can process data like this:
204+
```shell
205+
tensorflow-tts-preprocess --dataset baker --rootdir ./baker --outdir ./dump --config ./preprocess/baker_preprocess.yaml
206+
tensorflow-tts-normalize --rootdir ./dump --outdir ./dump --config ./preprocess/baker_preprocess.yaml --dataset baker
207+
```
208+
209+
187210
**IMPORTANT NOTES**:
188211
- This preprocessing step is based on [ESPnet](https://github.com/espnet/espnet) so you can combine all models here with other models from ESPnet repository.
189212

0 commit comments

Comments
 (0)