File tree Expand file tree Collapse file tree 1 file changed +9
-1
lines changed Expand file tree Collapse file tree 1 file changed +9
-1
lines changed Original file line number Diff line number Diff line change @@ -39,7 +39,15 @@ is a convenience copy.
3939
4040To train an RNNoise model, you need both clean speech data, and noise data.
4141Both need to be sampled at 48 kHz, in 16-bit PCM format (machine endian).
42- Clean speech data can be obtained from https://media.xiph.org/rnnoise/data/tts_speech_48k.sw
42+ Clean speech data can be obtained from
43+ https://media.xiph.org/rnnoise/data/tts_speech_48k.sw
44+ For noise data, we suggest concatenating the 48 kHz noise data from DEMAND at
45+ https://zenodo.org/records/1227121
46+ with contrib_noise.sw and synthetic_noise.sw noise files from
47+ https://media.xiph.org/rnnoise/data/
48+ To balance out the data, we recommend using multiple (e.g. 5) copies of the
49+ contrib_noise.sw and synthetic_noise.sw noise files.
50+
4351The first step is to take the speech and noise, and mix them in a variety of ways
4452to simulate real life conditions (including pauses, filtering and more).
4553Assuming the files are called speech.pcm and noise.pcm, start by generating
You can’t perform that action at this time.
0 commit comments