Commit 37cf35f

Noise data
1 parent 0ef02af commit 37cf35f

1 file changed, 9 additions and 1 deletion

README

Lines changed: 9 additions & 1 deletion
@@ -39,7 +39,15 @@ is a convenience copy.
 
 To train an RNNoise model, you need both clean speech data, and noise data.
 Both need to be sampled at 48 kHz, in 16-bit PCM format (machine endian).
-Clean speech data can be obtained from https://media.xiph.org/rnnoise/data/tts_speech_48k.sw
+Clean speech data can be obtained from
+https://media.xiph.org/rnnoise/data/tts_speech_48k.sw
+For noise data, we suggest concatenating the 48 kHz noise data from DEMAND at
+https://zenodo.org/records/1227121
+with contrib_noise.sw and synthetic_noise.sw noise files from
+https://media.xiph.org/rnnoise/data/
+To balance out the data, we recommend using multiple (e.g. 5) copies of the
+contrib_noise.sw and synthetic_noise.sw noise files.
+
 The first step is to take the speech and noise, and mix them in a variety of ways
 to simulate real life conditions (including pauses, filtering and more).
 Assuming the files are called speech.pcm and noise.pcm, start by generating
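
The concatenation suggested in the added README lines could be scripted roughly as follows. This is a minimal sketch, not part of the commit: the function name concatenate_noise and its parameters are hypothetical, and it assumes every input is already headerless 16-bit PCM at 48 kHz (so the DEMAND recordings would need to be converted to raw PCM first). Because raw PCM has no header, joining files is a plain byte-level append.

```python
import shutil
from pathlib import Path


def concatenate_noise(output, once, repeated, copies=5):
    """Concatenate headerless PCM files into a single noise file.

    Files listed in `once` are written a single time; files listed in
    `repeated` are written `copies` times each to balance the data.
    Returns the size of the output file in bytes.
    """
    # Hypothetical helper: expand the repeated files into the full source list.
    sources = list(once) + [path for path in repeated for _ in range(copies)]
    with open(output, "wb") as out:
        for path in sources:
            with open(path, "rb") as src:
                # Stream the bytes so large recordings are never fully in memory.
                shutil.copyfileobj(src, out)
    return Path(output).stat().st_size
```

A call such as concatenate_noise("noise.pcm", ["demand_48k.sw"], ["contrib_noise.sw", "synthetic_noise.sw"]) would then produce the noise.pcm file used in the next step, where demand_48k.sw is an assumed name for the converted DEMAND data.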
