You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/dnn_beamformer/README.md
+5-1Lines changed: 5 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,11 @@
2
2
3
3
This directory contains sample implementations of training and evaluation pipelines for an DNN Beamforming model.
4
4
5
-
The `DNNBeamformer` model composes the following componenst:
5
+
> [!NOTE]
6
+
> The code uses the `torchaudio.info` function which was removed from the library in version 2.9. As such, this code will
7
+
> not run as written without modification. It is preserved for reference purposes only.
8
+
9
+
The `DNNBeamformer` model composes the following components:
6
10
7
11
+[`torchaudio.transforms.Spectrogram`](https://pytorch.org/audio/stable/generated/torchaudio.transforms.Spectrogram.html#spectrogram) that applies Short-time Fourier Transform (STFT) to the waveform.
8
12
+ ConvTasNet without encoder/decoder that predicts T-F masks for speech and noise, respectively.
Copy file name to clipboardExpand all lines: examples/hubert/README.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,6 +2,10 @@
2
2
3
3
This directory contains sample implementations of pre-training pipeline for [HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units](https://arxiv.org/abs/2106.07447).
4
4
5
+
> [!NOTE]
6
+
> The code uses the `torchaudio.info` function which was removed from the library in version 2.9. As such, this code will
7
+
> not run as written without modification. It is preserved for reference purposes only.
8
+
5
9
## Pre-training Usage
6
10
7
11
The Base architecture of HuBERT model requires two iterations of pre-training.
0 commit comments