Pyannote.audio toolkit with MFCC #1680
sumansamui started this conversation in Development
I have a few questions:
How can pyannote.audio be set up to use MFCCs, i.e., to run the speech segmentation model with MFCC features as input? Is a pre-trained model available for that setting, or do we have to train from scratch?
What is the impact of the sampling frequency on SincNet? I understand that all input audio is resampled (downsampled or upsampled) to 16 kHz.
We observed that pyannote produces the same result for the 8 kHz and 16 kHz versions of a WAV file with the SincNet architecture. Is this because the same number of sinc filters cover the low-frequency range in both cases?
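To make the last question more concrete, here is a rough sketch of how many band-pass filters would sit below 4 kHz (the Nyquist frequency of an 8 kHz file) if the filters were spaced evenly on the mel scale up to 8 kHz. The filter count (80), the 30 Hz lower edge, and the mel spacing are assumptions borrowed from the usual SincNet initialization, not values read from pyannote.audio, and trained filters may have moved away from their initial positions. The helper names are hypothetical. It does not show that the two outputs must be identical; it only estimates how much of the filterbank an 8 kHz recording can excite after upsampling to 16 kHz.

```python
import math


def hz_to_mel(hz):
    """Convert a frequency in Hz to the mel scale (common 2595*log10 formula)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_spaced_edges(n_filters, low_hz, high_hz):
    """Return n_filters + 1 band edges (in Hz), evenly spaced on the mel scale."""
    low_mel, high_mel = hz_to_mel(low_hz), hz_to_mel(high_hz)
    step = (high_mel - low_mel) / n_filters
    return [mel_to_hz(low_mel + i * step) for i in range(n_filters + 1)]


# Assumed values for illustration only (not taken from pyannote.audio).
N_FILTERS = 80
MODEL_SAMPLE_RATE = 16000   # rate the audio is resampled to
LOW_HZ = 30.0               # assumed lowest band edge

edges = mel_spaced_edges(N_FILTERS, LOW_HZ, MODEL_SAMPLE_RATE / 2)

# An 8 kHz recording carries no content above 4 kHz, even after upsampling.
nyquist_8k = 8000 / 2

# Count filters whose upper edge is at or below 4 kHz.
n_below = sum(1 for upper in edges[1:] if upper <= nyquist_8k)

print(f"{n_below} of {N_FILTERS} assumed filters lie entirely below {nyquist_8k:.0f} Hz")
```

Under these assumptions, most of the filters fall below 4 kHz because mel spacing is dense at low frequencies, which may be part of why the 8 kHz and 16 kHz versions behave so similarly.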