Speaker-segmentation pipeline source code #1804
Unanswered
Spectra456 asked this question in Q&A
Replies: 0 comments
Hi, I'm trying to obtain Voice Activity Detection (VAD) results combined with Speaker Change Detection. The pyannote/speaker-segmentation pipeline seems to be the best fit for my purposes, but I don't fully understand what happens inside it, because I couldn't find the relevant source code in the PyAnnote repository. Could you help me understand it better?
I tried replicating the results by following this tutorial: https://herve.niderb.fr/fastpages/2022/10/23/One-speaker-segmentation-model-to-rule-them-all.html, but my results were very different, even with the same hyperparameters. Thanks a lot for your help!
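To make the goal concrete, here is a minimal sketch of the kind of output meant by "VAD combined with Speaker Change Detection". It is not the pipeline's actual implementation and does not call pyannote at all. It assumes per-frame, per-speaker activity scores in [0, 1] are already available as plain Python lists, and it assumes a simple onset/offset hysteresis threshold for binarization. The names `binarize` and `vad_and_changes` are hypothetical and used only for illustration.

```python
from typing import List, Tuple


def binarize(scores: List[float], onset: float = 0.5, offset: float = 0.5) -> List[bool]:
    """Hysteresis thresholding (assumed scheme): a speaker turns on when the
    score rises above `onset` and stays on until it drops below `offset`."""
    active = False
    out = []
    for score in scores:
        if active:
            active = score >= offset
        else:
            active = score > onset
        out.append(active)
    return out


def vad_and_changes(
    scores_per_speaker: List[List[float]],
    onset: float = 0.5,
    offset: float = 0.5,
) -> Tuple[List[bool], List[int]]:
    """Return (vad, changes) from per-speaker frame scores.

    vad[i] is True when at least one speaker is active in frame i.
    changes lists frame indices where the set of active speakers differs
    from the previous frame while speech is present on both sides, so
    plain speech/non-speech boundaries are not counted as speaker changes.
    """
    tracks = [binarize(track, onset, offset) for track in scores_per_speaker]
    num_frames = len(tracks[0]) if tracks else 0

    # Set of active speaker indices for every frame.
    active_sets = [
        frozenset(k for k, track in enumerate(tracks) if track[i])
        for i in range(num_frames)
    ]

    vad = [bool(speakers) for speakers in active_sets]
    changes = [
        i
        for i in range(1, num_frames)
        if active_sets[i] and active_sets[i - 1] and active_sets[i] != active_sets[i - 1]
    ]
    return vad, changes


# Toy scores (made up for illustration): speaker 0 talks first, then speaker 1.
toy_scores = [
    [0.1, 0.9, 0.8, 0.2, 0.1, 0.1],
    [0.0, 0.1, 0.2, 0.9, 0.8, 0.1],
]
vad, changes = vad_and_changes(toy_scores)
print(vad)      # speech / non-speech decision per frame
print(changes)  # frame indices where the active speaker set changes
```

Converting frame indices to timestamps would additionally require the model's frame step, which is not assumed here.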