Speaker-segmentation pipeline source code #1804

Spectra456 · 2024-12-05T13:20:26Z

Spectra456
Dec 5, 2024

Hi, I'm trying to obtain Voice Activity Detection (VAD) results combined with Speaker Change Detection. I found that the pyannote/speaker-segmentation pipeline seems to be the best fit for my purposes. However, I don't fully understand what exactly happens inside this pipeline, as I couldn't find the relevant source code in the PyAnnote repository. Could you help me understand it better?

I tried replicating the results based on this tutorial https://herve.niderb.fr/fastpages/2022/10/23/One-speaker-segmentation-model-to-rule-them-all.html, but the outcomes were very different, even with the same hyperparameters. Thanks a lot for your help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Speaker-segmentation pipeline source code #1804

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

Speaker-segmentation pipeline source code #1804

Uh oh!

Uh oh!

Spectra456 Dec 5, 2024

Replies: 0 comments

Spectra456
Dec 5, 2024