Did you read this? This should answer most of your questions about this model.
-
pyannote for speaker diarization based on the following segmentation model:
End-to-end speaker segmentation for overlap-aware resegmentation
In the above paper, under "Implementation details", they wrote:
[i.e., 5 s audio chunks with a sampling rate of 16 kHz]
K_max-dimensional speaker activations between 0 and 1 every 16 ms.
How can I tell from each output where a new segment starts, how many segments there are, and how many speakers are present? An example would be very helpful.
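To make the question concrete, here is a minimal sketch of one way such an output could be read, assuming it arrives as a (num_frames, K_max) array of scores. It uses plain NumPy on synthetic data rather than any pyannote call. The 0.5 threshold, K_max = 4, the 312-frame chunk, and the helper names `active_speakers` and `to_segments` are all assumptions made for illustration, not values or functions taken from the paper or the library.

```python
import numpy as np

FRAME_STEP = 0.016  # seconds per output frame (the 16 ms quoted above)
THRESHOLD = 0.5     # assumed binarization threshold, not taken from the paper


def active_speakers(activations, threshold=THRESHOLD):
    """Indices of the speaker channels that cross the threshold at least once."""
    return np.flatnonzero((activations > threshold).any(axis=0)).tolist()


def to_segments(activations, threshold=THRESHOLD, frame_step=FRAME_STEP):
    """Turn a (num_frames, K_max) activation matrix into (speaker, start, end) tuples."""
    binary = activations > threshold
    segments = []
    for k in range(binary.shape[1]):
        # Pad with zeros so every run of active frames has a rising and a falling edge.
        padded = np.concatenate(([0], binary[:, k].astype(int), [0]))
        edges = np.diff(padded)
        starts = np.flatnonzero(edges == 1)
        ends = np.flatnonzero(edges == -1)
        for s, e in zip(starts, ends):
            segments.append(
                (k, round(float(s) * frame_step, 3), round(float(e) * frame_step, 3))
            )
    return segments


# Synthetic stand-in for one model output: 312 frames (about 5 s), K_max = 4 channels.
scores = np.zeros((312, 4))
scores[0:100, 0] = 0.9     # channel 0 is active from 0.0 s to 1.6 s
scores[80:200, 1] = 0.8    # channel 1 overlaps it, from 1.28 s to 3.2 s
scores[250:312, 0] = 0.95  # channel 0 is active again until the end of the chunk

speakers = active_speakers(scores)
segments = to_segments(scores)
print(speakers)   # [0, 1]
print(segments)   # [(0, 0.0, 1.6), (0, 4.0, 4.992), (1, 1.28, 3.2)]
```

Under these assumptions, the number of speakers in a chunk is the number of channels that ever cross the threshold, and a new segment starts wherever a channel goes from inactive to active. As far as I understand, the channel indices are only meaningful within one chunk, so matching speakers across chunks would need a separate step that is not shown here.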