Replies: 1 comment
-
Thanks @zhiyunfan for your feedback. Did you train a model directly for speaker change detection ? or did you train a speaker embedding model and then used it for speaker change detection? Also note that upcoming pyannote.audio 2.0 (in develop branch) uses batchnorm. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello, bredin, I have tried to use pyannote.audio 1.1.1 to do speaker change detection. And I used the tdnn module to downsample my features. In the pyannote 1.0, tdnn is followed by a weight norm. When I tried to replace the weight norm with a batch norm, I got an obviously gain. And as far as I know, batch norm is most used in speaker related tasks. So, I suggest that maybe you can try batch norm.
Beta Was this translation helpful? Give feedback.
All reactions