weight norm used in tdnn module #760

zhiyunfan · 2021-09-28T02:11:17Z

zhiyunfan
Sep 28, 2021

Hello, bredin, I have tried to use pyannote.audio 1.1.1 to do speaker change detection. And I used the tdnn module to downsample my features. In the pyannote 1.0, tdnn is followed by a weight norm. When I tried to replace the weight norm with a batch norm, I got an obviously gain. And as far as I know, batch norm is most used in speaker related tasks. So, I suggest that maybe you can try batch norm.

hbredin · 2021-09-28T19:01:33Z

hbredin
Sep 28, 2021
Maintainer

Thanks @zhiyunfan for your feedback.

Did you train a model directly for speaker change detection ? or did you train a speaker embedding model and then used it for speaker change detection?

Also note that upcoming pyannote.audio 2.0 (in develop branch) uses batchnorm.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

weight norm used in tdnn module #760

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

weight norm used in tdnn module #760

Uh oh!

zhiyunfan Sep 28, 2021

Replies: 1 comment

Uh oh!

hbredin Sep 28, 2021 Maintainer

zhiyunfan
Sep 28, 2021

hbredin
Sep 28, 2021
Maintainer