Visualize attention maps on input spectrograms #2177

mmerler · 2024-05-14T05:31:19Z

mmerler
May 14, 2024

Does anyone know how to visualize the encoder attention maps with respect to the input spectrograms?
I'm interested in understanding which portions of the spectrogram a whisper-base fine-tuned model is focusing on when making a prediction.
I can extract the attention maps in the forward pass, each is 1500x1500, but I don't know how to map them back to the input spectrogram.

Any ideas?

mmerler · 2024-05-14T22:38:54Z

mmerler
May 14, 2024
Author

basically the equivalent of Grad-Cam for audio with whisper?

0 replies

Coder1010ayush · 2024-12-16T12:17:08Z

Coder1010ayush
Dec 16, 2024

Any updates on this?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Visualize attention maps on input spectrograms #2177

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Visualize attention maps on input spectrograms #2177

Uh oh!

mmerler May 14, 2024

Replies: 2 comments

Uh oh!

mmerler May 14, 2024 Author

Uh oh!

Coder1010ayush Dec 16, 2024

mmerler
May 14, 2024

mmerler
May 14, 2024
Author

Coder1010ayush
Dec 16, 2024