Visualize attention maps on input spectrograms #2177

mmerler · 2024-05-14T05:31:19Z

mmerler
May 14, 2024

Does anyone know how to visualize the encoder attention maps with respect to the input spectrograms?
I'm interested in understanding which portions of the spectrogram a whisper-base fine-tuned model is focusing on when making a prediction.
I can extract the attention maps in the forward pass, each is 1500x1500, but I don't know how to map them back to the input spectrogram.

Any ideas?

mmerler · 2024-05-14T22:38:54Z

mmerler
May 14, 2024
Author

basically the equivalent of Grad-Cam for audio with whisper?

0 replies

Coder1010ayush · 2024-12-16T12:17:08Z

Coder1010ayush
Dec 16, 2024

Any updates on this?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Visualize attention maps on input spectrograms #2177

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Visualize attention maps on input spectrograms #2177

mmerler May 14, 2024

Replies: 2 comments

mmerler May 14, 2024 Author

Coder1010ayush Dec 16, 2024

mmerler
May 14, 2024

mmerler
May 14, 2024
Author

Coder1010ayush
Dec 16, 2024