Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition

Official implementation of the paper "Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition" (arXiv:2509.08454).

Abstract

Large pre-trained speech models such as Whisper offer strong generalization but pose significant challenges for resource-efficient adaptation. Low-Rank Adaptation (LoRA) has become a popular parameter-efficient fine-tuning method, yet its underlying mechanisms in speech tasks remain poorly understood. In this work, we conduct the first systematic mechanistic interpretability study of LoRA within the Whisper encoder for speech emotion recognition (SER). Using a suite of analytical tools, including layer contribution probing, logit-lens inspection, and representational similarity via singular value decomposition (SVD) and centered kernel alignment (CKA), we reveal two key mechanisms: a delayed specialization process that preserves general features in early layers before consolidating task-specific information, and a forward alignment, backward differentiation dynamic between LoRA’s matrices. Our findings clarify how LoRA reshapes encoder hierarchies, providing both empirical insights and a deeper mechanistic understanding for designing efficient and interpretable adaptation strategies in large speech models.
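For readers unfamiliar with the similarity measure named above, here is a minimal sketch of linear CKA between two sets of layer activations. It is an illustration only, not the code in analysis_utils.py: the function name `linear_cka` and the array shapes are assumptions, and the paper may use a different CKA variant.

```python
import numpy as np

def linear_cka(X, Y):
    """Linear CKA between activations X (n, d1) and Y (n, d2).

    Rows are the same n inputs passed through two layers or models.
    Hypothetical helper for illustration; not taken from analysis_utils.py.
    """
    # Center each feature so the measure ignores constant offsets.
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)
    # Squared Frobenius norm of the cross-covariance (unnormalized HSIC).
    cross = np.linalg.norm(Y.T @ X, ord="fro") ** 2
    # Normalize by the self-similarity of each representation.
    norm_x = np.linalg.norm(X.T @ X, ord="fro")
    norm_y = np.linalg.norm(Y.T @ Y, ord="fro")
    return cross / (norm_x * norm_y)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    acts_a = rng.normal(size=(64, 16))   # stand-in for one layer's activations
    acts_b = rng.normal(size=(64, 16))   # stand-in for another layer's activations
    print("CKA(a, a):", linear_cka(acts_a, acts_a))
    print("CKA(a, b):", linear_cka(acts_a, acts_b))
```

Linear CKA is invariant to orthogonal transformations and isotropic scaling of either representation, which is why it is often used to compare layers whose feature dimensions are not aligned.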

Quick Start

Installation

pip install -r requirements.txt

Training LoRA-adapted Whisper

python train_lora.py
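
As background for the training step, the following is a minimal sketch of the low-rank update that LoRA adds to a frozen linear layer. It does not reproduce train_lora.py; the function name `lora_forward`, the shapes, the rank, and the scaling value are all illustrative assumptions.

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    """Frozen linear layer plus a LoRA update: x W^T + (alpha / r) x A^T B^T.

    W: (d_out, d_in) frozen weight, A: (r, d_in), B: (d_out, r).
    Hypothetical helper for illustration; not the training script's code.
    """
    r = A.shape[0]
    frozen = x @ W.T                          # original, untrained path
    update = (alpha / r) * (x @ A.T) @ B.T    # low-rank trainable path
    return frozen + update

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d_in, d_out, r = 32, 32, 4                # assumed sizes, not from the paper
    W = rng.normal(size=(d_out, d_in))
    A = rng.normal(size=(r, d_in)) * 0.01     # A is typically small random
    B = np.zeros((d_out, r))                  # B typically starts at zero
    x = rng.normal(size=(2, d_in))
    # With B = 0 the adapted layer reproduces the frozen layer exactly.
    print(np.allclose(lora_forward(x, W, A, B, alpha=8.0), x @ W.T))
```

Only A and B are trained, so the number of trainable parameters per adapted layer is r * (d_in + d_out) rather than d_in * d_out.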

Running Mechanistic Analysis

jupyter notebook analysis.ipynb

Citation

@misc{ma2025behind,
      title={Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition}, 
      author={Yujian Ma and Jinqiu Sang and Ruizhe Li},
      year={2025},
      eprint={2509.08454},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2509.08454}, 
}
