Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim^1, Wonjun Kang^1, Yuchen Zeng^*2, Hyung Il Koo¹, Kangwook Lee²

¹ FuriosaAI, ² UW-Madison

Abstract: Deep State Space Models (SSMs), such as Mamba (Gu & Dao, 2024), have become powerful tools for language modeling, offering high performance and linear scalability with sequence length. However, the application of parameter-efficient fine-tuning (PEFT) methods to SSM-based models remains largely underexplored. We start by investigating two fundamental questions on existing PEFT methods: (i) How do they perform on SSM-based models? (ii) Which parameters should they target for optimal results? Our analysis shows that LoRA and its variants consistently outperform all other PEFT methods. While LoRA is effective for linear projection matrices, it fails on SSM modules—yet still outperforms other methods applicable to SSMs, indicating their limitations. This underscores the need for a specialized SSM tuning approach. To address this, we propose Sparse Dimension Tuning (SDT), a PEFT method tailored for SSM modules. Combining SDT for SSMs with LoRA for linear projection matrices, we achieve state-of-the-art performance across extensive experiments.

News 🚀

2025-05-01 Our paper has been accepted to ICML 2025! 🎉🎉🎉
2024-11-01 Our paper is selected for oral presentation (5 of 92 accepted papers) at NeurIPS 2024 Workshop FITML! 🎉🎉
2024-10-11 Our paper is available on arXiv!
2024-10-09 Our paper has been accepted to NeurIPS 2024 Workshop FITML! 🎉

Usage

PEFT implementation on S4: Refer to the S4 folder.
PEFT implementation on Mamba: Refer to the mamba-peft folder.

Citation

@article{galim2024parameter,
  title={Parameter-Efficient Fine-Tuning of State Space Models},
  author={Galim, Kevin and Kang, Wonjun and Zeng, Yuchen and Koo, Hyung Il and Lee, Kangwook},
  journal={arXiv preprint arXiv:2410.09016},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
S4		S4
mamba-peft		mamba-peft
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim^1, Wonjun Kang^1, Yuchen Zeng^*2, Hyung Il Koo¹, Kangwook Lee²

¹ FuriosaAI, ² UW-Madison

News 🚀

Usage

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Parameter-Efficient Fine-Tuning of State Space Models

Kevin Galim*1, Wonjun Kang*1, Yuchen Zeng*2, Hyung Il Koo1, Kangwook Lee2 1 FuriosaAI, 2 UW-Madison

News 🚀

Usage

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Kevin Galim^1, Wonjun Kang^1, Yuchen Zeng^*2, Hyung Il Koo¹, Kangwook Lee²

¹ FuriosaAI, ² UW-Madison

Packages