This repository contains the official implementation (in PyTorch) of the AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS submitted to the ICASSP 2024.
- The model is implemented in ./src/models/ast_models.py. You may refer to it on how to apply AAT to your model.
- The recipes are in
tasks/[esc50, speechcommands, speechcommandsv1, gtzan, openmic, urbansound8k]/run_xx.sh, when you runrun_xx.sh, it will call/src/run.py, which will then call/src/dataloader.pyand/src/traintest.py, which will then call/src/models/ast_models.py.
Thanks YuanGongND for providing such an amazing training pipline. When the paper is accepted, we will further complete this README.
If you have a question, please bring up an issue (preferred) or send me an email sealynndev@gmail.com.