AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS

Introduction

This repository contains the official implementation (in PyTorch) of the AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS submitted to the ICASSP 2024.

Key Files

The model is implemented in ./src/models/ast_models.py. You may refer to it on how to apply AAT to your model.
The recipes are in tasks/[esc50, speechcommands, speechcommandsv1, gtzan, openmic, urbansound8k]/run_xx.sh, when you run run_xx.sh, it will call /src/run.py, which will then call /src/dataloader.py and /src/traintest.py, which will then call /src/models/ast_models.py.

Acknowledgement

Thanks YuanGongND for providing such an amazing training pipline. When the paper is accepted, we will further complete this README.

Contact

If you have a question, please bring up an issue (preferred) or send me an email sealynndev@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
pretrained_models		pretrained_models
src		src
tasks		tasks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS

Introduction

Key Files

Acknowledgement

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

MichaelLynn1996/AAT

Folders and files

Latest commit

History

Repository files navigation

AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS

Introduction

Key Files

Acknowledgement

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages