SonicMaster

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Overview

Music recordings often suffer from audio quality issues such as excessive reverberation, distortion, clipping, tonal imbalances, and a narrowed stereo image, especially when created in non-professional settings without specialized equipment or expertise. These problems are typically corrected using separate specialized tools and manual adjustments. In this paper, we introduce SonicMaster, the first unified generative model for music restoration and mastering that addresses a broad spectrum of audio artifacts with text-based control. SonicMaster is conditioned on natural language instructions to apply targeted enhancements, or can operate in an automatic mode for general restoration.

Figure 1: SonicVerse architecture for music captioning with feature detection.

Key Features

🎵 Unified Restoration: All-In-One model to simultaneously handle reverb, clipping, EQ, dynamics, and stereo imbalances.
📝 Text-Based Control: Use natural-language instructions (e.g. “reduce reverb”) for fine-grained audio enhancement.
🚀 High-Quality Output: Objective metrics (FAD, SSIM, etc.) and listening tests show significant quality gains.
💾 SonicMaster Dataset: We release a large-scale dataset of 25k (208 hrs) paired clean and degraded music segments with natural-language prompts for training and evaluation.

Installation

To run SonicMaster, you should use python==3.13. Then, install the requirements and clone the repo.

pip install -r requirements_sonic.txt

Training

We trained SonicMaster with pytorch tensor files of our SonicMaster dataset -- for speed. For that, you would first want to pre-encode your audio:

accelerate launch preencode_latents_acce2.py

Then you can start training with the training script that loads pt files from a jsonl metadata file. The script also allows to turn on inference during training (after a certain number of epochs) to monitor your progress.

accelerate launch train_ptload_inference.py

Citation

If you use SonicMaster in your work, please cite our paper:

Jan Melechovsky, Ambuj Mehrish, Dorien Herremans. 2025. SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering. ArXiv:2508.03448

@article{melechovsky2025sonicmaster,
      title={SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering}, 
      author={Jan Melechovsky and Ambuj Mehrish and Dorien Herremans},
      year={2025},
      eprint={2508.03448},
      archivePrefix={arXiv},
      url={https://arxiv.org/abs/2508.03448}, 
}

Read the paper here: arXiv:2508.0338

Made with 🎸 by the AMAAI Lab | Singapore

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
Images		Images
configs		configs
dataset_scripts		dataset_scripts
evaluation		evaluation
samples		samples
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
index.html		index.html
infer_single.py		infer_single.py
inference_fullsong.py		inference_fullsong.py
inference_ptload_batch.py		inference_ptload_batch.py
model.py		model.py
preencode_latents_acce2.py		preencode_latents_acce2.py
requirements_sonic.txt		requirements_sonic.txt
train_ptload_inference.py		train_ptload_inference.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SonicMaster

Overview

Key Features

Installation

Training

Citation

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

License

AMAAI-Lab/SonicMaster

Folders and files

Latest commit

History

Repository files navigation

SonicMaster

Overview

Key Features

Installation

Training

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages