Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Created by
  • Haebom

Author

Jan Melechovsky, Ambuj Mehrish, Abhinaba Roy, Dorien Herremans

SonicMaster: A Unified Generative Model for Text-Based Music Restoration and Mastering

Outline

This paper presents SonicMaster, the first unified generative model that addresses various audio artifacts through text-based control to address common sound quality issues in music recordings produced without professional equipment or expertise, such as excessive reverberation, distortion, clipping, timbre imbalance, and narrow stereo imaging. SonicMaster applies specific enhancements based on natural language instructions, or operates in an automatic mode for general restoration. To train this model, the authors built the SonicMaster dataset, a large-scale dataset of degraded and high-quality tracks, by simulating common degradation types using 19 degradation functions belonging to five enhancement groups: equalization, dynamics, reverb, amplitude, and stereo. This approach utilizes a flow-matching generative training paradigm to learn audio transformations from degraded input to a cleaned and mastered version, guided by text prompts.

Takeaways, Limitations

Takeaways:
We present the first unified generative model for resolving audio artifacts via text-based control.
Provides effective solutions to various sound quality problems.
Demonstrate model performance through objective and subjective evaluations.
Limitations:
Specific information about Limitations is not provided in the abstract.
👍