Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

AI-Generated Song Detection via Lyrics Transcripts

Created by
  • Haebom

Author

Markus Frohmann, Elena V. Epure, Gabriel Meseguer-Brocal, Markus Schedl, Romain Hennequin

Outline

This paper is based on the growing need for accurate methods to detect AI-generated music due to the advancement of AI-based music generation tools. We point out that existing audio-based detection methods have difficulty in generalizing to new generators or noisy audio, and methods that use accurate and well-formed lyrics data also have limitations in practical applications. Therefore, this study proposes a novel method to detect AI-generated music by converting songs to speech using a general automatic speech recognition (ASR) model and then utilizing multiple detectors. Experimental results on lyrics of various genres and languages show that models using Whisper large-v2 and LLM2Vec embeddings perform well, and are more robust than existing audio-based methods to audio noise and various music generators. The code is available on GitHub.

Takeaways, Limitations

Takeaways:
We present the effectiveness of an AI-generated music detection method using ASR-based lyric transcription.
We experimentally demonstrate that it is more robust to noise and various generators than existing audio-based methods.
We evaluated the detection performance for multilingual and multi-genre music to enhance practicality.
Reproducibility and usability have been improved through open code.
Limitations:
Detection performance may be affected by the accuracy of the ASR model.
Generalization performance for new AI music generation models requires further research.
There may be bias toward certain genres or languages.
👍