AI-Generated Music Detection and its Challenges
Darius Afchar, Gabriel Meseguer-Brocal, Romain Hennequin
TL;DR
The paper addresses the problem of detecting AI-generated music and the risks it poses to artists and streaming platforms. It introduces a waveform-based detector that leverages artefacts from neural decoders by training on autoencoded real audio at the same bitrate, achieving high accuracy with $99.8\%$ on amplitude-spectrogram inputs and $99.9\%$ on unseen decoders like MusicGen. While the results are promising, the study reveals vulnerabilities to common audio manipulations and limited cross-decoder generalisation, underscoring the need for open evaluation, interpretability, and continual updates. The work advocates broader regulatory considerations, potential watermarking approaches, and future directions in adaptive defenses, forensics, and governance to ensure robust detection in real-world settings.
Abstract
In the face of a new era of generative models, the detection of artificially generated content has become a matter of utmost importance. In particular, the ability to create credible minute-long synthetic music in a few seconds on user-friendly platforms poses a real threat of fraud on streaming services and unfair competition to human artists. This paper demonstrates the possibility (and surprising ease) of training classifiers on datasets comprising real audio and artificial reconstructions, achieving a convincing accuracy of 99.8%. To our knowledge, this marks the first publication of a AI-music detector, a tool that will help in the regulation of synthetic media. Nevertheless, informed by decades of literature on forgery detection in other fields, we stress that getting a good test score is not the end of the story. We expose and discuss several facets that could be problematic with such a deployed detector: robustness to audio manipulation, generalisation to unseen models. This second part acts as a position for future research steps in the field and a caveat to a flourishing market of artificial content checkers.
