AI-Generated Music Detection and its Challenges

Darius Afchar; Gabriel Meseguer-Brocal; Romain Hennequin

AI-Generated Music Detection and its Challenges

Darius Afchar, Gabriel Meseguer-Brocal, Romain Hennequin

TL;DR

The paper addresses the problem of detecting AI-generated music and the risks it poses to artists and streaming platforms. It introduces a waveform-based detector that leverages artefacts from neural decoders by training on autoencoded real audio at the same bitrate, achieving high accuracy with $99.8\%$ on amplitude-spectrogram inputs and $99.9\%$ on unseen decoders like MusicGen. While the results are promising, the study reveals vulnerabilities to common audio manipulations and limited cross-decoder generalisation, underscoring the need for open evaluation, interpretability, and continual updates. The work advocates broader regulatory considerations, potential watermarking approaches, and future directions in adaptive defenses, forensics, and governance to ensure robust detection in real-world settings.

Abstract

In the face of a new era of generative models, the detection of artificially generated content has become a matter of utmost importance. In particular, the ability to create credible minute-long synthetic music in a few seconds on user-friendly platforms poses a real threat of fraud on streaming services and unfair competition to human artists. This paper demonstrates the possibility (and surprising ease) of training classifiers on datasets comprising real audio and artificial reconstructions, achieving a convincing accuracy of 99.8%. To our knowledge, this marks the first publication of a AI-music detector, a tool that will help in the regulation of synthetic media. Nevertheless, informed by decades of literature on forgery detection in other fields, we stress that getting a good test score is not the end of the story. We expose and discuss several facets that could be problematic with such a deployed detector: robustness to audio manipulation, generalisation to unseen models. This second part acts as a position for future research steps in the field and a caveat to a flourishing market of artificial content checkers.

AI-Generated Music Detection and its Challenges

TL;DR

on amplitude-spectrogram inputs and

on unseen decoders like MusicGen. While the results are promising, the study reveals vulnerabilities to common audio manipulations and limited cross-decoder generalisation, underscoring the need for open evaluation, interpretability, and continual updates. The work advocates broader regulatory considerations, potential watermarking approaches, and future directions in adaptive defenses, forensics, and governance to ensure robust detection in real-world settings.

Abstract

Paper Structure (11 sections, 3 tables)

This paper contains 11 sections, 3 tables.

Introduction
AI-music detection
Proposed framework
Considered dataset and music generators
First results of detection
Generalisation to full music generators
Caveats on AI-music detectors
Robustness to manipulations
Encoder generalisation
Challenges ahead
Conclusion

AI-Generated Music Detection and its Challenges

TL;DR

Abstract

AI-Generated Music Detection and its Challenges

Authors

TL;DR

Abstract

Table of Contents