Echoes: A semantically-aligned music deepfake detection dataset

๐Ÿ“… 2026-03-24
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿ“ Abstract
We introduce Echoes, a new dataset for music deepfake detection designed for training and benchmarking detectors under realistic and provider-diverse conditions. Echoes comprises 3,577 tracks (110 hours of audio) spanning multiple genres (pop, rock, electronic), and includes content generated by ten popular AI music generation systems. To prevent shortcut learning and promote robust generalization, the dataset is deliberately constructed to be challenging, enforcing semantic-level alignment between spoofed audio and bona fide references. This alignment is achieved by conditioning generated audio samples directly on bona-fide waveforms or song descriptors. We evaluate Echoes in a cross-dataset setting against three existing AI-generated music datasets using state-of-the-art Wav2Vec2 XLS-R 2B representations. Results show that (i) Echoes is the hardest in-domain dataset; (ii) detectors trained on existing datasets transfer poorly to Echoes; (iii) training on Echoes yields the strongest generalization performance. These findings suggest that provider diversity and semantic alignment help learn more transferable detection cues.
Problem

Research questions and friction points this paper is trying to address.

music deepfake detection
semantic alignment
dataset generalization
AI-generated music
spoof detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

semantic alignment
music deepfake detection
dataset design
generalization
AI-generated audio
๐Ÿ”Ž Similar Papers
No similar papers found.
O
Octavian Pascu
National University of Science and Technology POLITEHNICA Bucharest, Romania
D
Dan Oneata
National University of Science and Technology POLITEHNICA Bucharest, Romania
Horia Cucu
Horia Cucu
POLITEHNICA Bucharest
Speech and Language Processing
Nicolas M. Mรผller
Nicolas M. Mรผller
Fraunhofer AISEC
Machine LearningSpoof DetectionVoice Synthesis