🤖 AI Summary
This study addresses the challenge of automatically identifying singing errors in music education by proposing the first teaching-oriented singing error detection framework. Leveraging synchronously recorded audio from both teachers and students, the authors construct a dedicated dataset with a fine-grained error annotation scheme and develop a deep learning model for error recognition. Experimental results demonstrate that the proposed method significantly outperforms traditional rule-based baselines. A systematic analysis further reveals the impact of inter-teacher instructional variability on error detection performance. This work contributes a novel benchmark dataset, an evaluation methodology, and actionable pedagogical insights for intelligent music education systems.
📝 Abstract
The advancement of machine learning in audio analysis has opened new possibilities for technology-enhanced music education. This paper introduces a framework for automatic singing mistake detection in the context of music pedagogy, supported by a newly curated dataset. The dataset comprises synchronized teacher learner vocal recordings, with annotations marking different types of mistakes made by learners. Using this dataset, we develop different deep learning models for mistake detection and benchmark them. To compare the efficacy of mistake detection systems, a new evaluation methodology is proposed. Experiments indicate that the proposed learning-based methods are superior to rule-based methods. A systematic study of errors and a cross-teacher study reveal insights into music pedagogy that can be utilised for various music applications. This work sets out new directions of research in music pedagogy. The codes and dataset are publicly available.