Learning from Limited Labels: Transductive Graph Label Propagation for Indian Music Analysis

📅 2026-01-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of limited labeled data hindering supervised learning in Indian art music analysis by proposing a graph-based semi-supervised approach. Specifically, it constructs a similarity graph from audio embeddings and employs transductive label propagation to diffuse information from a small set of annotated samples to a large pool of unlabeled data, enabling effective raga identification and instrument classification. To the best of our knowledge, this is the first study to introduce graph-based label propagation into Indian music analysis. By integrating a multi-source data fusion strategy, the proposed method significantly outperforms conventional baselines, generates high-quality pseudo-labels, and substantially reduces reliance on costly expert annotations while maintaining strong performance.

Technology Category

Application Category

📝 Abstract
Supervised machine learning frameworks rely on extensive labeled datasets for robust performance on real-world tasks. However, there is a lack of large annotated datasets in audio and music domains, as annotating such recordings is resource-intensive, laborious, and often require expert domain knowledge. In this work, we explore the use of label propagation (LP), a graph-based semi-supervised learning technique, for automatically labeling the unlabeled set in an unsupervised manner. By constructing a similarity graph over audio embeddings, we propagate limited label information from a small annotated subset to a larger unlabeled corpus in a transductive, semi-supervised setting. We apply this method to two tasks in Indian Art Music (IAM): Raga identification and Instrument classification. For both these tasks, we integrate multiple public datasets along with additional recordings we acquire from Prasar Bharati Archives to perform LP. Our experiments demonstrate that LP significantly reduces labeling overhead and produces higher-quality annotations compared to conventional baseline methods, including those based on pretrained inductive models. These results highlight the potential of graph-based semi-supervised learning to democratize data annotation and accelerate progress in music information retrieval.
Problem

Research questions and friction points this paper is trying to address.

limited labels
Indian music analysis
label propagation
semi-supervised learning
music annotation
Innovation

Methods, ideas, or system contributions that make the work stand out.

label propagation
graph-based semi-supervised learning
transductive learning
audio embeddings
Indian Art Music
🔎 Similar Papers
No similar papers found.
P
Parampreet Singh
Department of Electrical Engineering, Indian Institution of Technology, Kanpur, India
Akshay Raina
Akshay Raina
Phd, IIT Kanpur
SIGNAL PROCESSINGMACHINE LEARNINGDEEP LEARNING
S
S. I. Sheikh
Department of Chemical Engineering, Indian Institution of Technology, Kanpur, India
V
Vipul Arora
Katholieke Universiteit Leuven