An Investigation of Test-time Adaptation for Audio Classification under Background Noise

📅 2025-07-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address domain shift induced by background noise in audio classification, this paper proposes CoNMix, a noise-robust test-time adaptation (TTA) method. Unlike existing test-time training (TTT) and test-time entropy minimization (TENT) approaches, CoNMix is the first TTA framework tailored to audio classification under noisy domain shifts, dynamically adapting model parameters during inference using unlabeled test samples. Evaluated on AudioMNIST and SpeechCommands under diverse noise types and signal-to-noise ratios, CoNMix consistently outperforms baseline methods, achieving a minimum error rate of 5.31% on AudioMNIST. These results demonstrate its strong generalization capability and practical efficacy. This work establishes a novel paradigm and an effective technical pathway for TTA in audio classification, advancing robustness to real-world acoustic perturbations.

Technology Category

Application Category

📝 Abstract
Domain shift is a prominent problem in Deep Learning, causing a model pre-trained on a source dataset to suffer significant performance degradation on test datasets. This research aims to address the issue of audio classification under domain shift caused by background noise using Test-Time Adaptation (TTA), a technique that adapts a pre-trained model during testing using only unlabelled test data before making predictions. We adopt two common TTA methods, TTT and TENT, and a state-of-the-art method CoNMix, and investigate their respective performance on two popular audio classification datasets, AudioMNIST (AM) and SpeechCommands V1 (SC), against different types of background noise and noise severity levels. The experimental results reveal that our proposed modified version of CoNMix produced the highest classification accuracy under domain shift (5.31% error rate under 10 dB exercise bike background noise and 12.75% error rate under 3 dB running tap background noise for AM) compared to TTT and TENT. The literature search provided no evidence of similar works, thereby motivating the work reported here as the first study to leverage TTA techniques for audio classification under domain shift.
Problem

Research questions and friction points this paper is trying to address.

Address audio classification under domain shift caused by background noise
Investigate Test-Time Adaptation methods for noise-robust audio classification
Improve classification accuracy using modified CoNMix under severe noise conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Test-Time Adaptation for audio classification
Modified CoNMix method for domain shift
Unlabelled test data adaptation pre-prediction
🔎 Similar Papers
No similar papers found.
W
Weichuang Shao
School of Computer Science, The University of Nottingham Malaysia, Semenyih, Malaysia
Iman Yi Liao
Iman Yi Liao
University of Nottingham Malaysia Campus
Computer VisionImage ProcessingMachine LearningComputer Graphics
T
Tomas Henrique Bode Maul
School of Computer Science, The University of Nottingham Malaysia, Semenyih, Malaysia
Tissa Chandesa
Tissa Chandesa
Professor (Assistant), School of Computer Science, University of Nottingham Malaysia
Image ProcessingComputer VisionDeep & Machine LearningGenerative AI