๐ค AI Summary
This work addresses the trade-off between efficiency and accuracy in modeling long-range temporal dependencies for automatic chord recognition by proposing BMACE, the first model to introduce a bidirectional Mamba architecture to this task. Built upon selective structured state space models, BMACE integrates bidirectional temporal modeling with deep learningโbased audio feature extraction to effectively capture long-range dependencies in musical signals. Experimental results demonstrate that BMACE achieves chord recognition accuracy comparable to state-of-the-art methods on standard benchmarks while substantially reducing both model parameters and computational overhead, thereby unifying high accuracy with high efficiency.
๐ Abstract
In this work, we propose a new efficient solution, which is a Mamba-based model named BMACE (Bidirectional Mamba-based network, for Automatic Chord Estimation), which utilizes selective structured state-space models in a bidirectional Mamba layer to effectively model temporal dependencies. Our model achieves high prediction performance comparable to state-of-the-art models, with the advantage of requiring fewer parameters and lower computational resources