Controlling the Parameterized Multi-channel Wiener Filter using a tiny neural network

📅 2025-07-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the fundamental trade-off between noise suppression and speech distortion in multi-channel speech enhancement, this paper proposes a lightweight end-to-end neural approach that controls a parametric multi-channel Wiener filter (PMWF) via predicted frequency-domain parameters. Unlike conventional fixed-parameter filters or black-box deep learning models, our method employs a compact neural network to estimate only the essential PMWF control parameters—preserving the filter’s physical interpretability and enabling explicit distortion control—while fully leveraging deep learning’s capacity for complex noise modeling. The architecture is designed for low latency and minimal computational complexity, making it suitable for real-time embedded deployment. Experimental results demonstrate that, under equivalent computational budgets, our method significantly outperforms multiple state-of-the-art baselines in objective metrics (PESQ, STOI) and subjective listening quality. To the best of our knowledge, this work represents the first effective neural parameterization of PMWF achieving joint optimization of efficiency and perceptual speech quality.

Technology Category

Application Category

📝 Abstract
Noise suppression and speech distortion are two important aspects to be balanced when designing multi-channel Speech Enhancement (SE) algorithms. Although neural network models have achieved state-of-the-art noise suppression, their non-linear operations often introduce high speech distortion. Conversely, classical signal processing algorithms such as the Parameterized Multi-channel Wiener Filter ( PMWF) beamformer offer explicit mechanisms for controlling the suppression/distortion trade-off. In this work, we present NeuralPMWF, a system where the PMWF is entirely controlled using a low-latency, low-compute neural network, resulting in a low-complexity system offering high noise reduction and low speech distortion. Experimental results show that our proposed approach results in significantly better perceptual and objective speech enhancement in comparison to several competitive baselines using similar computational resources.
Problem

Research questions and friction points this paper is trying to address.

Balancing noise suppression and speech distortion in multi-channel speech enhancement
Controlling PMWF beamformer trade-off with low-complexity neural network
Achieving high noise reduction and low speech distortion efficiently
Innovation

Methods, ideas, or system contributions that make the work stand out.

Tiny neural network controls PMWF
Balances noise suppression and distortion
Low-latency, low-compute neural solution
🔎 Similar Papers
No similar papers found.
Eric Grinstein
Eric Grinstein
Imperial College London
Audio Signal ProcessingDeep Neural Networks
A
Ashutosh Pandey
Meta Reality Labs
C
Cole Li
Meta Reality Labs
S
Shanmukha Srinivas
Ohio State University
J
Juan Azcarreta
Meta Reality Labs
Jacob Donley
Jacob Donley
Meta
Signal ProcessingSpeech EnhancementMachine LearningArray ProcessingBeamforming
S
Sanha Lee
Meta Reality Labs
Ali Aroudi
Ali Aroudi
Dr. of Engineering
Machine LearningDeep LearningSignal ProcessingBrain-Computer InterfaceNeural Signal Processing
C
Cagdas Bilen
Meta Reality Labs