MEGConformer: Conformer-Based MEG Decoder for Robust Speech and Phoneme Classification

📅 2025-12-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses two fundamental tasks in magnetoencephalography (MEG) signal decoding—speech detection and phoneme classification—and proposes the first unified Conformer architecture specifically designed for MEG. Methodologically, it introduces a lightweight convolutional projection layer to handle raw 306-channel MEG data; incorporates task-specific decoding heads; designs an MEG-tailored SpecAugment strategy; employs a dynamic grouped loading mechanism for hundred-trial-averaged data; and applies instance-level normalization to mitigate distribution shift. Optimization leverages inverse-square-root class weighting and F1-macro–driven model selection. Experiments on the official benchmark yield leaderboard scores of 88.9% for speech detection and 65.8% for phoneme classification—substantially outperforming baselines and placing both tasks within the top ten. The proposed framework establishes a scalable, robust new paradigm for MEG-based speech decoding.

Technology Category

Application Category

📝 Abstract
We present Conformer-based decoders for the LibriBrain 2025 PNPL competition, targeting two foundational MEG tasks: Speech Detection and Phoneme Classification. Our approach adapts a compact Conformer to raw 306-channel MEG signals, with a lightweight convolutional projection layer and task-specific heads. For Speech Detection, a MEG-oriented SpecAugment provided a first exploration of MEG-specific augmentation. For Phoneme Classification, we used inverse-square-root class weighting and a dynamic grouping loader to handle 100-sample averaged examples. In addition, a simple instance-level normalization proved critical to mitigate distribution shifts on the holdout split. Using the official Standard track splits and F1-macro for model selection, our best systems achieved 88.9% (Speech) and 65.8% (Phoneme) on the leaderboard, surpassing the competition baselines and ranking within the top-10 in both tasks. For further implementation details, the technical documentation, source code, and checkpoints are available at https://github.com/neural2speech/libribrain-experiments.
Problem

Research questions and friction points this paper is trying to address.

Develop Conformer-based decoder for MEG signal classification
Address speech detection and phoneme classification tasks
Mitigate distribution shifts and handle imbalanced data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Conformer-based decoder for raw MEG signals
MEG-oriented SpecAugment for data augmentation
Instance-level normalization to mitigate distribution shifts
🔎 Similar Papers
No similar papers found.
X
Xabier de Zuazo
HiTZ Center, Dept. of Communications Engineering, School of Engineering, University of the Basque Country (UPV/EHU), Bilbao, Spain
Ibon Saratxaga
Ibon Saratxaga
University of the Basque Country (UPV/EHU)
Speechsound classification
Eva Navas
Eva Navas
University of the Basque Country
Speech synthesisspeaker diarization