A Few-Shot Metric Learning Method with Dual-Channel Attention for Cross-Modal Same-Neuron Identification

📅 2025-04-23
📈 Citations: 0
Influential: 0
📄 PDF

career value

201K/year
🤖 AI Summary
To address the challenge of precise cross-modal (two-photon vs. fMOST) single-neuron matching under limited annotations, this paper proposes a few-shot robust recognition framework. Methodologically, it introduces: (1) a novel dual-channel attention mechanism that decouples somatic morphology from axonal/dendritic fiber context; (2) a joint optimization strategy combining MultiSimilarityMiner for hard-sample mining and Circle Loss to enhance discriminative feature learning; and (3) an integrated architecture incorporating a pretrained Vision Transformer backbone, gated feature fusion, and complementary local-global attention. Evaluated on real-world datasets, the method achieves significantly higher Top-K accuracy and recall compared to state-of-the-art approaches. Ablation studies and efficiency analysis confirm the effectiveness and training cost-effectiveness of each component. This work establishes a scalable technical paradigm for multimodal structure–function correlation analysis in neuroscience.

Technology Category

Application Category

📝 Abstract
In neuroscience research, achieving single-neuron matching across different imaging modalities is critical for understanding the relationship between neuronal structure and function. However, modality gaps and limited annotations present significant challenges. We propose a few-shot metric learning method with a dual-channel attention mechanism and a pretrained vision transformer to enable robust cross-modal neuron identification. The local and global channels extract soma morphology and fiber context, respectively, and a gating mechanism fuses their outputs. To enhance the model's fine-grained discrimination capability, we introduce a hard sample mining strategy based on the MultiSimilarityMiner algorithm, along with the Circle Loss function. Experiments on two-photon and fMOST datasets demonstrate superior Top-K accuracy and recall compared to existing methods. Ablation studies and t-SNE visualizations validate the effectiveness of each module. The method also achieves a favorable trade-off between accuracy and training efficiency under different fine-tuning strategies. These results suggest that the proposed approach offers a promising technical solution for accurate single-cell level matching and multimodal neuroimaging integration.
Problem

Research questions and friction points this paper is trying to address.

Achieving single-neuron matching across different imaging modalities
Addressing modality gaps and limited annotations in neuron identification
Enhancing fine-grained discrimination for cross-modal neuron matching
Innovation

Methods, ideas, or system contributions that make the work stand out.

Few-shot metric learning with dual-channel attention
Hard sample mining via MultiSimilarityMiner algorithm
Pretrained vision transformer for cross-modal matching
🔎 Similar Papers
No similar papers found.
W
Wenwei Li
Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, MoE Key Laboratory for Biomedical Photonics, Huazhong University of Science and Technology, Wuhan 430074, China.
L
Liyi Cai
Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, MoE Key Laboratory for Biomedical Photonics, Huazhong University of Science and Technology, Wuhan 430074, China.
Wu Chen
Wu Chen
Hong Kong Polytechnic University
GNSSSurveyingIndoor positioningIonospheric ScintillationIntegrity
Anan Li
Anan Li
Hainan University
Biomedical EngineeringNeuroinformaticsBrainsmaticsNeuroscienceSoftware Engineering