Mutual Distillation of Dual-Foundation Models for Semi-Supervised PET/CT Segmentation

📅 2026-06-14
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work addresses the high annotation cost and scarcity of labeled data in PET/CT image segmentation by proposing MuDuo, a framework that introduces dual-modality foundation models into semi-supervised segmentation for the first time. With SAM-Med3D (for CT) and SegAnyPET (for PET) as teacher models, MuDuo combines a prompt-free mutual distillation mechanism with semi-supervised learning and multimodal alignment strategies to fuse structural and metabolic information into a lightweight student network. Evaluated on the AutoPET dataset with only five annotated cases, MuDuo achieves state-of-the-art performance.
📝 Abstract
Organ segmentation from PET/CT is critical for quantitative analysis and radiotherapy planning in oncology. To ease the high annotation cost of PET/CT segmentation, semi-supervised learning (SSL) provides a practical and effective solution for developing deep models with limited labeled data. Recent developments in visual foundation models have demonstrated remarkable adaptability with improved efficiency. In this work, we propose MuDuo, a mutual distillation framework that exploits both structural and functional foundation models, which act as modality-specific generalists for distilling knowledge from structural CT and metabolic PET imaging. By bridging the gap between the task-specific precision of student models and the segmentation priors of generalist foundation models, MuDuo leverages SAM-Med3D for CT and SegAnyPET for PET to distill their knowledge into a lightweight student network. Our approach eliminates the need for manual prompts while maximizing the utility of unlabeled data for automatic segmentation, achieving state-of-the-art performance on the AutoPET dataset with only 5 labeled cases. Our source code is available at https://github.com/Wu-beining/MuDuo.
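The abstract does not spell out the loss functions, so the following is only a minimal sketch of generic dual-teacher distillation, not the authors' mutual distillation mechanism. It assumes each teacher (one for CT, one for PET) outputs per-voxel foreground probabilities, blends them with a fixed weight into a soft target for the student, and adds a supervised soft Dice term when a label is available. The function names, the mean-squared-error distillation term, and the weights `ct_weight` and `distill_weight` are all hypothetical choices made for illustration.

```python
import numpy as np


def soft_dice_loss(prob, target, eps=1e-6):
    """Return 1 minus the soft Dice overlap between probabilities and a binary mask."""
    intersection = np.sum(prob * target)
    return 1.0 - (2.0 * intersection + eps) / (np.sum(prob) + np.sum(target) + eps)


def dual_teacher_loss(student_prob, ct_teacher_prob, pet_teacher_prob,
                      label=None, ct_weight=0.5, distill_weight=1.0):
    """Illustrative student loss: optional supervised Dice plus distillation.

    Assumption: the teachers are blended with a fixed weight and matched with
    a mean squared error; the paper may use a different scheme entirely.
    """
    # Blend the CT and PET teacher predictions into one soft target.
    soft_target = ct_weight * ct_teacher_prob + (1.0 - ct_weight) * pet_teacher_prob
    # Distillation term: usable on unlabeled cases because it needs no mask.
    distill = np.mean((student_prob - soft_target) ** 2)
    # Supervised term: only computed for the few labeled cases.
    supervised = soft_dice_loss(student_prob, label) if label is not None else 0.0
    return supervised + distill_weight * distill


if __name__ == "__main__":
    student = np.array([0.5, 0.5])
    ct_teacher = np.array([1.0, 0.0])
    pet_teacher = np.array([1.0, 0.0])
    print("unlabeled loss:", dual_teacher_loss(student, ct_teacher, pet_teacher))
    print("labeled loss:", dual_teacher_loss(student, ct_teacher, pet_teacher,
                                             label=np.array([1.0, 0.0])))
```

In a semi-supervised setting of this kind, the distillation term is what lets unlabeled scans contribute to training, while the Dice term anchors the student to the handful of annotated cases.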
Problem

Research questions and friction points this paper is trying to address.

semi-supervised learning
PET/CT segmentation
organ segmentation
annotation cost
medical image analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

mutual distillation
foundation models
semi-supervised segmentation
PET/CT
modality-specific generalists
Fuyou Mao
Central South University, Changsha 410083, China
Beining Wu
Hangzhou Dianzi University, Hangzhou 310018, China
Yanfeng Jiang
Communication University of Zhejiang, Hangzhou 310018, China
Bohan Xu
Data Scientist, Laureate Institute for Brain Research
Machine learning, Optimization, Computational psychiatry, Genetic analyses
Lixin Lin
Central South University, Changsha 410083, China
Naye Ji
Communication University of Zhejiang, Hangzhou 310018, China
Hao Zhang
Central South University, Changsha 410083, China
Yan Tang
Central South University, Changsha 410083, China