Confidence-Calibrating Regularization for Robust Brain MRI Segmentation Under Domain Shift

📅 2025-09-27

📈 Citations: 0

✨ Influential: 0

career value

202K/year

🤖 AI Summary

To address poor generalization and inadequate uncertainty calibration of the Segment Anything Model (SAM) in brain MRI segmentation—caused by domain shift and overconfidence—this paper proposes CalSAM, a lightweight adaptive framework. CalSAM freezes SAM’s encoder and fine-tunes only the decoder. It introduces two novel penalties: (i) a feature-level Fisher information penalty to suppress domain sensitivity, and (ii) a voxel-wise confidence misalignment penalty to calibrate prediction reliability. This dual-penalty mechanism jointly enhances robustness and calibration across multi-center and multi-device scenarios while preserving computational efficiency. On the BraTS scanner transfer task, CalSAM achieves a Dice Similarity Coefficient (DSC) of 80.1% (+significant gain) and reduces Hausdorff Distance at 95% (HD95) by 26.9%. On ATLAS-C motion-corrupted data, it attains a DSC of 75.9% and reduces Expected Calibration Error (ECE) by 32.6%, consistently outperforming baselines. These results validate CalSAM’s effectiveness and strong cross-domain generalizability.

Technology Category

Application Category

📝 Abstract

The Segment Anything Model (SAM) exhibits strong zero-shot performance on natural images but suffers from domain shift and overconfidence when applied to medical volumes. We propose extbf{CalSAM}, a lightweight adaptation framework that (i) reduces encoder sensitivity to domain shift via a emph{Feature Fisher Information Penalty} (FIP) computed on 3D feature maps and (ii) penalizes overconfident voxel-wise errors through a emph{Confidence Misalignment Penalty} (CMP). The combined loss, (mathcal{L}_{mathrm{CalSAM}}) fine-tunes only the mask decoder while keeping SAM's encoders frozen. On cross-center and scanner-shift evaluations, CalSAM substantially improves accuracy and calibration: e.g., on the BraTS scanner split (Siemens$ o$GE) CalSAM shows a $+7.4%$ relative improvement in $mathrm{DSC}$ (80.1% vs. 74.6%), a $-26.9%$ reduction in $mathrm{HD95}$ (4.6 mm vs. 6.3 mm), and a $-39.5%$ reduction in $mathrm{ECE}$ (5.2% vs. 8.6%). On ATLAS-C (motion corruptions), CalSAM achieves a $+5.3%$ relative improvement in $mathrm{DSC}$ (75.9%) and a $-32.6%$ reduction in $mathrm{ECE}$ (5.8%). Ablations show FIP and CMP contribute complementary gains ($p<0.01$), and the Fisher penalty incurs a modest $sim$15% training-time overhead. CalSAM therefore delivers improved domain generalization and better-calibrated uncertainty estimates for brain MRI segmentation, while retaining the computational benefits of freezing SAM's encoder.

Problem

Research questions and friction points this paper is trying to address.

Addressing domain shift in brain MRI segmentation

Reducing overconfidence in medical volume segmentation

Improving calibration and accuracy under scanner variations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Penalizes domain shift sensitivity via Feature Fisher Information

Reduces overconfident errors with Confidence Misalignment Penalty

Fine-tunes only SAM mask decoder while freezing encoders

🔎 Similar Papers

No similar papers found.