Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching

๐Ÿ“… 2025-03-05
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Medical image segmentation heavily relies on large-scale annotated datasets, yet existing few-shot methods still require substantial training samples, hindering efficient clinical deployment. To address this, we propose a training-free few-shot 3D medical image segmentation framework: 3D volumetric data is reformulated as a video sequence, leveraging SAM2โ€™s temporal modeling capability; only a single support image is neededโ€”through data augmentation and frame-level dynamic matching, mask prompts are generated to directly drive SAM2 for query volume segmentation. Key contributions include: (i) the first training-free paradigm that eliminates fine-tuning; (ii) a support-query frame similarity-driven dynamic matching strategy; and (iii) the first reformulation of 3D medical segmentation as a video segmentation task. Our method achieves state-of-the-art performance on mainstream few-shot benchmarks, significantly improving both segmentation accuracy and annotation efficiency, while offering plug-and-play clinical applicability.

Technology Category

Application Category

๐Ÿ“ Abstract
The reliance on large labeled datasets presents a significant challenge in medical image segmentation. Few-shot learning offers a potential solution, but existing methods often still require substantial training data. This paper proposes a novel approach that leverages the Segment Anything Model 2 (SAM2), a vision foundation model with strong video segmentation capabilities. We conceptualize 3D medical image volumes as video sequences, departing from the traditional slice-by-slice paradigm. Our core innovation is a support-query matching strategy: we perform extensive data augmentation on a single labeled support image and, for each frame in the query volume, algorithmically select the most analogous augmented support image. This selected image, along with its corresponding mask, is used as a mask prompt, driving SAM2's video segmentation. This approach entirely avoids model retraining or parameter updates. We demonstrate state-of-the-art performance on benchmark few-shot medical image segmentation datasets, achieving significant improvements in accuracy and annotation efficiency. This plug-and-play method offers a powerful and generalizable solution for 3D medical image segmentation.
Problem

Research questions and friction points this paper is trying to address.

Reduces reliance on large labeled datasets for medical image segmentation.
Introduces a training-free framework using SAM2 for few-shot learning.
Enhances 3D medical image segmentation accuracy and annotation efficiency.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses SAM2 for video-like 3D medical segmentation
Implements augmentative prompting with dynamic matching
Eliminates need for model retraining or updates
๐Ÿ”Ž Similar Papers
No similar papers found.
H
Haiyue Zu
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
J
Jun Ge
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
H
Heting Xiao
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
J
Jile Xie
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
Z
Zhangzhe Zhou
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
Y
Yifan Meng
Independent Researcher.
Jiayi Ni
Jiayi Ni
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
J
Junjie Niu
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
L
Linlin Zhang
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.
Li Ni
Li Ni
School of Computer Science and Technology, Anhui University
Machine LearningData MiningclusteringCommunity Detection
H
Huilin Yang
Department of Orthopaedics, The First Affiliated Hospital of Soochow University, Soochow University, Suzhou, 215006, China.