CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation

📅 2025-07-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the generalization bottleneck in cross-domain few-shot segmentation (CD-FSS) caused by data scarcity and domain shift, this paper proposes a composable meta-prompting framework tailored for the Segment Anything Model (SAM). To mitigate SAM’s reliance on handcrafted prompts and its limited cross-domain adaptability, we introduce three key components: reference augmentation and transformation, composable meta-prompt generation, and frequency-domain-aware interaction—enabling automatic prompt construction, semantic expansion, and domain-difference suppression. Crucially, our framework avoids fine-tuning SAM’s backbone, achieving strong cross-domain transfer solely via lightweight prompt engineering. Evaluated on four standard CD-FSS benchmarks, it achieves 71.8% and 74.5% mIoU under 1-shot and 5-shot settings, respectively—outperforming prior methods significantly. This work establishes an efficient, generalizable, and interpretable paradigm for CD-FSS.

Technology Category

Application Category

📝 Abstract
Cross-Domain Few-Shot Segmentation (CD-FSS) remains challenging due to limited data and domain shifts. Recent foundation models like the Segment Anything Model (SAM) have shown remarkable zero-shot generalization capability in general segmentation tasks, making it a promising solution for few-shot scenarios. However, adapting SAM to CD-FSS faces two critical challenges: reliance on manual prompt and limited cross-domain ability. Therefore, we propose the Composable Meta-Prompt (CMP) framework that introduces three key modules: (i) the Reference Complement and Transformation (RCT) module for semantic expansion, (ii) the Composable Meta-Prompt Generation (CMPG) module for automated meta-prompt synthesis, and (iii) the Frequency-Aware Interaction (FAI) module for domain discrepancy mitigation. Evaluations across four cross-domain datasets demonstrate CMP's state-of-the-art performance, achieving 71.8% and 74.5% mIoU in 1-shot and 5-shot scenarios respectively.
Problem

Research questions and friction points this paper is trying to address.

Adapting SAM to Cross-Domain Few-Shot Segmentation challenges
Reducing reliance on manual prompts in segmentation tasks
Mitigating domain shifts in few-shot segmentation scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

RCT module enables semantic expansion
CMPG automates meta-prompt synthesis
FAI mitigates domain discrepancy
🔎 Similar Papers
S
Shuai Chen
University of Electronic Science and Technology of China
F
Fanman Meng
University of Electronic Science and Technology of China
C
Chunjin Yang
University of Electronic Science and Technology of China
H
Haoran Wei
University of Electronic Science and Technology of China
C
Chenhao Wu
University of Electronic Science and Technology of China
Qingbo Wu
Qingbo Wu
University of Electronic Science and Technology of China
video codingimage and video quality assessment
H
Hongliang Li
University of Electronic Science and Technology of China