TASAM: Terrain-and-Aware Segment Anything Model for Temporal-Scale Remote Sensing Segmentation

📅 2025-09-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address SAM’s insufficient zero-shot segmentation generalization on remote sensing imagery—caused by complex terrain, multi-scale objects, and temporal dynamics—this paper proposes TASAM, a lightweight domain-adaptation framework that avoids fine-tuning SAM’s backbone. TASAM introduces three synergistic components: (1) a terrain-aware adapter that incorporates digital elevation model (DEM) priors to enhance geospatial contextual modeling; (2) a temporal prompt generator that leverages change cues to guide segmentation; and (3) a multi-scale feature fusion strategy to improve cross-scale consistency. The framework achieves efficient adaptation with minimal architectural overhead. Evaluated on three benchmark remote sensing datasets—LoveDA, iSAID, and WHU-CD—TASAM significantly outperforms both zero-shot SAM and task-specific supervised models, achieving an average mIoU gain of 8.2% while increasing inference latency by less than 5%.

Technology Category

Application Category

📝 Abstract
Segment Anything Model (SAM) has demonstrated impressive zero-shot segmentation capabilities across natural image domains, but it struggles to generalize to the unique challenges of remote sensing data, such as complex terrain, multi-scale objects, and temporal dynamics. In this paper, we introduce TASAM, a terrain and temporally-aware extension of SAM designed specifically for high-resolution remote sensing image segmentation. TASAM integrates three lightweight yet effective modules: a terrain-aware adapter that injects elevation priors, a temporal prompt generator that captures land-cover changes over time, and a multi-scale fusion strategy that enhances fine-grained object delineation. Without retraining the SAM backbone, our approach achieves substantial performance gains across three remote sensing benchmarks-LoveDA, iSAID, and WHU-CD-outperforming both zero-shot SAM and task-specific models with minimal computational overhead. Our results highlight the value of domain-adaptive augmentation for foundation models and offer a scalable path toward more robust geospatial segmentation.
Problem

Research questions and friction points this paper is trying to address.

Adapting SAM for remote sensing terrain challenges
Addressing multi-scale object segmentation in satellite imagery
Capturing temporal land-cover changes in geospatial data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Terrain-aware adapter with elevation priors
Temporal prompt generator for land-cover changes
Multi-scale fusion strategy for object delineation
🔎 Similar Papers
No similar papers found.
Tianyang Wang
Tianyang Wang
University of Alabama at Birmingham
machine learning (deep learning)computer vision
Xi Xiao
Xi Xiao
Oak Ridge National Laboratory | University of Alabama at Birmingham
LLM / MLLM EfficiencyImage / Video GenerationImage / Video Understanding
Gaofei Chen
Gaofei Chen
Undergraduate, University of Alabama at Birmingham
Machine LearningDeep LearningBioinformaticsComputational Biology
H
Hanzhang Chi
University of Alabama at Birmingham, Birmingham, United States
Q
Qi Zhang
Wuhan University, Wuhan, China
G
Guo Cheng
Dalian University of Technology, Dalian, China
Y
Yingrui Ji
University of Chinese Academy of Sciences, Beijing, China