Auto-Prompting SAM for Weakly Supervised Landslide Extraction

📅 2025-01-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In weakly supervised landslide detection, coarse-grained image-level labels and complex landslide morphologies lead to inaccurate boundary localization. To address this without pixel-level annotations or fine-tuning of the Segment Anything Model (SAM), this paper proposes a prompt-driven segmentation framework grounded solely in prompt engineering. Its key contributions are: (1) an adaptive hybrid prompting mechanism that jointly leverages bounding boxes and center points; (2) the Adaptive Pattern Generation (APG) algorithm, which efficiently mines discriminative visual patterns from Class Activation Maps (CAMs) to generate high-quality pseudo-masks end-to-end; and (3) a streamlined inference chain—“localization network → CAM → APG → SAM”—that requires no model adaptation. Evaluated on high-resolution aerial and satellite datasets, our method achieves ≥3.0% improvement in F1-score and a 3.69% gain in IoU over state-of-the-art weakly supervised approaches.

Technology Category

Application Category

📝 Abstract
Weakly supervised landslide extraction aims to identify landslide regions from remote sensing data using models trained with weak labels, particularly image-level labels. However, it is often challenged by the imprecise boundaries of the extracted objects due to the lack of pixel-wise supervision and the properties of landslide objects. To tackle these issues, we propose a simple yet effective method by auto-prompting the Segment Anything Model (SAM), i.e., APSAM. Instead of depending on high-quality class activation maps (CAMs) for pseudo-labeling or fine-tuning SAM, our method directly yields fine-grained segmentation masks from SAM inference through prompt engineering. Specifically, it adaptively generates hybrid prompts from the CAMs obtained by an object localization network. To provide sufficient information for SAM prompting, an adaptive prompt generation (APG) algorithm is designed to fully leverage the visual patterns of CAMs, enabling the efficient generation of pseudo-masks for landslide extraction. These informative prompts are able to identify the extent of landslide areas (box prompts) and denote the centers of landslide objects (point prompts), guiding SAM in landslide segmentation. Experimental results on high-resolution aerial and satellite datasets demonstrate the effectiveness of our method, achieving improvements of at least 3.0% in F1 score and 3.69% in IoU compared to other state-of-the-art methods. The source codes and datasets will be available at https://github.com/zxk688.
Problem

Research questions and friction points this paper is trying to address.

Weak Supervision
Landslide Detection
Label Precision
Innovation

Methods, ideas, or system contributions that make the work stand out.

APSAM
Weakly-Supervised Learning
Landslide Detection
🔎 Similar Papers
No similar papers found.
J
Jian Wang
School of Information Science and Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
X
Xiaokang Zhang
School of Information Science and Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
X
Xianping Ma
School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
Weikang Yu
Weikang Yu
PhD Researcher, HZDR & Technical University of Munich
Remote sensingDeep learning
Pedram Ghamisi
Pedram Ghamisi
HZDR & Lancaster University, Group Leader and Professor
Earth ObservationDeep LearningAI4EOResponsible AIRemote Sensing