Adapting SAM with Dynamic Similarity Graphs for Few-Shot Parameter-Efficient Small Dense Object Detection: A Case Study of Chickpea Pods in Field Conditions

📅 2025-09-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of scarce training data and complex field conditions hindering efficient fine-tuning of Segment Anything Model (SAM) in agricultural settings, this paper proposes Dynamic Similarity Graph Adaptation (DSGA), tailored for foreground and instance segmentation of small, densely packed crop targets—e.g., chickpea pods. DSGA integrates dynamic similarity graph construction, learnable weight ranking, adaptive local feature aggregation, and LoRA-based low-rank parameter updates, jointly modeling global dependencies and local details with only 4.00M trainable parameters. A learnable polynomial decay initialization is introduced to enhance convergence stability, while Grad-CAM and t-SNE enable interpretable analysis. On the chickpea pod dataset, DSGA achieves a 17.31% improvement in structural measure and a 62.36% gain in adaptive F-measure under 2–10-shot settings, with counting correlation reaching an adjusted R² of 0.8987—substantially outperforming existing PEFT methods.

Technology Category

Application Category

📝 Abstract
Parameter-Efficient Fine-Tuning (PEFT) of foundation models for agricultural computer vision tasks remains challenging due to limited training data and complex field conditions. This study introduces a Dynamic Similarity-based Graph Adaptation (DSGA) module to adapt the Segment Anything Model (SAM) under extreme data constraints for precise foreground and instance segmentation of small dense objects in complex agricultural environments. Through dynamic similarity graph construction with a learnable polynomial decay-initialized weight ranking mechanism and adaptive local feature aggregation, DSGA establishes robust spatial and dynamic similarity representation with only 4.00M trainable parameters, which is 4.26% of the original SAM. Integrating this graph-based feature adaptation with Low-Rank Adaptation (LoRA) creates a complementary optimization framework that effectively captures both local and global dependencies in image embeddings while preserving model stability and parameter efficiency. Experimental results on a challenging chickpea pod dataset demonstrated that DSGA with LoRA achieved superior performance across multiple metrics evaluated under 2, 4, 8 and 10 shots, with progressive performance gains as shot count increased. Quantitative metrics showed a 17.31% improvement in Structure-measure and a 62.36% gain in adaptive F-measure compared to the baseline SAM fine-tuning. Comprehensive ablation studies and visualization analyses through Grad-CAM and t-SNE validated the framework's effectiveness in feature discrimination. The proposed adaptation demonstrated practical utility for automated agricultural monitoring applications, achieving accurate pod-counting with an adjusted R-squared of 0.8987 for images with 10 to 120 pods under challenging field conditions.
Problem

Research questions and friction points this paper is trying to address.

Adapting foundation models for small dense object detection with limited data
Achieving precise segmentation in complex agricultural field conditions
Maintaining parameter efficiency while improving segmentation accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic Similarity Graph Adaptation module enhances SAM
Integrates graph adaptation with Low-Rank Adaptation for optimization
Achieves parameter efficiency with only 4M trainable parameters
🔎 Similar Papers
No similar papers found.
X
Xintong Jiang
McGill University, Department of Bioresource Engineering, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
Y
Yixue Liu
McGill University, Department of Bioresource Engineering, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
M
Mohamed Debbagh
McGill University, Department of Bioresource Engineering, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
Y
Yu Tian
McGill University, Department of Bioresource Engineering, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
V
Valerio Hoyos-Villegas
Michigan State University, Department of Plant, Soil and Microbial Sciences, 426 Auditorium Road, East Lansing, Michigan 48824, United States
V
Viacheslav Adamchuk
McGill University, Department of Bioresource Engineering, 21111 Lakeshore Road, Sainte-Anne-de-Bellevue, Quebec H9X 3V9, Canada
Shangpeng Sun
Shangpeng Sun
Assistant Professor, McGill University, Canada
Digital agriculturePlant phenotypingMachine/Deep learning2D/3D computer visionRemote sensing