GCA-SUNet: A Gated Context-Aware Swin-UNet for Exemplar-Free Counting

2024-09-18
AI Summary
This paper addresses exemplar-free object counting: estimating object counts without category annotations or exemplar images. It proposes an end-to-end density map regression framework with three components: (1) a gated context-aware modulation module that pairs a gate mechanism with self-similarity attention among objects of interest; (2) gate mechanisms integrated into the Swin Transformer encoder, the bottleneck layer, and the decoder to suppress background interference; and (3) self-similarity-guided feature enhancement intended to improve cross-scene generalization. On the real-world benchmarks FSC-147 and CARPK, the approach significantly outperforms existing state-of-the-art methods in both accuracy and generalization. The framework removes the reliance on class labels and exemplar images for counting in open environments while remaining robust across diverse scenes.

Abstract
Exemplar-Free Counting aims to count objects of interest without intensive annotations of objects or exemplars. To achieve this, we propose a Gated Context-Aware Swin-UNet (GCA-SUNet) to directly map an input image to the density map of countable objects. Specifically, a set of Swin transformers forms an encoder to derive a robust feature representation, and a Gated Context-Aware Modulation block is designed to suppress irrelevant objects or background through a gate mechanism and exploit the attentive support of objects of interest through a self-similarity matrix. The gate strategy is also incorporated into the bottleneck network and the decoder of the Swin-UNet to highlight the features most relevant to objects of interest. By explicitly exploiting the attentive support among countable objects and eliminating irrelevant features through the gate mechanisms, the proposed GCA-SUNet focuses on and counts objects of interest without relying on predefined categories or exemplars. Experimental results on real-world datasets such as FSC-147 and CARPK demonstrate that GCA-SUNet significantly and consistently outperforms state-of-the-art methods. The code is available at https://github.com/Amordia/GCA-SUNet.
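
The abstract mentions a self-similarity matrix that lets objects of interest support one another, but it does not give the exact formulation. The following is a minimal sketch of one common way to build and use such a matrix, assuming cosine similarity between token features and softmax-weighted aggregation. The function names `cosine_self_similarity` and `attend` and the toy features are hypothetical illustrations and are not taken from the GCA-SUNet code.

```python
import math

def cosine_self_similarity(tokens):
    """Return the N x N matrix of cosine similarities between token features."""
    # Guard against zero vectors by falling back to a norm of 1.0.
    norms = [math.sqrt(sum(v * v for v in t)) or 1.0 for t in tokens]
    return [[sum(a * b for a, b in zip(ti, tj)) / (ni * nj)
             for tj, nj in zip(tokens, norms)]
            for ti, ni in zip(tokens, norms)]

def attend(tokens, similarity):
    """Aggregate tokens using softmax-normalised rows of the similarity matrix."""
    dim = len(tokens[0])
    enhanced = []
    for row in similarity:
        exps = [math.exp(s) for s in row]
        total = sum(exps)
        weights = [e / total for e in exps]
        enhanced.append([sum(w * t[d] for w, t in zip(weights, tokens))
                         for d in range(dim)])
    return enhanced

# Toy features (assumed): tokens 0 and 1 look alike, token 2 is different.
tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
similarity = cosine_self_similarity(tokens)
enhanced = attend(tokens, similarity)
```

In this toy setup, tokens 0 and 1 receive high mutual similarity, so each draws most of its aggregated feature from the other, while the dissimilar token 2 contributes less.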

Problem

Research questions and friction points this paper is trying to address.

Count objects without exemplars or annotations
Map input images to density maps directly
Suppress irrelevant features using gated mechanisms
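
Density-map counting rests on one property: the predicted map integrates to the object count. The sketch below illustrates that property on a synthetic map, assuming point annotations and one unit-mass Gaussian blob per object, which is a common way to build density targets. The helper names, the `sigma` value, and the points are hypothetical, and the page does not state how GCA-SUNet builds its training targets.

```python
import math

def gaussian_density_map(points, height, width, sigma=1.5):
    """Place one unit-mass Gaussian blob per annotated point (y, x)."""
    density = [[0.0] * width for _ in range(height)]
    for py, px in points:
        # Unnormalised Gaussian weights over the whole image grid.
        weights = [[math.exp(-((y - py) ** 2 + (x - px) ** 2) / (2 * sigma ** 2))
                    for x in range(width)]
                   for y in range(height)]
        total = sum(map(sum, weights))
        # Normalise so each object contributes exactly 1 to the map's sum,
        # even when the blob is clipped by the image border.
        for y in range(height):
            for x in range(width):
                density[y][x] += weights[y][x] / total
    return density

def count_from_density(density):
    """The estimated count is the sum of all density values."""
    return sum(map(sum, density))

# Three hypothetical object centres on a 10 x 10 image.
points = [(2, 3), (5, 5), (7, 1)]
density = gaussian_density_map(points, height=10, width=10)
estimated_count = count_from_density(density)
```

A network that regresses such a map never needs a category label at inference time: summing its output yields the count directly.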

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Swin transformers for robust feature encoding
Implements Gated Context-Aware Modulation block
Incorporates gate mechanism in bottleneck and decoder
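
The gate mechanism named in the items above is not specified in detail on this page. A minimal sketch of a generic feature gate, assuming a sigmoid of a learned linear projection that scales each channel, might look as follows. The function `gated`, the weights, and the bias values are hypothetical and hand-set for illustration; they are not the paper's parameters or its exact block design.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gated(features, weights, bias):
    """Scale each channel of each token by a gate in (0, 1).

    The gate for channel c is sigmoid(weights[c] . token + bias[c]),
    so low-scoring channels are attenuated rather than removed.
    """
    output = []
    for token in features:
        gates = [sigmoid(sum(w * v for w, v in zip(row, token)) + b)
                 for row, b in zip(weights, bias)]
        output.append([g * v for g, v in zip(gates, token)])
    return output

# Hand-set (assumed) parameters; in a real model these would be learned.
features = [[2.0, -1.0], [0.5, 0.5]]
weights = [[1.0, 0.0], [0.0, 1.0]]
bias = [-1.0, -1.0]
gated_features = gated(features, weights, bias)
```

Because every gate lies strictly between 0 and 1, the operation can only shrink activations, which matches the stated goal of suppressing irrelevant objects and background while passing relevant features through.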
Yuzhe Wu
School of Computer Science, University of Nottingham Ningbo China
Yipeng Xu
School of Computer Science, University of Nottingham Ningbo China
Tianyu Xu
School of Computer Science, University of Nottingham Ningbo China
Jialu Zhang
School of Computer Science, University of Nottingham Ningbo China
Jianfeng Ren
University of Nottingham Ningbo China
Computer Vision, Pattern Recognition, Machine Learning, Human-Computer Interaction
Xudong Jiang
School of Electrical & Electronic Engineering, Nanyang Technological University, Singapore