Boost UAV-based Object Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning

📅 2024-05-24

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

To address the low accuracy and poor real-time performance of small-object detection in UAV imagery, this paper proposes a high-precision single-stage detection framework. First, we design a Scale-Invariant Feature Disentangling (SIFD) module that explicitly separates scale-related and scale-invariant features. Second, we introduce an adversarial feature learning mechanism to make the disentanglement more robust. Third, we construct State-Air, the first multimodal UAV dataset incorporating flight-control state parameters. Our method is built upon lightweight single-stage detectors (YOLOv5/v8/PP-YOLOE) and optimized end-to-end. It achieves state-of-the-art (SOTA) performance on both public and in-house datasets, with significant improvements in small-object AP while maintaining real-time inference speed (>30 FPS). The source code and the State-Air dataset will be publicly released.

📝 Abstract

Detecting objects from Unmanned Aerial Vehicles (UAVs) is often hindered by a large number of small objects, resulting in low detection accuracy. To address this issue, mainstream approaches typically rely on multi-stage inference. Despite their remarkable detection accuracy, they sacrifice real-time efficiency, which makes them less practical for real-world applications. To this end, we propose to improve single-stage inference accuracy by learning scale-invariant features. Specifically, a Scale-Invariant Feature Disentangling module is designed to disentangle scale-related and scale-invariant features. Then an Adversarial Feature Learning scheme is employed to enhance disentanglement. Finally, scale-invariant features are leveraged for robust UAV-based object detection. Furthermore, we construct a multi-modal UAV object detection dataset, State-Air, which incorporates annotated UAV state parameters. We apply our approach to three lightweight detection frameworks on two benchmark datasets. Extensive experiments demonstrate that our approach effectively improves model accuracy and achieves state-of-the-art (SoTA) performance on both datasets. Our code and dataset will be publicly available once the paper is accepted.
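
The abstract does not spell out how the disentangling module or the adversarial scheme is built, so the following is only a minimal sketch of one common way such a setup can be arranged, not the authors' implementation. It assumes a feature vector is split by channel index into a scale-invariant part and a scale-related part, and that a linear scale discriminator tries to predict a coarse object-scale class from the invariant part while the encoder is rewarded for making that prediction fail. The names `split_features`, `adversarial_losses`, `disc_weights`, and `lam` are hypothetical.

```python
import math


def split_features(features, num_invariant):
    """Assumed disentangling step: the first `num_invariant` channels are
    treated as scale-invariant, the remaining channels as scale-related."""
    return features[:num_invariant], features[num_invariant:]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Cross-entropy of a single sample against an integer class label."""
    return -math.log(softmax(logits)[label])


def linear(weights, x):
    """Apply a bias-free linear layer given as a list of weight rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def adversarial_losses(features, num_invariant, disc_weights, scale_label, lam=1.0):
    """Return (discriminator loss, adversarial loss for the encoder).

    The discriminator minimises its cross-entropy on the scale label.
    The encoder minimises the negated, weighted value, which pushes scale
    information out of the channels meant to be scale-invariant.
    """
    invariant, _scale_related = split_features(features, num_invariant)
    disc_loss = cross_entropy(linear(disc_weights, invariant), scale_label)
    encoder_adv_loss = -lam * disc_loss
    return disc_loss, encoder_adv_loss


# Toy example: 4 feature channels, 2 of them assumed scale-invariant,
# and 3 hypothetical scale classes (small / medium / large).
features = [0.5, -1.0, 2.0, 0.1]
disc_weights = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
disc_loss, encoder_adv_loss = adversarial_losses(
    features, num_invariant=2, disc_weights=disc_weights, scale_label=0, lam=0.5
)
```

With all-zero discriminator weights the prediction is uniform over the three scale classes, so the discriminator loss equals ln 3. In a real detector the adversarial term would be added to the usual detection loss, and only the invariant features would feed the detection head.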

Problem

Research questions and friction points this paper is trying to address.

Drone Object Recognition

Accuracy and Efficiency

Small Object Detection

Innovation

Methods, ideas, or system contributions that make the work stand out.

Scale-Invariant Feature Disentanglement

Adversarial Feature Learning

State-Air dataset

🔎 Similar Papers

Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles