Attention-Guided Autoencoder Fusion for Insulator Defect Detection Using UAV Transmission-Line Imaging

📅 2026-06-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of severe class imbalance, multi-scale defect variations, and low spatial occupancy in insulator defect detection for drone-based inspection by proposing the AE-YOLO framework. The method integrates a lightweight autoencoder into the FPN-PAN neck, enhanced with CBAM attention and a variance-maximization regularization strategy to heighten sensitivity to anomalous features during multi-scale fusion. An attention-guided confidence calibration mechanism is further introduced to refine detection reliability. Built upon an EfficientNetV2 backbone and optimized with Focal Loss, CIoU Loss, and Weighted Boxes Fusion (WBF), the model achieves 95.10% mAP@0.5, 96.40% precision, and 93.80% recall on an insulator defect dataset, surpassing the best YOLO baseline by 5.0 and 6.7 percentage points in mAP and recall, respectively.
📝 Abstract
Automated defect detection in high-voltage transmission-line insulators remains challenging due to severe class imbalance, large scale variation, and the small spatial extent of defect instances in Unmanned Aerial Vehicle (UAV) imagery. To address these challenges, this paper proposes AE-YOLO, an Attention-Guided AutoEncoder-Enhanced YOLO framework for robust insulator defect detection. The architecture integrates lightweight bottleneck autoencoders within a Feature Pyramid Network-Path Aggregation Network (FPN-PAN) neck. This preserves anomaly-sensitive information during multi-scale feature fusion. Convolutional Block Attention Modules (CBAM) are used throughout the backbone, enhancing feature discrimination and suppressing background interference. The framework also introduces a variance-maximizing autoencoder regularization strategy, which encourages diverse, defect-discriminative latent representations. The network trains using a unified objective that combines focal loss, Complete IoU (CIoU) loss, and autoencoder regularization to address foreground-background imbalance and improve localization accuracy. During inference, Weighted Boxes Fusion (WBF) combines predictions from YOLOv8, YOLOv10, and YOLO11. An autoencoder-guided confidence boosting mechanism improves sensitivity to rare defect categories. Experiments on the Insulator-Defect Detection dataset show that AE-YOLO with an EfficientNetV2 backbone achieves 95.10 percent mAP at 0.5, 96.40 percent precision, and 93.80 percent recall. This performance surpasses the strongest YOLO-family baseline by 5.0 points in mAP at 0.5 and 6.7 points in recall. These results confirm the effectiveness and adaptability of the framework. The model is a practical and scalable solution for UAV-based transmission-line inspection and defect monitoring.
Problem

Research questions and friction points this paper is trying to address.

insulator defect detection
UAV imagery
class imbalance
small object detection
transmission-line inspection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Attention-Guided Autoencoder
Defect Detection
Class Imbalance
Feature Pyramid Network
Variance-Maximizing Regularization
🔎 Similar Papers
No similar papers found.