🤖 AI Summary
To address low inspection efficiency and insufficient recognition accuracy of critical components in power grid巡检, this work proposes a Transformer-based 3D semantic segmentation method—first systematically applied to LiDAR point cloud-based power line inspection. Built upon the densely annotated TS40K dataset, our approach integrates robust data augmentation with a Transformer architecture specifically designed to handle point cloud class imbalance and noise, enabling end-to-end, component-level 3D semantic understanding of transmission lines, towers, and other infrastructure elements. Experiments demonstrate an IoU of 95.53% for transmission line detection, substantially outperforming existing methods. This work bridges the gap in intelligent power infrastructure perception—from image-level analysis to fine-grained, component-aware 3D understanding—and provides a practical technical foundation for proactive risk预警 and autonomous grid operation and maintenance.
📝 Abstract
Ensuring the safety and reliability of power grids is critical as global energy demands continue to rise. Traditional inspection methods, such as manual observations or helicopter surveys, are resource-intensive and lack scalability. This paper explores the use of 3D computer vision to automate power grid inspections, utilizing the TS40K dataset -- a high-density, annotated collection of 3D LiDAR point clouds. By concentrating on 3D semantic segmentation, our approach addresses challenges like class imbalance and noisy data to enhance the detection of critical grid components such as power lines and towers. The benchmark results indicate significant performance improvements, with IoU scores reaching 95.53% for the detection of power lines using transformer-based models. Our findings illustrate the potential for integrating ML into grid maintenance workflows, increasing efficiency and enabling proactive risk management strategies.