A YOLO-Based Semi-Automated Labeling Approach to Improve Fault Detection Efficiency in Railroad Videos

πŸ“… 2025-04-01
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the high time consumption, error rate, and cost associated with manual annotation in railway video fault detection, this paper proposes a lightweight iterative YOLO-based semi-automatic annotation method. Starting from a small set of initial annotations, the method employs a closed-loop optimization cycle comprising model training, inference, and human correction. It supports both textual export of detection results and interactive, rapid annotation refinement. Unlike proprietary black-box commercial platforms, our open-source, lightweight solution ensures full transparency and ease of deployment. Experimental results show that per-image annotation time is reduced from 2–4 minutes to 30 seconds–2 minutes, while annotation error rates decrease significantly. Consequently, both human effort and temporal overhead are substantially reduced. The proposed framework establishes an efficient, scalable, and reproducible annotation paradigm for constructing large-scale railway fault detection datasets.

Technology Category

Application Category

πŸ“ Abstract
Manual labeling for large-scale image and video datasets is often time-intensive, error-prone, and costly, posing a significant barrier to efficient machine learning workflows in fault detection from railroad videos. This study introduces a semi-automated labeling method that utilizes a pre-trained You Only Look Once (YOLO) model to streamline the labeling process and enhance fault detection accuracy in railroad videos. By initiating the process with a small set of manually labeled data, our approach iteratively trains the YOLO model, using each cycle's output to improve model accuracy and progressively reduce the need for human intervention. To facilitate easy correction of model predictions, we developed a system to export YOLO's detection data as an editable text file, enabling rapid adjustments when detections require refinement. This approach decreases labeling time from an average of 2 to 4 minutes per image to 30 seconds to 2 minutes, effectively minimizing labor costs and labeling errors. Unlike costly AI based labeling solutions on paid platforms, our method provides a cost-effective alternative for researchers and practitioners handling large datasets in fault detection and other detection based machine learning applications.
Problem

Research questions and friction points this paper is trying to address.

Reducing manual labeling time for railroad video datasets
Improving fault detection accuracy with semi-automated YOLO
Providing cost-effective labeling alternative for large datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

YOLO-based semi-automated labeling for railroad videos
Iterative training with minimal manual intervention
Editable text export for rapid prediction correction
πŸ”Ž Similar Papers
No similar papers found.
D
Dylan Lester
Department of Computer Sciences and Electrical Engineering (CSEE), and Perception Intelligence Networks Group (PiNG), Marshall University
J
James Gao
Department of Computer Sciences and Electrical Engineering (CSEE), and Perception Intelligence Networks Group (PiNG), Marshall University
S
Samuel Sutphin
Department of Computer Sciences and Electrical Engineering (CSEE), and Perception Intelligence Networks Group (PiNG), Marshall University
Pingping Zhu
Pingping Zhu
Assistant Professor, Marshall University
Signal ProcessingMachine LearningNeural NetworksIntelligent ControlMulti-Agent Systems
H
Husnu Narman
Department of Computer Sciences and Electrical Engineering (CSEE), Marshall University
Ammar Alzarrad
Ammar Alzarrad
Marshall University
Engineering Management