Deep Learning Based Concurrency Bug Detection and Localization

📅 2025-08-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Concurrency bugs—arising from improper synchronization of shared resources—pose severe reliability threats to multithreaded and distributed systems, yet remain notoriously difficult to detect and localize precisely. This paper introduces the first deep learning framework for concurrent bug detection and code-level precise localization. First, we construct a large-scale, diverse, and domain-specific dataset of concurrency bugs. Second, we design a concurrency-aware Code Property Graph (CCPG) and an associated heterogeneous Graph Neural Network (GNN) that explicitly models critical semantic features, including thread interactions, lock ordering, and data races. Third, we integrate SubgraphX, a model-agnostic explainability technique, to enable end-to-end transition from binary classification to line-level fault localization. Experimental evaluation demonstrates that our approach achieves average improvements of 10% in accuracy and precision, and 26% in recall over state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
Concurrency bugs, caused by improper synchronization of shared resources in multi-threaded or distributed systems, are notoriously hard to detect and thus compromise software reliability and security. The existing deep learning methods face three main limitations. First, there is an absence of large and dedicated datasets of diverse concurrency bugs for them. Second, they lack sufficient representation of concurrency semantics. Third, binary classification results fail to provide finer-grained debug information such as precise bug lines. To address these problems, we propose a novel method for effective concurrency bug detection as well as localization. We construct a dedicated concurrency bug dataset to facilitate model training and evaluation. We then integrate a pre-trained model with a heterogeneous graph neural network (GNN), by incorporating a new Concurrency-Aware Code Property Graph (CCPG) that concisely and effectively characterizes concurrency semantics. To further facilitate debugging, we employ SubgraphX, a GNN-based interpretability method, which explores the graphs to precisely localize concurrency bugs, mapping them to specific lines of source code. On average, our method demonstrates an improvement of 10% in accuracy and precision and 26% in recall compared to state-of-the-art methods across diverse evaluation settings.
Problem

Research questions and friction points this paper is trying to address.

Detecting concurrency bugs in multi-threaded systems
Localizing bugs to specific source code lines
Addressing limitations in semantic representation and datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constructs dedicated concurrency bug dataset
Integrates pre-trained model with GNN
Employs SubgraphX for precise bug localization
🔎 Similar Papers
No similar papers found.
Z
Zuocheng Feng
The Key Laboratory of Embedded System and Service Computing of Ministry of Education, Tongji University, 201804, Shanghai, China
Kaiwen Zhang
Kaiwen Zhang
Associate Professor, Software/IT Engineering, École de technologie supérieure
blockchainsdistributed systemspublish/subscribeonline gamesmiddleware
Miaomiao Wang
Miaomiao Wang
IBM
CMOS Technology
Yiming Cheng
Yiming Cheng
Tsinghua University
machine learningnetwork systemsdata miningrecommendation systems
Y
Yuandao Cai
Hong Kong University of Science and Technology, Hong Kong University, 999077, Hong Kong, China
X
Xiaofeng Li
The Space Optoelectronic Measurement and Perception Lab, Beijing Institute of Control Engineering, Street, 100086, Beijing, China
G
Guanjun Liu
The Key Laboratory of Embedded System and Service Computing of Ministry of Education, Tongji University, 201804, Shanghai, China