Quantum-Inspired Reinforcement Learning for Low-Latency Intrusion Detection in V2X and Internet-of-Vehicles Networks

📅 2026-06-05

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This study addresses the limitations of traditional static defenses in vehicular networks, which fail to counter dynamic, multi-stage cyberattacks due to insufficient adaptability and high response latency. To overcome these challenges, the authors propose the QIRL framework, which uniquely integrates amplitude-phase quantum state encoding, a rotation gate-based exploration mechanism, and quantum interference-enhanced reward shaping into vehicular network security. The approach combines a lightweight deep Q-network with a cost-sensitive Markov decision process to model temporal attack dependencies and incorporates SMOTE oversampling to mitigate class imbalance. Evaluated on the CICIDS2017 and UNSW-NB15 datasets, QIRL achieves accuracy rates of 97.89% and 91.04%, F1 scores of 95.22% and 91.66%, and AUC-ROC values of 0.9945 and 0.9713, respectively, with per-sample inference latencies of only 32.5 and 45.7 microseconds—over 50 times faster than baseline methods.

📝 Abstract

Smart cities increasingly depend on dense edge, IoT, and vehicular networks to deliver critical urban services, including traffic control, connected mobility, infrastructure monitoring, and energy management. In this ecosystem, the Internet of Vehicles (IoV) is central to intelligent transportation, enabling continuous communication among vehicles, roadside infrastructure, and cloud-edge platforms. This connectivity, however, also enlarges the attack surface and exposes smart city and vehicular systems to evolving cyber threats that can compromise safety, privacy, data integrity, and service continuity. Conventional static defenses are often inadequate because they cannot autonomously adapt to changing attack behaviors or multi-stage intrusion patterns. This paper proposes QIRL, a Quantum-Inspired Reinforcement Learning framework built on a lightweight Deep Q-Network architecture for next-generation autonomous cyber defense. QIRL combines amplitude-phase quantum state encoding, rotation-gate-based exploration, and quantum interference reward augmentation within a cost-sensitive Markov Decision Process formulation. It further addresses class imbalance through training-only SMOTE balancing and asymmetric cost-sensitive reward shaping, while sequential MDP modeling captures temporal dependencies in multi-stage attack campaigns. The framework is evaluated on CICIDS2017 and UNSW-NB15. QIRL achieves accuracies of 97.89\% and 91.04\%, F1-scores of 95.22\% and 91.66\%, AUC-ROC values of 0.9945 and 0.9713, and True Skill Statistics of 0.9443 and 0.8244, respectively. It also attains ultra-low inference latencies of 32.5 and 45.7 microseconds per sample, corresponding to 67.77 times and 51.77 times speedups over ensemble baselines. These results show that QIRL offers a lightweight, latency-aware, and adaptive defense for smart city and IoV infrastructures.

Problem

Research questions and friction points this paper is trying to address.

Intrusion Detection

Internet-of-Vehicles

Cyber Threats

Low-Latency

Adaptive Defense

Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantum-Inspired Reinforcement Learning

Low-Latency Intrusion Detection

Internet of Vehicles