π€ AI Summary
To address routing challenges in vehicular ad hoc networks (VANETs) under dynamic topologies, partial observability, and edge resource constraints, this paper proposes a lightweight reinforcement learning framework tailored for edge AI deployment. Methodologically, it innovatively integrates action-space pruning, graph cross-attention mechanisms, and trajectory-aware prediction to achieve strong generalization against incomplete observations and rapid topology changes; additionally, it incorporates intersection-level semantic information and historical trajectory modeling to enable cross-city map transferability. Experimental results on real-world urban maps demonstrate that the approach achieves near-optimal shortest-path performance, significantly outperforms baseline methods in packet delivery ratio, reduces inference latency by 42%, and compresses the model size to 1.8 MBβenabling efficient deployment on resource-constrained embedded edge devices.
π Abstract
Vehicular ad hoc networks (VANETs) are a crucial component of intelligent transportation systems; however, routing remains challenging due to dynamic topologies, incomplete observations, and the limited resources of edge devices. Existing reinforcement learning (RL) approaches often assume fixed graph structures and require retraining when network conditions change, making them unsuitable for deployment on constrained hardware. We present TrajAware, an RL-based framework designed for edge AI deployment in VANETs. TrajAware integrates three components: (i) action space pruning, which reduces redundant neighbour options while preserving two-hop reachability, alleviating the curse of dimensionality; (ii) graph cross-attention, which maps pruned neighbours to the global graph context, producing features that generalise across diverse network sizes; and (iii) trajectory-aware prediction, which uses historical routes and junction information to estimate real-time positions under partial observations. We evaluate TrajAware in the open-source SUMO simulator using real-world city maps with a leave-one-city-out setup. Results show that TrajAware achieves near-shortest paths and high delivery ratios while maintaining efficiency suitable for constrained edge devices, outperforming state-of-the-art baselines in both full and partial observation scenarios.