🤖 AI Summary
This study addresses the limitation of conventional traffic signal control strategies that prioritize vehicular efficiency at the expense of multimodal equity. To this end, the authors propose the STDSH-MARL framework, a multi-agent reinforcement learning approach based on centralized training with decentralized execution. The method introduces a novel spatiotemporal two-stage hypergraph attention mechanism, which, for the first time in traffic signal control, explicitly models temporal hyperedges to capture complex interdependencies across time. Furthermore, it employs a hybrid discrete action space to jointly optimize signal phase selection and green duration allocation. Experimental results across five diverse scenarios demonstrate consistent and significant performance gains over state-of-the-art methods, with notable improvements in public transit prioritization. Ablation studies confirm that the incorporation of temporal hyperedges is a key contributor to the observed performance enhancement.
📝 Abstract
Human-centric traffic signal control in corridor networks must increasingly account for multimodal travelers, particularly high-occupancy public transportation, rather than focusing solely on vehicle-centric performance. This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning), a scalable multi-agent deep reinforcement learning framework that follows a centralized training and decentralized execution paradigm. The proposed method captures spatio-temporal dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. In addition, a hybrid discrete action space is introduced to jointly determine the next signal phase configuration and its corresponding green duration, enabling more adaptive signal timing decisions. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority. Compared with state-of-the-art baseline methods, the proposed approach achieves superior overall performance. Further ablation studies confirm the contribution of each component of STDSH-MARL, with temporal hyperedges identified as the most influential factor driving the observed performance gains.