🤖 AI Summary
This study addresses the challenge of collaborative path optimization for autonomous vehicles (AVs) in mixed traffic environments with human-driven vehicles. We propose a distributed dynamic routing framework that integrates multi-agent reinforcement learning (MARL) with microscopic traffic simulation. Our approach uniquely co-simulates human driver behavior—modeled via a Logit model—with AV policies trained using MAPPO, enabling, for the first time, collaborative path optimization in human-AI mixed traffic flows. The framework is implemented within the SUMO simulation platform, supporting real-time, decentralized decision-making for AVs on realistic urban road networks. Experimental results on a representative city-scale network demonstrate a 12.3% reduction in global average travel time and a 19.7% improvement in individual AV throughput. These findings validate the effectiveness, robustness, and scalability of our distributed collaborative routing approach in complex, heterogeneous traffic scenarios.
📝 Abstract
RouteRL is a novel framework that integrates multi-agent reinforcement learning (MARL) with a microscopic traffic simulation, facilitating the testing and development of efficient route choice strategies for autonomous vehicles (AVs). The proposed framework simulates the daily route choices of driver agents in a city, including two types: human drivers, emulated using behavioral route choice models, and AVs, modeled as MARL agents optimizing their policies for a predefined objective. RouteRL aims to advance research in MARL, transport modeling, and human-AI interaction for transportation applications. This study presents a technical report on RouteRL, outlines its potential research contributions, and showcases its impact via illustrative examples.