URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles

📅 2025-05-23
🤖 AI Summary
Standardized, realistic city-scale benchmarks for evaluating reinforcement learning (RL) in collective routing of connected and autonomous vehicles (CAVs) are currently lacking. This paper introduces the first large-scale, real-world multi-agent RL (MARL) routing benchmark for urban road networks, encompassing 29 real-world traffic topologies and dynamic origin-destination demand patterns. It provides a standardized, scalable evaluation framework that integrates predefined tasks, four state-of-the-art MARL algorithms (e.g., MAPPO, QMix), three baseline methods, and domain-specific metrics, and it initiates the first public leaderboard for MARL urban routing. Using SUMO-based simulation and empirically grounded network modeling, the experiments show that current state-of-the-art methods rarely outperform human drivers in realistic city settings, which exposes critical scalability limitations. These findings provide concrete guidance for advancing distributed, cooperative routing algorithms in urban CAV systems.

📝 Abstract
Connected Autonomous Vehicles (CAVs) promise to reduce congestion in future urban networks, potentially by optimizing their routing decisions. Unlike for human drivers, these decisions can be made with collective, data-driven policies, developed by machine learning algorithms. Reinforcement learning (RL) can facilitate the development of such collective routing strategies, yet standardized and realistic benchmarks are missing. To that end, we present URB: Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles. URB is a comprehensive benchmarking environment that unifies evaluation across 29 real-world traffic networks paired with realistic demand patterns. URB comes with a catalog of predefined tasks, four state-of-the-art multi-agent RL (MARL) algorithm implementations, three baseline methods, domain-specific performance metrics, and a modular configuration scheme. Our results suggest that, despite the lengthy and costly training, state-of-the-art MARL algorithms rarely outperformed humans. Experimental results reported in this paper initiate the first leaderboard for MARL in large-scale urban routing optimization and reveal that current approaches struggle to scale, emphasizing the urgent need for advancements in this domain.
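
To make the collective routing setting concrete, here is a minimal toy sketch in Python. It is not URB's API, simulator, or metric suite: the two-route network, the BPR-style volume-delay function, and all names (`bpr_travel_time`, `mean_travel_time`, `best_split`) are assumptions made for illustration only. It shows that the fleet's mean travel time depends on how vehicles split across routes, which is the kind of quantity a collective routing policy would try to reduce.

```python
# Toy illustration only: a two-route network with congestion-dependent
# travel times. Nothing here comes from URB itself; every name and
# parameter is hypothetical.

def bpr_travel_time(free_flow_time, flow, capacity, alpha=0.15, beta=4):
    """BPR volume-delay function: travel time grows with flow/capacity."""
    return free_flow_time * (1 + alpha * (flow / capacity) ** beta)


def mean_travel_time(n_on_a, n_total, route_a, route_b):
    """Mean travel time when n_on_a vehicles take route A and the rest take B.

    Each route is a (free_flow_time, capacity) tuple.
    """
    n_on_b = n_total - n_on_a
    t_a = bpr_travel_time(route_a[0], n_on_a, route_a[1])
    t_b = bpr_travel_time(route_b[0], n_on_b, route_b[1])
    return (n_on_a * t_a + n_on_b * t_b) / n_total


def best_split(n_total, route_a, route_b):
    """Brute-force the number of vehicles on route A that minimizes the mean."""
    return min(
        range(n_total + 1),
        key=lambda n: mean_travel_time(n, n_total, route_a, route_b),
    )


if __name__ == "__main__":
    route_a = (10.0, 100)  # short route, low capacity (assumed values)
    route_b = (15.0, 300)  # longer route, high capacity (assumed values)
    n_total = 400
    n_star = best_split(n_total, route_a, route_b)
    print("all on A:", mean_travel_time(n_total, n_total, route_a, route_b))
    print("all on B:", mean_travel_time(0, n_total, route_a, route_b))
    print("best split:", n_star, mean_travel_time(n_star, n_total, route_a, route_b))
```

In a benchmark of the kind described above, a learned policy would replace the brute-force search, and a traffic simulator would replace the closed-form delay function.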
Problem

Research questions and friction points this paper is trying to address.

Lack of standardized benchmarks for RL in urban CAV routing
Need for scalable MARL algorithms to outperform human drivers
Absence of unified evaluation metrics for large-scale traffic networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Benchmark for RL-equipped CAVs routing
Unifies 29 real-world traffic networks
Includes MARL algorithms and baselines
A. Akman
Faculty of Mathematics and Computer Science, Jagiellonian University, Kraków, Poland
Anastasia Psarou
Jagiellonian University
Reinforcement learning
Michal Hoffmann
Faculty of Mathematics and Computer Science, Jagiellonian University, Kraków, Poland
Lukasz Gorczyca
Faculty of Mathematics and Computer Science, Jagiellonian University, Kraków, Poland
Lukasz Kowalski
Urban Policy Observatory, Institute of Urban and Regional Development, Warsaw, Poland
Grzegorz Jamróz
Faculty of Mathematics and Computer Science, Jagiellonian University, Kraków, Poland
Rafał Kucharski
Jagiellonian University - Group of Machine Learning Methods
urban mobility, transportation research, reinforcement learning, game theory, user equilibrium