Applications of deep reinforcement learning to urban transit network design

📅 2025-02-25

📈 Citations: 0

✨ Influential: 0

career value

190K/year

🤖 AI Summary

This study addresses the urban public transit network design optimization problem. We propose a synergistic framework integrating deep reinforcement learning (DRL) with metaheuristic algorithms. Specifically, route planning is formulated as a Markov decision process, and a neural policy network is trained to generate high-quality candidate routes. Innovatively, the DRL policy is embedded into both genetic algorithm (GA) and ant colony optimization (ACO) frameworks—guiding population evolution in GA and pheromone update in ACO. Empirical evaluation on Laval, Quebec, demonstrates that the resulting network fully satisfies all travel demand while reducing average passenger travel time by 12.3% and operational cost by 8.7%. The hybrid approach significantly outperforms standalone DRL or conventional metaheuristics, validating the effectiveness and practicality of the neuro-symbolic co-optimization paradigm.

Technology Category

Application Category

📝 Abstract

This thesis concerns the use of reinforcement learning to train neural networks to aid in the design of public transit networks. The Transit Network Design Problem (TNDP) is an optimization problem of considerable practical importance. Given a city with an existing road network and travel demands, the goal is to find a set of transit routes - each of which is a path through the graph - that collectively satisfy all demands, while minimizing a cost function that may depend both on passenger satisfaction and operating costs. The existing literature on this problem mainly considers metaheuristic optimization algorithms, such as genetic algorithms and ant-colony optimization. By contrast, we begin by taking a reinforcement learning approach, formulating the construction of a set of transit routes as a Markov Decision Process (MDP) and training a neural net policy to act as the agent in this MDP. We then show that, beyond using this policy to plan a transit network directly, it can be combined with existing metaheuristic algorithms, both to initialize the solution and to suggest promising moves at each step of a search through solution space. We find that such hybrid algorithms, which use a neural policy trained via reinforcement learning as a core component within a classical metaheuristic framework, can plan transit networks that are superior to those planned by either the neural policy or the metaheuristic algorithm. We demonstrate the utility of our approach by using it to redesign the transit network for the city of Laval, Quebec, and show that in simulation, the resulting transit network provides better service at lower cost than the existing transit network.

Problem

Research questions and friction points this paper is trying to address.

Optimizing public transit network design

Combining reinforcement learning with metaheuristics

Improving service and reducing costs in urban transit

Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning trains neural networks

Formulates transit routes as MDP

Hybrid algorithms enhance metaheuristic frameworks

🔎 Similar Papers

Learning Heuristics for Transit Network Design and Improvement with Deep Reinforcement Learning