GPG-HT: Generalized Policy Gradient with History-Aware Decision Transformer for Probabilistic Path Planning

📅 2025-08-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Reliable path planning in urban traffic is challenged by the coexistence of stochasticity and spatiotemporal dependencies. Method: This paper proposes a novel approach integrating a history-aware Decision Transformer with a Generalized Policy Gradient (GPG) framework. It is the first to adapt the Decision Transformer to traffic routing—explicitly modeling long-range temporal dependencies and state uncertainty in traffic flow—while leveraging GPG for end-to-end policy optimization enabling probabilistic path inference from historical traffic sequences. Results: Experiments on the Sioux Falls network demonstrate that the method significantly improves on-time arrival probability, outperforming existing deterministic and time-varying models. It achieves both high decision accuracy and robustness in complex, stochastic road networks.

Technology Category

Application Category

📝 Abstract
With the rapidly increased number of vehicles in urban areas, existing road infrastructure struggles to accommodate modern traffic demands, resulting in the issue of congestion. This highlights the importance of efficient path planning strategies. However, most recent navigation models focus solely on deterministic or time-dependent networks, while overlooking the correlations and the stochastic nature of traffic flows. In this work, we address the reliable shortest path problem within stochastic transportation networks under certain dependencies. We propose a path planning solution that integrates the decision Transformer with the Generalized Policy Gradient (GPG) framework. Based on the decision Transformer's capability to model long-term dependencies, our proposed solution improves the accuracy and stability of path decisions. Experimental results on the Sioux Falls Network (SFN) demonstrate that our approach outperforms previous baselines in terms of on-time arrival probability, providing more accurate path planning solutions.
Problem

Research questions and friction points this paper is trying to address.

Addresses reliable shortest path in stochastic transportation networks
Integrates decision Transformer with Generalized Policy Gradient framework
Improves accuracy and stability of path planning decisions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generalized Policy Gradient with Decision Transformer integration
Models long-term dependencies for accurate path planning
Improves on-time arrival probability in stochastic networks
🔎 Similar Papers
No similar papers found.
X
Xing Wei
Sichuan University, Chengdu, China
Yuqi Ouyang
Yuqi Ouyang
Sichuan University
Computer Vision