A Lyapunov-Guided Diffusion-Based Reinforcement Learning Approach for UAV-Assisted Vehicular Networks with Delayed CSI Feedback

📅 2025-07-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the problem of optimizing communication rate in UAV-assisted vehicular networks under channel state information (CSI) feedback delay and long-term energy constraints. To this end, we propose a joint resource management framework integrating Lyapunov optimization with diffusion models, enabling slot-level coordinated decisions on channel allocation, transmit power, and UAV flight altitude. Our key contribution is a novel deep deterministic policy gradient (D3PG) algorithm incorporating a generative diffusion prior, which decomposes the global energy constraint into instantaneous, tractable local control objectives while ensuring long-term UAV energy sustainability. Extensive simulations based on real-world vehicle trajectories demonstrate that the proposed method significantly outperforms existing baselines in both aggregate communication rate and energy efficiency. The framework establishes a scalable, intelligent resource orchestration paradigm for highly dynamic air-ground integrated vehicular communications in the emerging low-altitude economy.

Technology Category

Application Category

📝 Abstract
Low altitude uncrewed aerial vehicles (UAVs) are expected to facilitate the development of aerial-ground integrated intelligent transportation systems and unlocking the potential of the emerging low-altitude economy. However, several critical challenges persist, including the dynamic optimization of network resources and UAV trajectories, limited UAV endurance, and imperfect channel state information (CSI). In this paper, we offer new insights into low-altitude economy networking by exploring intelligent UAV-assisted vehicle-to-everything communication strategies aligned with UAV energy efficiency. Particularly, we formulate an optimization problem of joint channel allocation, power control, and flight altitude adjustment in UAV-assisted vehicular networks. Taking CSI feedback delay into account, our objective is to maximize the vehicle-to-UAV communication sum rate while satisfying the UAV's long-term energy constraint. To this end, we first leverage Lyapunov optimization to decompose the original long-term problem into a series of per-slot deterministic subproblems. We then propose a diffusion-based deep deterministic policy gradient (D3PG) algorithm, which innovatively integrates diffusion models to determine optimal channel allocation, power control, and flight altitude adjustment decisions. Through extensive simulations using real-world vehicle mobility traces, we demonstrate the superior performance of the proposed D3PG algorithm compared to existing benchmark solutions.
Problem

Research questions and friction points this paper is trying to address.

Optimize UAV-assisted vehicular networks with delayed CSI feedback
Maximize communication sum rate under UAV energy constraints
Develop diffusion-based reinforcement learning for resource and trajectory optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lyapunov optimization for problem decomposition
Diffusion-based D3PG algorithm for decision-making
Joint channel, power, and altitude optimization
🔎 Similar Papers
No similar papers found.
Zhang Liu
Zhang Liu
University of Colorado Boulder
Distributed SystemsNetworkingCloud ComputingStorage
L
Lianfen Huang
School of Information Science and Technology, Xiamen University Tan Kah Kee College, and Department of Informatics and Communication Engineering, Xiamen University, Fujian 361102, China
Z
Zhibin Gao
Navigation Institute, Jimei University, Xiamen, Fujian, China 361021
X
Xianbin Wang
Department of Electrical and Computer Engineering, Western University, London, Ontario N6A 5B9, Canada
D
Dusit Niyato
College of Computing and Data Science, Nanyang Technological University, Singapore
X
Xuemin Shen
Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada