Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving

📅 2024-10-11
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenges of tight coupling, high latency, and frequent handovers between V2I communication and autonomous driving decision-making in cooperative vehicle-infrastructure systems. We propose a joint optimization framework integrating Large Language Models (LLMs) and Deep Double Q-Networks (DDQN) in an iterative manner. Our method encodes LLM-generated semantic driving policies as structured V2I state inputs to enable cross-modal policy coordination; introduces a Euclidean distance–driven historical experience retrieval mechanism to enhance LLM’s online adaptability; and constructs a unified state space with an iterative policy optimization pipeline. Experimental results demonstrate that, compared to a pure DDQN baseline, our approach achieves a 37% faster convergence rate and a 22% higher average reward, while significantly improving communication stability, data throughput, and driving safety.

Technology Category

Application Category

📝 Abstract
Large language models (LLMs) have received considerable interest recently due to their outstanding reasoning and comprehension capabilities. This work explores applying LLMs to vehicular networks, aiming to jointly optimize vehicle-to-infrastructure (V2I) communications and autonomous driving (AD) policies. We deploy LLMs for AD decision-making to maximize traffic flow and avoid collisions for road safety, and a double deep Q-learning algorithm (DDQN) is used for V2I optimization to maximize the received data rate and reduce frequent handovers. In particular, for LLM-enabled AD, we employ the Euclidean distance to identify previously explored AD experiences, and then LLMs can learn from past good and bad decisions for further improvement. Then, LLM-based AD decisions will become part of states in V2I problems, and DDQN will optimize the V2I decisions accordingly. After that, the AD and V2I decisions are iteratively optimized until convergence. Such an iterative optimization approach can better explore the interactions between LLMs and conventional reinforcement learning techniques, revealing the potential of using LLMs for network optimization and management. Finally, the simulations demonstrate that our proposed hybrid LLM-DDQN approach outperforms the conventional DDQN algorithm, showing faster convergence and higher average rewards.
Problem

Research questions and friction points this paper is trying to address.

V2I Optimization
Autonomous Driving
Network Efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-DDQN
V2I-Communication-Optimization
Autonomous-Driving-Decision-Enhancement
🔎 Similar Papers
No similar papers found.
Zijiang Yan
Zijiang Yan
York University
LLMQuantum Machine LearningDiffusion ModelAIAutonomous Driving
H
Hao Zhou
Department of Computer Science, McGill University, Montréal, QC, H3A 0G4, Canada
Hina Tabassum
Hina Tabassum
Associate Professor, York University
6GDeep LearningResource AllocationPerformance Optimization
X
Xue Liu
Department of Computer Science, McGill University, Montréal, QC, H3A 0G4, Canada