CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting

๐Ÿ“… 2025-03-10
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the joint requirements of accuracy, robustness, and real-time performance for autonomous driving motion prediction in complex traffic scenarios, this paper proposes the first lightweight motion prediction framework integrating large language models (LLMs) with chain-of-thought (CoT) prompting. Methodologically, it introduces: (1) a novel fine-tuning-free CoT semantic annotation generation paradigm that automatically produces high-quality traffic semantic labels; (2) knowledge distillation to transfer LLM-based scene understanding capabilities into an edge-deployable lightweight language model; and (3) Highway-Text and Urban-Textโ€”the first publicly available text datasets tailored for traffic scene description. Evaluated on five real-world benchmarks, our approach surpasses state-of-the-art methods, reducing prediction error by 12.6%โ€“23.4% while maintaining inference latency under 80 msโ€”meeting stringent real-time constraints for onboard edge devices.

Technology Category

Application Category

๐Ÿ“ Abstract
Accurate motion forecasting is crucial for safe autonomous driving (AD). This study proposes CoT-Drive, a novel approach that enhances motion forecasting by leveraging large language models (LLMs) and a chain-of-thought (CoT) prompting method. We introduce a teacher-student knowledge distillation strategy to effectively transfer LLMs' advanced scene understanding capabilities to lightweight language models (LMs), ensuring that CoT-Drive operates in real-time on edge devices while maintaining comprehensive scene understanding and generalization capabilities. By leveraging CoT prompting techniques for LLMs without additional training, CoT-Drive generates semantic annotations that significantly improve the understanding of complex traffic environments, thereby boosting the accuracy and robustness of predictions. Additionally, we present two new scene description datasets, Highway-Text and Urban-Text, designed for fine-tuning lightweight LMs to generate context-specific semantic annotations. Comprehensive evaluations of five real-world datasets demonstrate that CoT-Drive outperforms existing models, highlighting its effectiveness and efficiency in handling complex traffic scenarios. Overall, this study is the first to consider the practical application of LLMs in this field. It pioneers the training and use of a lightweight LLM surrogate for motion forecasting, setting a new benchmark and showcasing the potential of integrating LLMs into AD systems.
Problem

Research questions and friction points this paper is trying to address.

Enhance motion forecasting for autonomous driving using LLMs and CoT prompting.
Transfer LLMs' scene understanding to lightweight models for real-time edge device operation.
Improve prediction accuracy and robustness in complex traffic environments with semantic annotations.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses LLMs with CoT prompting for motion forecasting
Implements teacher-student knowledge distillation strategy
Introduces Highway-Text and Urban-Text datasets
๐Ÿ”Ž Similar Papers
No similar papers found.
H
Haicheng Liao
State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau
H
Hanlin Kong
State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau
Bonan Wang
Bonan Wang
Unknown affiliation
Trajectory Prediction
C
Chengyue Wang
State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau
W
Wang Ye
Department of Computer and Information Science, University of Macau, Macau
Zhengbing He
Zhengbing He
MIT
traffic flow theoryautonomous drivingurban mobilitymachine learning
C
Chengzhong Xu
State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau
Z
Zhenning Li
State Key Laboratory of Internet of Things for Smart City, University of Macau, Macau