CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting

📅 2025-03-10

📈 Citations: 0

✨ Influential: 0

career value

155K/year

🤖 AI Summary

To address the joint requirements of accuracy, robustness, and real-time performance for autonomous driving motion prediction in complex traffic scenarios, this paper proposes the first lightweight motion prediction framework integrating large language models (LLMs) with chain-of-thought (CoT) prompting. Methodologically, it introduces: (1) a novel fine-tuning-free CoT semantic annotation generation paradigm that automatically produces high-quality traffic semantic labels; (2) knowledge distillation to transfer LLM-based scene understanding capabilities into an edge-deployable lightweight language model; and (3) Highway-Text and Urban-Text—the first publicly available text datasets tailored for traffic scene description. Evaluated on five real-world benchmarks, our approach surpasses state-of-the-art methods, reducing prediction error by 12.6%–23.4% while maintaining inference latency under 80 ms—meeting stringent real-time constraints for onboard edge devices.

Technology Category

Application Category

📝 Abstract

Accurate motion forecasting is crucial for safe autonomous driving (AD). This study proposes CoT-Drive, a novel approach that enhances motion forecasting by leveraging large language models (LLMs) and a chain-of-thought (CoT) prompting method. We introduce a teacher-student knowledge distillation strategy to effectively transfer LLMs' advanced scene understanding capabilities to lightweight language models (LMs), ensuring that CoT-Drive operates in real-time on edge devices while maintaining comprehensive scene understanding and generalization capabilities. By leveraging CoT prompting techniques for LLMs without additional training, CoT-Drive generates semantic annotations that significantly improve the understanding of complex traffic environments, thereby boosting the accuracy and robustness of predictions. Additionally, we present two new scene description datasets, Highway-Text and Urban-Text, designed for fine-tuning lightweight LMs to generate context-specific semantic annotations. Comprehensive evaluations of five real-world datasets demonstrate that CoT-Drive outperforms existing models, highlighting its effectiveness and efficiency in handling complex traffic scenarios. Overall, this study is the first to consider the practical application of LLMs in this field. It pioneers the training and use of a lightweight LLM surrogate for motion forecasting, setting a new benchmark and showcasing the potential of integrating LLMs into AD systems.

Problem

Research questions and friction points this paper is trying to address.

Enhance motion forecasting for autonomous driving using LLMs and CoT prompting.

Transfer LLMs' scene understanding to lightweight models for real-time edge device operation.

Improve prediction accuracy and robustness in complex traffic environments with semantic annotations.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses LLMs with CoT prompting for motion forecasting

Implements teacher-student knowledge distillation strategy

Introduces Highway-Text and Urban-Text datasets

🔎 Similar Papers

Large Language Models for Mobility Analysis in Transportation Systems: A Survey on Forecasting Tasks