DriveCode: Domain Specific Numerical Encoding for LLM-Based Autonomous Driving

๐Ÿ“… 2026-02-28
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Large language models (LLMs) struggle in autonomous driving tasks due to the discretization of numerical values into textual tokens, which fails to preserve the semantic ordering of digits and consequently undermines numerical reasoning accuracy and control command precision. To address this limitation, this work proposes DriveCodeโ€”a domain-specific numerical encoding method tailored for autonomous driving. DriveCode introduces dedicated numerical embeddings and a numerical projector that directly map continuous numerical values into the LLMโ€™s latent space, enabling seamless fusion with multimodal features. Evaluated on the OmniDrive, DriveGPT4, and DriveGPT4-V2 datasets, the proposed approach significantly improves trajectory prediction and control command generation, demonstrating its effectiveness and superiority in LLM-driven autonomous driving systems.

Technology Category

Application Category

๐Ÿ“ Abstract
Large language models (LLMs) have shown great promise for autonomous driving. However, discretizing numbers into tokens limits precise numerical reasoning, fails to reflect the positional significance of digits in the training objective, and makes it difficult to achieve both decoding efficiency and numerical precision. These limitations affect both the processing of sensor measurements and the generation of precise control commands, creating a fundamental barrier for deploying LLM-based autonomous driving systems. In this paper, we introduce DriveCode, a novel numerical encoding method that represents numbers as dedicated embeddings rather than discrete text tokens. DriveCode employs a number projector to map numbers into the language model's hidden space, enabling seamless integration with visual and textual features in a unified multimodal sequence. Evaluated on OmniDrive, DriveGPT4, and DriveGPT4-V2 datasets, DriveCode demonstrates superior performance in trajectory prediction and control signal generation, confirming its effectiveness for LLM-based autonomous driving systems.
Problem

Research questions and friction points this paper is trying to address.

numerical encoding
large language models
autonomous driving
tokenization
numerical reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

numerical encoding
large language models
autonomous driving
multimodal integration
number projector
๐Ÿ”Ž Similar Papers
No similar papers found.
Z
Zhiye Wang
School of Information Science and Engineering, Lanzhou University, Lanzhou, China
Yanbo Jiang
Yanbo Jiang
Tsinghua University
autonomous vehicle
R
Rui Zhou
School of Information Science and Engineering, Lanzhou University, Lanzhou, China
Bo Zhang
Bo Zhang
Tsinghua University
protein designbioinformatics
Fang Zhang
Fang Zhang
Alibaba Quantum Laboratory
quantum computation
Z
Zhenhua Xu
The School of Vehicle and Mobility, Tsinghua University, Beijing, China
Y
Yaqin Zhang
The Institute for AI Industry Research (AIR), Tsinghua University, Beijing, China
Jianqiang Wang
Jianqiang Wang
Associate Professor of Library and Information Studies, University at Buffalo
Information Retrievale-discovery