🤖 AI Summary
This work addresses the limited generalization of traditional modular autonomous driving systems in long-tail scenarios by proposing and validating a novel end-to-end Large Driving Model (LDM) paradigm. The LDM directly learns driving policies from raw multimodal sensor inputs, unifying perception, planning, and control within a single neural architecture. This approach redefines the human role as a safety supervisor and establishes a new product paradigm for L2++ autonomous driving. Empirical evidence from real-world deployments—such as Tesla’s Full Self-Driving (FSD) V12/V14 and Rivian’s Unified Intelligence—demonstrates the framework’s superior generalization capabilities in complex driving environments. These advances have already prompted multiple automakers to plan mass production deployment starting in 2026.
📝 Abstract
Autonomous driving is undergoing a shift from modular rule based pipelines toward end to end (E2E) learning systems. This paper examines this transition by tracing the evolution from classical sense perceive plan control architectures to large driving models (LDMs) capable of mapping raw sensor input directly to driving actions. We analyze recent developments including Tesla's Full Self Driving (FSD) V12 V14, Rivian's Unified Intelligence platform, NVIDIA Cosmos, and emerging commercial robotaxi deployments, focusing on architectural design, deployment strategies, safety considerations and industry implications. A key emerging product category is supervised E2E driving, often referred to as FSD (Supervised) or L2 plus plus, which several manufacturers plan to deploy from 2026 onwards. These systems can perform most of the Dynamic Driving Task (DDT) in complex environments while requiring human supervision, shifting the driver's role to safety oversight. Early operational evidence suggests E2E learning handles the long tail distribution of real world driving scenarios and is becoming a dominant commercial strategy. We also discuss how similar architectural advances may extend beyond autonomous vehicles (AV) to other embodied AI systems, including humanoid robotics.