TeraSim-World: Worldwide Safety-Critical Data Synthesis for End-to-End Autonomous Driving

📅 2025-09-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the scarcity of real safety-critical data, high cost and risk of on-road testing, and significant sim-to-real domain gap in end-to-end autonomous driving, this paper proposes the first fully pipeline framework for synthesizing safety-critical scenarios adaptable to arbitrary geographic locations. Methodologically, it integrates high-definition maps, naturalistic driving trajectories, and traffic demand modeling to construct a geography-aware adversarial scenario generation pipeline; it further innovatively incorporates the Cosmos-Drive video generation model to achieve street-level semantic alignment and photorealistic sensor rendering. Experiments generate high-fidelity synthetic data spanning diverse climates, terrains, and traffic densities. Results demonstrate substantial reduction in the sim-to-real gap across multiple benchmarks—e.g., detection error reduced by 32.7%—effectively supporting end-to-end model training and robustness validation.

Technology Category

Application Category

📝 Abstract
Safe and scalable deployment of end-to-end (E2E) autonomous driving requires extensive and diverse data, particularly safety-critical events. Existing data are mostly generated from simulators with a significant sim-to-real gap or collected from on-road testing that is costly and unsafe. This paper presents TeraSim-World, an automated pipeline that synthesizes realistic and geographically diverse safety-critical data for E2E autonomous driving at anywhere in the world. Starting from an arbitrary location, TeraSim-World retrieves real-world maps and traffic demand from geospatial data sources. Then, it simulates agent behaviors from naturalistic driving datasets, and orchestrates diverse adversities to create corner cases. Informed by street views of the same location, it achieves photorealistic, geographically grounded sensor rendering via the frontier video generation model Cosmos-Drive. By bridging agent and sensor simulations, TeraSim-World provides a scalable and critical~data synthesis framework for training and evaluation of E2E autonomous driving systems.
Problem

Research questions and friction points this paper is trying to address.

Synthesizes realistic safety-critical data for autonomous driving
Bridges sim-to-real gap with geographically diverse scenarios
Generates photorealistic corner cases for E2E system evaluation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automated pipeline synthesizes realistic safety-critical data
Simulates agent behaviors from naturalistic driving datasets
Achieves photorealistic rendering via Cosmos-Drive model
🔎 Similar Papers
No similar papers found.
J
Jiawei Wang
University of Michigan
Haowei Sun
Haowei Sun
University of Michigan
Intelligent Transportation
Xintao Yan
Xintao Yan
Assistant Professor, The University of Hong Kong
Intelligent VehiclesSimulationDriver BehaviorAI Safety
S
Shuo Feng
Tsinghua University
J
Jun Gao
University of Michigan, NVIDIA
H
Henry X. Liu
University of Michigan