S2R-Bench: A Sim-to-Real Evaluation Benchmark for Autonomous Driving

📅 2025-05-24
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing autonomous driving perception benchmarks rely heavily on simulation, failing to capture real-world degradation scenarios—such as extreme weather and sensor failures—leading to a severe sim-to-real performance gap. Method: We introduce S2R-Bench, the first real-scenario-oriented perception robustness benchmark: (i) it establishes a sim-to-real evaluation framework grounded in real-vehicle-collected camera/LiDAR corruption data; (ii) it systematically models four-dimensional correlated degradations—temporal, weather, illumination, and road conditions; and (iii) it proposes a consistency analysis framework bridging simulation and real-world results, alongside a standardized corruption protocol and open-source evaluation toolkit (GitHub). Results: Experiments reveal that mainstream perception models suffer an average 32.7% mAP drop under real-world corruptions, exposing critical deployment risks. S2R-Bench provides a reproducible, comparable, and operationally relevant safety evaluation standard for perception algorithms.

Technology Category

Application Category

📝 Abstract
Safety is a long-standing and the final pursuit in the development of autonomous driving systems, with a significant portion of safety challenge arising from perception. How to effectively evaluate the safety as well as the reliability of perception algorithms is becoming an emerging issue. Despite its critical importance, existing perception methods exhibit a limitation in their robustness, primarily due to the use of benchmarks are entierly simulated, which fail to align predicted results with actual outcomes, particularly under extreme weather conditions and sensor anomalies that are prevalent in real-world scenarios. To fill this gap, in this study, we propose a Sim-to-Real Evaluation Benchmark for Autonomous Driving (S2R-Bench). We collect diverse sensor anomaly data under various road conditions to evaluate the robustness of autonomous driving perception methods in a comprehensive and realistic manner. This is the first corruption robustness benchmark based on real-world scenarios, encompassing various road conditions, weather conditions, lighting intensities, and time periods. By comparing real-world data with simulated data, we demonstrate the reliability and practical significance of the collected data for real-world applications. We hope that this dataset will advance future research and contribute to the development of more robust perception models for autonomous driving. This dataset is released on https://github.com/adept-thu/S2R-Bench.
Problem

Research questions and friction points this paper is trying to address.

Evaluating safety and reliability of autonomous driving perception algorithms
Addressing robustness gaps in simulated perception benchmarks
Providing real-world scenario data for autonomous driving perception testing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Sim-to-Real benchmark for autonomous driving perception
Diverse real-world sensor anomaly data collection
Comprehensive evaluation under extreme conditions
🔎 Similar Papers
No similar papers found.
L
Li Wang
School of Machanical Engineering, Beijing Institute of Technology, Beijing 100081, China
G
Guangqi Yang
College of Computer Science and Technology, Jilin University, Changchun 130012, China
L
Lei Yang
School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore 639798, Singapore
Ziying Song
Ziying Song
Beijing Jiaotong University
Object DetectionComputer VisionDeep Learning
X
Xinyu Zhang
School of Vehicle and Mobility and Suzhou Automotive Research Institute, Tsinghua University, Beijing 100084, China, and also with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 119077, Singapore
Y
Ying Chen
School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China
L
Lin Liu
School of Computer and Information Technology, Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China
Junjie Gao
Junjie Gao
MBZUAI NLP Msc
NLP Agent LLM
Z
Zhiwei Li
College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
Qingshan Yang
Qingshan Yang
Beijing Jingwei Hirain Technologies Co., Inc., Beijing 100191, China
J
Jun Li
School of Vehicle and Mobility, Tsinghua University, Beijing 100084, China
Liangliang Wang
Liangliang Wang
School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan 430070, China
W
Wenhao Yu
Beiiing Tsinghua Institute for Frontier Interdisciplinary Innovation, Beijing 100084, China
B
Bin Xu
School of Machanical Engineering, Beijing Institute of Technology, Beijing 100081, China
W
Weida Wang
School of Machanical Engineering, Beijing Institute of Technology, Beijing 100081, China
Huaping Liu
Huaping Liu
Professor of Electrical Engineering, Oregon State University
Communication theorywireless communicationssignal processingsensor networksinformation security