The Road Ahead in Autonomous Driving: The KITScenes Multimodal Dataset

📅 2026-06-01
📈 Citations: 0
Influential: 0
📄 PDF

career value

217K/year
🤖 AI Summary
Existing autonomous driving datasets are limited in sensor fidelity, high-definition (HD) map completeness, and geographic diversity. This work proposes KITScenes Multimodal, a novel dataset that, for the first time in a publicly available benchmark, achieves reprojection-level accuracy in mapping full-element 3D traffic infrastructure. It integrates high-resolution global-shutter cameras, LiDAR with over 400-meter detection range, 4D imaging radar, and redundant GNSS/INS systems to enable synchronized multimodal data acquisition. Captured across irregular road networks and mixed-traffic scenarios in Europe, the dataset significantly enhances geographic diversity and map completeness. The authors introduce four core benchmark tasks—online HD mapping, long-range depth estimation, novel view synthesis, and end-to-end driving—and demonstrate the dataset’s effectiveness through evaluations in open-source frameworks.
📝 Abstract
Existing autonomous driving datasets have enabled major progress, but fall short in sensor fidelity, map completeness, or geographic diversity. We present KITScenes Multimodal, a European dataset built around high-fidelity sensors and maps. Our fully synchronized sensor suite combines high-resolution global-shutter cameras, long-range lidar beyond 400m, 4D imaging radar, and redundant GNSS/INS localization. Our HD maps are, to our knowledge, the most complete of any sensor dataset, validated through autonomous driving trials on open-source software. For the first time in a public dataset, all driving-relevant traffic elements, such as traffic lights, are mapped in 3D to a reprojection-accurate level with full topological connectivity. Recorded in cities with irregular street layouts and mixed traffic modes, our dataset complements existing datasets by broadening the available geographic diversity. We also introduce four benchmarks, each advancing spatial learning for embodied AI: online HD map construction, long-range depth estimation, novel view synthesis, and end-to-end driving. Project page: https://kitscenes.com/
Problem

Research questions and friction points this paper is trying to address.

autonomous driving
sensor fidelity
map completeness
geographic diversity
multimodal dataset
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal sensor fusion
high-definition mapping
4D imaging radar
geographic diversity
reprojection-accurate 3D traffic elements
🔎 Similar Papers
No similar papers found.
R
Richard Schwarzkopf
FZI Research Center for Information Technology, Karlsruhe Institute of Technology
Fabian Immel
Fabian Immel
PhD Student, Karlsruhe Institute of Technology / Research Scientist at FZI Forschungszentrum
Autonomous SystemsMachine LearningComputer Vision
A
Alexander Blumberg
Karlsruhe Institute of Technology
J
Jonas Merkert
Karlsruhe Institute of Technology
N
Nils Rack
Karlsruhe Institute of Technology
K
Kaiwen Wang
Karlsruhe Institute of Technology
Fabian Konstantinidis
Fabian Konstantinidis
PhD Candidate, Cariad SE, Karlsruhe Institute of Technology
Motion PredictionTrajectory Prediction
J
Julian Truetsch
FZI Research Center for Information Technology, Karlsruhe Institute of Technology
C
Carlos Fernandez
Karlsruhe Institute of Technology
A
Annika Bätz
Karlsruhe Institute of Technology
Kevin Rösch
Kevin Rösch
FZI Forschungszentrum Informatik
Data augmentationAutomated drivingSynthetic data generation
Marlon Steiner
Marlon Steiner
Research Associate at KIT
motion planningmotion predictionmachine learningreinforcement learning
W
Willi Poh
Karlsruhe Institute of Technology
Yinzhe Shen
Yinzhe Shen
PhD student at KIT
end-to-end autonomous driving
Royden Wagner
Royden Wagner
PhD student at KIT
GenAIself-supervised learningmechanistic interpretability
Felix Hauser
Felix Hauser
PhD student, research assistant, Karlsruhe Institute of Technology (KIT)
Machine learningExplainable AI
D
Dominik Strutz
Karlsruhe Institute of Technology
J
Jaime Villa
University Charles III of Madrid
G
Gleb Stepanov
Karlsruhe Institute of Technology
Holger Caesar
Holger Caesar
Assistant Professor at TU Delft
Autonomous VehiclesComputer Vision
Ömer Şahin Taş
Ömer Şahin Taş
Karlsruhe Institute of Technology (KIT) & FZI Research Center for Information Technology
Reinforcement LearningSelf-Supervised LearningAutonomous Driving
Frank Bieder
Frank Bieder
PhD Student, Karlsruhe Institute of Technology / Research Scientist at FZI Forschungszentrum
Autonomous SystemsMachine LearningComputer Vision
Jan-Hendrik Pauls
Jan-Hendrik Pauls
Research Group Leader Mapping and Localization, Karlsruhe Institute of Technology (KIT)
MappingLocalizationMap Learning
Christoph Stiller
Christoph Stiller
Professor of Measurement and Control Systems, Karlsruher Institut für Technologie, KIT, FZI
intelligent vehiclesautonomous drivingautonomous vehiclesintelligent transportation systems