UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

πŸ“… 2025-03-31
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper addresses the dual tasks of single-frame 3D occupancy prediction and multi-frame future occupancy forecasting in autonomous driving, proposing UniOccβ€”the first unified benchmark for both. UniOcc integrates multi-source real-world (nuScenes, Waymo) and simulation (CARLA, OpenCOOD) datasets, providing voxel-level 2D/3D occupancy annotations and optical flow labels, and supporting evaluation for both ego-vehicle and cooperative driving scenarios. Methodologically, it introduces explicit voxel-flow supervision to enhance temporal consistency, designs novel ground-truth-free evaluation metrics for occupancy forecasting, and establishes a cross-domain joint training framework. Its core innovation lies in achieving the first triple unification: across real/simulated data domains, between prediction and forecasting tasks, and between single-vehicle and collaborative perception settings. Experiments demonstrate consistent improvements: +6.2% average mIoU and βˆ’23% temporal coherence error over state-of-the-art methods.

Technology Category

Application Category

πŸ“ Abstract
We introduce UniOcc, a comprehensive, unified benchmark for occupancy forecasting (i.e., predicting future occupancies based on historical information) and current-frame occupancy prediction from camera images. UniOcc unifies data from multiple real-world datasets (i.e., nuScenes, Waymo) and high-fidelity driving simulators (i.e., CARLA, OpenCOOD), which provides 2D/3D occupancy labels with per-voxel flow annotations and support for cooperative autonomous driving. In terms of evaluation, unlike existing studies that rely on suboptimal pseudo labels for evaluation, UniOcc incorporates novel metrics that do not depend on ground-truth occupancy, enabling robust assessment of additional aspects of occupancy quality. Through extensive experiments on state-of-the-art models, we demonstrate that large-scale, diverse training data and explicit flow information significantly enhance occupancy prediction and forecasting performance.
Problem

Research questions and friction points this paper is trying to address.

Unified benchmark for occupancy forecasting and prediction in autonomous driving
Integrates multi-source data with 2D/3D occupancy and flow annotations
Proposes evaluation metrics independent of ground-truth occupancy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified benchmark for occupancy forecasting and prediction
Integrates multiple datasets with 2D/3D occupancy labels
Novel metrics for robust occupancy quality assessment
πŸ”Ž Similar Papers
No similar papers found.
Y
Yuping Wang
University of California, Riverside
X
Xiangyu Huang
University of Wisconsin, Madison
X
Xiaokang Sun
University of California, Riverside
Mingxuan Yan
Mingxuan Yan
University of California, Riverside
Shuo Xing
Shuo Xing
Texas A&M University
Large Language ModelsNatural Language ProcessingMachine Learning
Zhengzhong Tu
Zhengzhong Tu
Texas A&M University, Google Research, University of Texas at Austin
Agentic AITrustworthy AIEmbodied AI
J
Jiachen Li
University of California, Riverside