GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving

📅 2025-11-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of representing irregular obstacles with 3D bounding boxes in autonomous driving, this paper proposes GUIDE—a unified framework leveraging 3D Gaussian ellipsoids for both instance detection and occupancy prediction. Methodologically, we introduce a sparse Gaussian-to-voxel projection strategy that preserves accuracy while substantially reducing computational overhead; further, we integrate instance-level occupancy prediction with multi-frame Gaussian parameter association to achieve robust 3D instance tracking. On the nuScenes benchmark, GUIDE achieves an instance occupancy mAP of 21.61—50% higher than the state-of-the-art—while also delivering significant gains in detection and tracking performance. Key contributions include: (1) the first unified modeling of 3D Gaussians across perception tasks; (2) a novel sparse Gaussian–voxel projection mechanism; and (3) an end-to-end trainable joint framework for instance occupancy prediction and tracking.

Technology Category

Application Category

📝 Abstract
In the realm of autonomous driving, accurately detecting surrounding obstacles is crucial for effective decision-making. Traditional methods primarily rely on 3D bounding boxes to represent these obstacles, which often fail to capture the complexity of irregularly shaped, real-world objects. To overcome these limitations, we present GUIDE, a novel framework that utilizes 3D Gaussians for instance detection and occupancy prediction. Unlike conventional occupancy prediction methods, GUIDE also offers robust tracking capabilities. Our framework employs a sparse representation strategy, using Gaussian-to-Voxel Splatting to provide fine-grained, instance-level occupancy data without the computational demands associated with dense voxel grids. Experimental validation on the nuScenes dataset demonstrates GUIDE's performance, with an instance occupancy mAP of 21.61, marking a 50% improvement over existing methods, alongside competitive tracking capabilities. GUIDE establishes a new benchmark in autonomous perception systems, effectively combining precision with computational efficiency to better address the complexities of real-world driving environments.
Problem

Research questions and friction points this paper is trying to address.

Detecting irregularly shaped obstacles in autonomous driving using 3D Gaussians
Providing instance-level occupancy prediction without dense voxel computation
Overcoming limitations of traditional 3D bounding box obstacle representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses 3D Gaussians for instance detection
Employs Gaussian-to-Voxel Splatting strategy
Provides instance-level occupancy with tracking
🔎 Similar Papers
No similar papers found.
C
Chunyong Hu
Unmanned Vehicle Dept., CaiNiao Inc., Alibaba Group
Q
Qi Luo
Unmanned Vehicle Dept., CaiNiao Inc., Alibaba Group
Jianyun Xu
Jianyun Xu
Alibaba DAMO Academy
3D PerceptionAutonomous Driving
S
Song Wang
Unmanned Vehicle Dept., CaiNiao Inc., Alibaba Group; Zhejiang University
Q
Qiang Li
Unmanned Vehicle Dept., CaiNiao Inc., Alibaba Group
S
Sheng Yang
Unmanned Vehicle Dept., CaiNiao Inc., Alibaba Group