SP-SLAM: Neural Real-Time Dense SLAM With Scene Priors

📅 2025-01-11
🏛️ IEEE transactions on circuits and systems for video technology (Print)
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing neural implicit SLAM methods suffer from ambiguous reconstructions and poor real-time performance, primarily due to ineffective modeling of scene priors. This paper introduces SP-SLAM—the first real-time RGB-D dense SLAM system integrating sparse voxel encoding priors with tri-plane representations. Its core contributions are: (1) a surface-proximity-aware sparse voxel prior that accelerates implicit field convergence; (2) inter-frame global voxel fusion coupled with joint online optimization over all historical poses; and (3) a lightweight tri-plane feature storage scheme that balances texture fidelity and computational efficiency under memory constraints. Evaluated on five standard benchmarks—including Replica—SP-SLAM achieves significant improvements in pose accuracy and reconstruction completeness while sustaining >30 FPS real-time performance, consistently outperforming state-of-the-art methods.

Technology Category

Application Category

📝 Abstract
Neural implicit representations have recently shown promising progress in dense Simultaneous Localization And Mapping (SLAM). However, existing works have shortcomings in terms of reconstruction quality and real-time performance, mainly due to inflexible scene representation strategy without leveraging any prior information. In this paper, we introduce SP-SLAM, a novel neural RGB-D SLAM system that performs tracking and mapping in real-time. SP-SLAM computes depth images and establishes sparse voxel-encoded scene priors near the surfaces to achieve rapid convergence of the model. Subsequently, the encoding voxels computed from single-frame depth image are fused into a global volume, which facilitates high-fidelity surface reconstruction. Simultaneously, we employ tri-planes to store scene appearance information, striking a balance between achieving high-quality geometric texture mapping and minimizing memory consumption. Furthermore, in SP-SLAM, we introduce an effective optimization strategy for mapping, allowing the system to continuously optimize the poses of all historical input frames during runtime without increasing computational overhead. We conduct extensive evaluations on five benchmark datasets (Replica, ScanNet, TUM RGB-D, Synthetic RGB-D, 7-Scenes). The results demonstrate that, compared to existing methods, we achieve superior tracking accuracy and reconstruction quality, while running at a significantly faster speed.
Problem

Research questions and friction points this paper is trying to address.

Neural Implicit Representations
SLAM
Real-time Processing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Real-time Dense SLAM
Depth Image Accelerated Learning
Optimized Tracking and Reconstruction
🔎 Similar Papers
No similar papers found.
Z
Zhen Hong
Institute of Cyberspace Security and College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
B
Bowen Wang
Institute of Cyberspace Security and College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Haoran Duan
Haoran Duan
Tsinghua/Newcastle/Durham University
Multimodal AIGenerative AI
Y
Yawen Huang
Tencent Jarvis Lab, Shenzhen 518057, China
Xiong Li
Xiong Li
University of Electronic Science and Technology of China
Information securityCryptography
Zhenyu Wen
Zhenyu Wen
Zhejiang University of Technology
AI SystemCloud computingSocial computingDistributed computing
X
Xiang Wu
Institute of Cyberspace Security and College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
W
Wei Xiang
School of Engineering and Mathematical Sciences, La Trobe University, Melbourne, VIC 3086, Australia
Yefeng Zheng
Yefeng Zheng
Professor, Westlake University, Hangzhou, China, IEEE Fellow, AIMBE Fellow
AI in HealthMedical ImagingComputer VisionNatural Language ProcessingLarge Language Model