EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting

πŸ“… 2025-06-26
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address SLAM tracking drift caused by photometric inconsistency on non-Lambertian surfaces and respiratory motion in endoscopic surgery, this paper proposes a highly robust real-time 3D reconstruction method. The approach jointly optimizes SLAM pose estimation and scene reconstruction. Key contributions include: (1) the first optical-flow-constrained 3D Gaussian splatting optimization framework incorporating geometric consistency priors; (2) a depth regularization strategy ensuring the validity and stability of sparse endoscopic depth measurements; and (3) a keyframe-quality-aware adaptive Gaussian refinement mechanism to enhance modeling accuracy in dynamic scenes. Evaluated on C3VD and StereoMIS datasets, the method achieves a 2.1 dB PSNR improvement in novel-view synthesis and reduces absolute trajectory error (ATE) by 37% over state-of-the-art methods. It supports real-time operation in both static and dynamic surgical scenarios.

Technology Category

Application Category

πŸ“ Abstract
Efficient three-dimensional reconstruction and real-time visualization are critical in surgical scenarios such as endoscopy. In recent years, 3D Gaussian Splatting (3DGS) has demonstrated remarkable performance in efficient 3D reconstruction and rendering. Most 3DGS-based Simultaneous Localization and Mapping (SLAM) methods only rely on the appearance constraints for optimizing both 3DGS and camera poses. However, in endoscopic scenarios, the challenges include photometric inconsistencies caused by non-Lambertian surfaces and dynamic motion from breathing affects the performance of SLAM systems. To address these issues, we additionally introduce optical flow loss as a geometric constraint, which effectively constrains both the 3D structure of the scene and the camera motion. Furthermore, we propose a depth regularisation strategy to mitigate the problem of photometric inconsistencies and ensure the validity of 3DGS depth rendering in endoscopic scenes. In addition, to improve scene representation in the SLAM system, we improve the 3DGS refinement strategy by focusing on viewpoints corresponding to Keyframes with suboptimal rendering quality frames, achieving better rendering results. Extensive experiments on the C3VD static dataset and the StereoMIS dynamic dataset demonstrate that our method outperforms existing state-of-the-art methods in novel view synthesis and pose estimation, exhibiting high performance in both static and dynamic surgical scenes. The source code will be publicly available upon paper acceptance.
Problem

Research questions and friction points this paper is trying to address.

Address photometric inconsistencies in endoscopic SLAM
Improve 3D reconstruction with flow-constrained Gaussian splatting
Enhance real-time surgical scene representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses optical flow loss for geometric constraints
Implements depth regularization for photometric inconsistencies
Refines 3DGS strategy with keyframe focus
πŸ”Ž Similar Papers
No similar papers found.
T
Taoyu Wu
School of Advanced Technology, Xi’an Jiaotong Liverpool University, Suzhou, China
Y
Yiyi Miao
School of AI and Advanced Computing, Xi’an Jiaotong Liverpool University, Suzhou, China; School of Electrical Engineering, Electronics and Computer Science, University of Liverpool, Liverpool, United Kingdom
Z
Zhuoxiao Li
School of Advanced Technology, Xi’an Jiaotong Liverpool University, Suzhou, China
Haocheng Zhao
Haocheng Zhao
Xi'an Jiaotong Liverpool University
Neural NetworksNeural Network PruningRadar-Camera Fusion
Kang Dang
Kang Dang
Xi'an Jiaotong-Liverpool University
Computer VisionMedicial Image Analysis
Jionglong Su
Jionglong Su
Xi'an Jiaotong-Liverpool University
AI Big Data Machine Learning Statistics
Limin Yu
Limin Yu
Xi'an Jiaotong-Liverpool University
sonar detectionrational waveletsmedical image analysisAGV system design
Haoang Li
Haoang Li
Assistant Professor, Hong Kong University of Science and Technology (Guangzhou)
Robotics3D Computer Vision