Visual Loop Closure Detection Through Deep Graph Consensus

📅 2025-05-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Visual loop closure detection suffers from high false-positive rates and computationally expensive RANSAC-based geometric verification, limiting both accuracy and real-time performance of online SLAM systems. To address these challenges, we propose LoopGNN—the first approach to integrate Graph Neural Networks (GNNs) into loop closure detection. LoopGNN constructs a multi-frame neighborhood graph over keyframes, models local consistency via visually similar keyframe cliques, and achieves robust feature consensus estimation through deep feature propagation. Additionally, a multi-scale keypoint extractor is introduced to enhance generalization. Evaluated on TartanDrive 2.0 and NCLT datasets, LoopGNN significantly outperforms conventional and state-of-the-art deep learning methods, achieving high recall while drastically reducing verification latency—surpassing RANSAC in computational efficiency. The source code and datasets are publicly available.

Technology Category

Application Category

📝 Abstract
Visual loop closure detection traditionally relies on place recognition methods to retrieve candidate loops that are validated using computationally expensive RANSAC-based geometric verification. As false positive loop closures significantly degrade downstream pose graph estimates, verifying a large number of candidates in online simultaneous localization and mapping scenarios is constrained by limited time and compute resources. While most deep loop closure detection approaches only operate on pairs of keyframes, we relax this constraint by considering neighborhoods of multiple keyframes when detecting loops. In this work, we introduce LoopGNN, a graph neural network architecture that estimates loop closure consensus by leveraging cliques of visually similar keyframes retrieved through place recognition. By propagating deep feature encodings among nodes of the clique, our method yields high-precision estimates while maintaining high recall. Extensive experimental evaluations on the TartanDrive 2.0 and NCLT datasets demonstrate that LoopGNN outperforms traditional baselines. Additionally, an ablation study across various keypoint extractors demonstrates that our method is robust, regardless of the type of deep feature encodings used, and exhibits higher computational efficiency compared to classical geometric verification baselines. We release our code, supplementary material, and keyframe data at https://loopgnn.cs.uni-freiburg.de.
Problem

Research questions and friction points this paper is trying to address.

Improving visual loop closure detection accuracy and efficiency
Reducing false positives in online SLAM scenarios
Enhancing loop verification using graph neural networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses graph neural network for loop closure
Leverages cliques of visually similar keyframes
Propagates deep feature encodings among nodes
🔎 Similar Papers
No similar papers found.
M
Martin Buchner
Department of Computer Science, University of Freiburg, Germany.
L
Liza Dahiya
Solution System Development Center in Honda R&D Co,. Ltd., Japan.
S
S. Dorer
Department of Computer Science, University of Freiburg, Germany.
Vipul Ramtekkar
Vipul Ramtekkar
Honda R&D Co., Ltd.
Computer VisionRobotics
K
Kenji Nishimiya
Solution System Development Center in Honda R&D Co,. Ltd., Japan.
D
Daniele Cattaneo
Department of Computer Science, University of Freiburg, Germany.
Abhinav Valada
Abhinav Valada
Professor & Director of Robot Learning Lab, University of Freiburg
RoboticsMachine LearningComputer VisionArtificial Intelligence