๐ค AI Summary
This work proposes a geometry-driven image pair selection mechanism to overcome the high computational cost and limited reconstruction accuracy of traditional Structure-from-Motion (SfM) methods, which rely on visual similarity for pairing. Prior to feature matching, the approach evaluates the informativeness of candidate pairs using overlap and disparity metrics, constructs an information-weighted spanning tree, and incorporates a key-edge enhancement strategy. This method introduces the first โgeometry-firstโ pairing paradigm, substantially reducing matching complexity. Experiments demonstrate that, across multiple learning-based feature detectors, it achieves a 46.5% reduction in average rotation error, a 12.5% decrease in translation error, and a 98% reduction in the number of matched pairs, while accelerating the pipeline by up to 50รโall with reconstruction metric deviations maintained within ยฑ3%.
๐ Abstract
We present SARA (Scene-Aware Reconstruction Accelerator), a geometry-driven pair selection module for Structure-from-Motion (SfM). Unlike conventional pipelines that select pairs based on visual similarity alone, SARA introduces geometry-first pair selection by scoring reconstruction informativeness - the product of overlap and parallax - before expensive matching. A lightweight pre-matching stage uses mutual nearest neighbors and RANSAC to estimate these cues, then constructs an Information-Weighted Spanning Tree (IWST) augmented with targeted edges for loop closure, long-baseline anchors, and weak-view reinforcement. Compared to exhaustive matching, SARA reduces rotation errors by 46.5+-5.5% and translation errors by 12.5+-6.5% across modern learned detectors, while achieving at most 50x speedup through 98% pair reduction (from 30,848 to 580 pairs). This reduces matching complexity from quadratic to quasi-linear, maintaining within +-3% of baseline reconstruction metrics for 3D Gaussian Splatting and SVRaster.