Wrivinder: Towards Spatial Intelligence for Geo-locating Ground Images onto Satellite Imagery

๐Ÿ“… 2026-02-16
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenge of accurately aligning ground-level images with satellite imagery in the presence of large viewpoint discrepancies or unreliable GPS signals. To this end, the authors propose a zero-shot, geometry-driven cross-view geolocalization framework that fuses multi-view ground images to generate a consistent nadir view for matching against satellite imagery. The approach integrates Structure-from-Motion (SfM) reconstruction, 3D Gaussian Splatting, semantic anchoring, and monocular depth cues, enabling geometrically centered alignment without any paired supervisionโ€”a first in the field. As part of this contribution, the authors introduce MC-Sat, the first systematic benchmark dataset for cross-view geolocalization. Evaluated under a zero-shot setting, the method achieves sub-30-meter localization accuracy across both dense urban and large-scale scenes.

Technology Category

Application Category

๐Ÿ“ Abstract
Aligning ground-level imagery with geo-registered satellite maps is crucial for mapping, navigation, and situational awareness, yet remains challenging under large viewpoint gaps or when GPS is unreliable. We introduce Wrivinder, a zero-shot, geometry-driven framework that aggregates multiple ground photographs to reconstruct a consistent 3D scene and align it with overhead satellite imagery. Wrivinder combines SfM reconstruction, 3D Gaussian Splatting, semantic grounding, and monocular depth--based metric cues to produce a stable zenith-view rendering that can be directly matched to satellite context for metrically accurate camera geo-localization. To support systematic evaluation of this task, which lacks suitable benchmarks, we also release MC-Sat, a curated dataset linking multi-view ground imagery with geo-registered satellite tiles across diverse outdoor environments. Together, Wrivinder and MC-Sat provide a first comprehensive baseline and testbed for studying geometry-centered cross-view alignment without paired supervision. In zero-shot experiments, Wrivinder achieves sub-30\,m geolocation accuracy across both dense and large-area scenes, highlighting the promise of geometry-based aggregation for robust ground-to-satellite localization.
Problem

Research questions and friction points this paper is trying to address.

geo-localization
cross-view alignment
ground-to-satellite
spatial intelligence
viewpoint gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

zero-shot geo-localization
geometry-driven alignment
3D Gaussian Splatting
cross-view matching
SfM reconstruction
๐Ÿ”Ž Similar Papers
No similar papers found.
C
Chandrakanth Gudavalli
Mayachitra, Inc.
T
Tajuddin Manhar Mohammed
Mayachitra, Inc.
A
Abhay Yadav
Johns Hopkins University
A
Ananth Vishnu Bhaskar
Mayachitra, Inc.
Hardik Prajapati
Hardik Prajapati
Ph.D. student at UCSB
Computer VisionMachine Learning
Cheng Peng
Cheng Peng
Johns Hopkins University
Computer Vision3D ReconstructionGenerative ModelingMedical Image Analysis
Rama Chellappa
Rama Chellappa
Bloomberg Distinguished Professor, Johns Hopkins University
Image Analysisartificial intelligencebiometricsComputer VisionBiomedical Data Science
S
Shivkumar Chandrasekaran
Mayachitra, Inc.
B
B. S. Manjunath
Mayachitra, Inc.