Combi-CAM: A Novel Multi-Layer Approach for Explainable Image Geolocalization

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited interpretability of existing vision-based image geolocation models, which often fail to reveal the rationale behind their predictions. To overcome this limitation, the authors propose a multi-layer gradient-weighted class activation mapping (Grad-CAM) fusion strategy that moves beyond conventional approaches relying solely on the deepest convolutional features. By integrating activation signals from both intermediate and deep layers of a convolutional neural network, the method generates finer-grained and more comprehensive visual explanations of model decisions. This approach substantially enhances model transparency and trustworthiness, outperforming existing single-layer Grad-CAM techniques in interpretability and providing richer, more accurate justifications for geolocation predictions.

Technology Category

Application Category

📝 Abstract
Planet-scale photo geolocalization involves the intricate task of estimating the geographic location depicted in an image purely based on its visual features. While deep learning models, particularly convolutional neural networks (CNNs), have significantly advanced this field, understanding the reasoning behind their predictions remains challenging. In this paper, we present Combi-CAM, a novel method that enhances the explainability of CNN-based geolocalization models by combining gradient-weighted class activation maps obtained from several layers of the network architecture, rather than using only information from the deepest layer as is typically done. This approach provides a more detailed understanding of how different image features contribute to the model's decisions, offering deeper insights than the traditional approaches.
Problem

Research questions and friction points this paper is trying to address.

image geolocalization
explainability
deep learning
convolutional neural networks
model interpretability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combi-CAM
explainable AI
image geolocalization
multi-layer activation maps
CNN interpretability
🔎 Similar Papers
No similar papers found.
D
David Faget
Centre Borelli, ENS Paris-Saclay, Université de Paris, CNRS, INSERM, SSA, France
J
José Luis Lisani
Universitat de les Illes Balears, IAC3, Spain
Miguel Colom
Miguel Colom
Senior Researcher at Centre Borelli, ENS Paris-Saclay
Noise estimationimage denoisingstatistical image modelsreproducible researchsatellite design