GaINeR: Geometry-Aware Implicit Network Representation

📅 2025-11-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional implicit neural representations (INRs) lack explicit geometric structure when modeling 2D images, resulting in weak local editability and poor compatibility with physics-based simulation. To address this, we propose GaussINR—a novel framework that integrates learnable Gaussian distributions with neural implicit representation. For each query point, GaussINR retrieves the K nearest Gaussian kernels in space and aggregates their embeddings via distance-weighted fusion, followed by a feed-forward network to predict pixel RGB values. This enables continuous image modeling while encoding explicit geometric semantics. Crucially, GaussINR is the first INR method to embed parameterized geometric structure directly into its representation, thereby enhancing interpretability, enabling fine-grained local editing, and facilitating seamless integration with physical simulators. Extensive experiments demonstrate state-of-the-art fidelity on image reconstruction, super-resolution, and compression tasks. The framework further supports intuitive semantic editing and physics-aware synthesis. Code is publicly available.

Technology Category

Application Category

📝 Abstract
Implicit Neural Representations (INRs) have become an essential tool for modeling continuous 2D images, enabling high-fidelity reconstruction, super-resolution, and compression. Popular architectures such as SIREN, WIRE, and FINER demonstrate the potential of INR for capturing fine-grained image details. However, traditional INRs often lack explicit geometric structure and have limited capabilities for local editing or integration with physical simulation, restricting their applicability in dynamic or interactive settings. To address these limitations, we propose GaINeR: Geometry-Aware Implicit Network Representation, a novel framework for 2D images that combines trainable Gaussian distributions with a neural network-based INR. For a given image coordinate, the model retrieves the K nearest Gaussians, aggregates distance-weighted embeddings, and predicts the RGB value via a neural network. This design enables continuous image representation, interpretable geometric structure, and flexible local editing, providing a foundation for physically aware and interactive image manipulation. The official implementation of our method is publicly available at https://github.com/WJakubowska/GaINeR.
Problem

Research questions and friction points this paper is trying to address.

Enables geometric structure in implicit neural image representations
Overcomes limitations in local editing and physical simulation integration
Provides interpretable geometry for interactive image manipulation applications
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines trainable Gaussians with neural network INR
Retrieves nearest Gaussians for coordinate embeddings
Enables geometric structure and local image editing
🔎 Similar Papers
No similar papers found.
W
Weronika Jakubowska
Wrocław University of Science and Technology
M
Mikołaj Zieliński
Poznań University of Technology
R
Rafał Tobiasz
IDEAS Research Institute
K
Krzysztof Byrski
Jagiellonian University
M
Maciej Zięba
Wrocław University of Science and Technology
Dominik Belter
Dominik Belter
Institute of Robotics and Machine Intelligence, Poznan University of Technology
Walking robotsArtificial IntelligenceMachine Learning
Przemysław Spurek
Przemysław Spurek
Jagiellonian University, GMUM, IDEAS
mathematicscomputer science