🤖 AI Summary
Traditional implicit neural representations (INRs) lack explicit geometric structure when modeling 2D images, resulting in weak local editability and poor compatibility with physics-based simulation. To address this, we propose GaussINR—a novel framework that integrates learnable Gaussian distributions with neural implicit representation. For each query point, GaussINR retrieves the K nearest Gaussian kernels in space and aggregates their embeddings via distance-weighted fusion, followed by a feed-forward network to predict pixel RGB values. This enables continuous image modeling while encoding explicit geometric semantics. Crucially, GaussINR is the first INR method to embed parameterized geometric structure directly into its representation, thereby enhancing interpretability, enabling fine-grained local editing, and facilitating seamless integration with physical simulators. Extensive experiments demonstrate state-of-the-art fidelity on image reconstruction, super-resolution, and compression tasks. The framework further supports intuitive semantic editing and physics-aware synthesis. Code is publicly available.
📝 Abstract
Implicit Neural Representations (INRs) have become an essential tool for modeling continuous 2D images, enabling high-fidelity reconstruction, super-resolution, and compression. Popular architectures such as SIREN, WIRE, and FINER demonstrate the potential of INR for capturing fine-grained image details. However, traditional INRs often lack explicit geometric structure and have limited capabilities for local editing or integration with physical simulation, restricting their applicability in dynamic or interactive settings. To address these limitations, we propose GaINeR: Geometry-Aware Implicit Network Representation, a novel framework for 2D images that combines trainable Gaussian distributions with a neural network-based INR. For a given image coordinate, the model retrieves the K nearest Gaussians, aggregates distance-weighted embeddings, and predicts the RGB value via a neural network. This design enables continuous image representation, interpretable geometric structure, and flexible local editing, providing a foundation for physically aware and interactive image manipulation. The official implementation of our method is publicly available at https://github.com/WJakubowska/GaINeR.