GaINeR: Geometry-Aware Implicit Network Representation
Weronika Jakubowska, Mikołaj Zieliński, Rafał Tobiasz, Krzysztof Byrski, Maciej Zięba, Dominik Belter, Przemysław Spurek
TL;DR
GaINeR addresses the lack of explicit geometry in 2D implicit neural representations by introducing trainable Gaussian embeddings that condition an INR decoder via radius-limited KNN aggregation, yielding continuous, geometry-aware reconstructions and intuitive local edits. The framework extends to 2D-to-3D lifting by lifting Gaussian means into 3D and predicting density for NeRF-like rendering, enabling depth-aware novel viewpoints and integration with physical simulations. Across Kodak and DIV2K, GaINeR achieves state-of-the-art reconstruction quality and demonstrates robust editing and physics-enabled dynamics, while maintaining a coherent, interpretable geometric prior. This work provides a unified, editable, geometry-aware representation that bridges 2D perception and 3D reasoning, with broad implications for interactive graphics and simulation-based learning.
Abstract
Implicit Neural Representations (INRs) have become an essential tool for modeling continuous 2D images, enabling high-fidelity reconstruction, super-resolution, and compression. Popular architectures such as SIREN, WIRE, and FINER demonstrate the potential of INR for capturing fine-grained image details. However, traditional INRs often lack explicit geometric structure and have limited capabilities for local editing or integration with physical simulation, restricting their applicability in dynamic or interactive settings. To address these limitations, we propose GaINeR: Geometry-Aware Implicit Network Representation, a novel framework for 2D images that combines trainable Gaussian distributions with a neural network-based INR. For a given image coordinate, the model retrieves the K nearest Gaussians, aggregates distance-weighted embeddings, and predicts the RGB value via a neural network. This design enables continuous image representation, interpretable geometric structure, and flexible local editing, providing a foundation for physically aware and interactive image manipulation. The official implementation of our method is publicly available at https://github.com/WJakubowska/GaINeR.
