Exploring Kernel Transformations for Implicit Neural Representations
Sheng Zheng, Chaoning Zhang, Dongshen Han, Fachrina Dewi Puspitasari, Xinhong Hao, Yang Yang, Heng Tao Shen
TL;DR
The paper investigates kernel transformations applied to the input and output of implicit neural representations (INRs) as an alternative to modifying internal model components. It finds that nonlinear kernels degrade performance, while linear transformations—specifically input scaling and adaptive output shifting—consistently improve accuracy, motivating the SS-INR framework. Across image fitting, CT reconstruction, and audio representation, SS-INR yields gains for multiple INR backbones, indicating the approach's robustness and practical value. The work also offers depth- and normalization-based interpretations to explain why these simple I/O transformations help, and it opens avenues for future exploration of dynamic kernel transformations in INRs.
Abstract
Implicit neural representations (INRs), which leverage neural networks to represent signals by mapping coordinates to their corresponding attributes, have garnered significant attention. They are extensively utilized for image representation, with pixel coordinates as input and pixel values as output. In contrast to prior works focusing on investigating the effect of the model's inside components (activation function, for instance), this work pioneers the exploration of the effect of kernel transformation of input/output while keeping the model itself unchanged. A byproduct of our findings is a simple yet effective method that combines scale and shift to significantly boost INR with negligible computation overhead. Moreover, we present two perspectives, depth and normalization, to interpret the performance benefits caused by scale and shift transformation. Overall, our work provides a new avenue for future works to understand and improve INR through the lens of kernel transformation.
