FLAIR: Frequency- and Locality-Aware Implicit Neural Representations
Sukhun Ko, Seokhyun Yoon, Dahyeon Kye, Kyle Min, Chanho Eom, Jihyong Oh
TL;DR
FLAIR addresses spectral bias and poor locality in implicit neural representations by introducing Band-Localized Activation (BLA) for adaptive frequency control and locality, and Wavelet-Energy-Guided Encoding (WEGE) for region-aware frequency guidance using a lightweight, plug-and-play wavelet prior. The approach is analyzed through TFUP and empirical NTK perspectives, and validated across 2D image fitting, 3D SDF reconstruction, and NeRF with strong, fast-converging performance and compact models. Ablations confirm the complementary benefits of BLA components and WEGE, while experiments demonstrate robustness in arbitrary-scale SR and limited-view NeRF. Limitations include performance gaps under extreme degradation and per-scene training requirements, suggesting future work on generalization and diffusion-based baselines for more extreme tasks.
Abstract
Implicit Neural Representations (INRs) leverage neural networks to map coordinates to corresponding signals, enabling continuous and compact representations. This paradigm has driven significant advances in various vision tasks. However, existing INRs lack frequency selectivity and spatial localization, leading to an over-reliance on redundant signal components. Consequently, they exhibit spectral bias, tending to learn low-frequency components early while struggling to capture fine high-frequency details. To address these issues, we propose FLAIR (Frequency- and Locality-Aware Implicit Neural Representations), which incorporates two key innovations. The first is Band-Localized Activation (BLA), a novel activation designed for joint frequency selection and spatial localization under the constraints of the time-frequency uncertainty principle (TFUP). Through structured frequency control and spatially localized responses, BLA effectively mitigates spectral bias and enhances training stability. The second is Wavelet-Energy-Guided Encoding (WEGE), which leverages the discrete wavelet transform to compute energy scores and explicitly guide frequency information to the network, enabling precise frequency selection and adaptive band control. Our method consistently outperforms existing INRs in 2D image representation, as well as 3D shape reconstruction and novel view synthesis.
