FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss
Meiyi Wei, Liu Xie, Ying Sun, Gang Chen
TL;DR
FreqINR addresses frequency-domain mismatches that cause artifacts in arbitrary-scale SR by enforcing spectral consistency throughout training with Adaptive DCT Frequency Loss (ADFL) and by enlarging the encoder receptive field during inference. It combines a 2D DCT-based frequency representation, a Frequency Distance Matrix, and an Adaptive Frequency Weighting Matrix to dynamically focus on challenging frequencies, integrated into the overall loss as $L_{total} = L_{spatial} + \lambda L_{ADFL}$. An Enhanced Receptive Field encoder extends spectral coverage without significant cost, enabling better high-frequency detail transfer from LR inputs. Empirical results on DIV2K and other benchmarks show consistent PSNR improvements and superior qualitative texture reconstruction, establishing FreqINR as a lightweight yet effective framework for arbitrary-scale SR with potential applicability to other frequency-aware reconstruction tasks.
Abstract
Recent advancements in local Implicit Neural Representation (INR) demonstrate its exceptional capability in handling images at various resolutions. However, frequency discrepancies between high-resolution (HR) and ground-truth images, especially at larger scales, result in significant artifacts and blurring in HR images. This paper introduces Frequency Consistency for Implicit Neural Representation (FreqINR), an innovative Arbitrary-scale Super-resolution method aimed at enhancing detailed textures by ensuring spectral consistency throughout both training and inference. During training, we employ Adaptive Discrete Cosine Transform Frequency Loss (ADFL) to minimize the frequency gap between HR and ground-truth images, utilizing 2-Dimensional DCT bases and focusing dynamically on challenging frequencies. During inference, we extend the receptive field to preserve spectral coherence between low-resolution (LR) and ground-truth images, which is crucial for the model to generate high-frequency details from LR counterparts. Experimental results show that FreqINR, as a lightweight approach, achieves state-of-the-art performance compared to existing Arbitrary-scale Super-resolution methods and offers notable improvements in computational efficiency. The code for our method will be made publicly available.
