GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution
Jintong Hu, Bin Xia, Bin Chen, Wenming Yang, Lei Zhang
TL;DR
This work targets the fidelity limitations of implicit neural representations for arbitrary-scale image super-resolution by introducing GaussianSR, which models each pixel as a continuous Gaussian field and learns a per-pixel Gaussian kernel via a classifier. The approach leverages 2D Gaussian Splatting, LR feature initialization, and a dual-stream upsampling architecture to enable end-to-end training with fewer parameters while capturing long-range dependencies. Empirical results across DIV2K, General100, BSD100, Urban100, and Manga109 show competitive or superior performance, especially on noninteger scales and high-resolution content, with ablations supporting the effectiveness of the Gaussian bank size and channel decoupling. Overall, GaussianSR establishes a new paradigm for ASSR by providing a continuous, interpretable, and efficient representation that flexibly adapts to input characteristics through learned Gaussian kernels.
Abstract
Implicit neural representations (INRs) have significantly advanced the field of arbitrary-scale super-resolution (ASSR) of images. Most existing INR-based ASSR networks first extract features from the given low-resolution image using an encoder, and then render the super-resolved result via a multi-layer perceptron decoder. Although these approaches have shown promising results, their performance is constrained by the limited representation ability of discrete latent codes in the encoded features. In this paper, we propose a novel ASSR method named GaussianSR that overcomes this limitation through 2D Gaussian Splatting (2DGS). Unlike traditional methods that treat pixels as discrete points, GaussianSR represents each pixel as a continuous Gaussian field. The encoded features are simultaneously refined and upsampled by rendering the mutually stacked Gaussian fields. As a result, long-range dependencies are established to enhance representation ability. In addition, a classifier is developed to dynamically assign Gaussian kernels to all pixels to further improve flexibility. All components of GaussianSR (i.e., encoder, classifier, Gaussian kernels, and decoder) are jointly learned end-to-end. Experiments demonstrate that GaussianSR achieves superior ASSR performance with fewer parameters than existing methods while enjoying interpretable and content-aware feature aggregations.
