FIPER: Factorized Features for Robust Image Super-Resolution and Compression

Yang-Che Sun; Cheng Yu Yeo; Ernie Chu; Jun-Cheng Chen; Yu-Lun Liu

FIPER: Factorized Features for Robust Image Super-Resolution and Compression

Yang-Che Sun, Cheng Yu Yeo, Ernie Chu, Jun-Cheng Chen, Yu-Lun Liu

TL;DR

This work tackles the unification of low-level vision tasks by introducing Factorized Features, a basis–coefficient representation that explicitly encodes multi-scale frequencies. The approach comprises a Coefficient Backbone and a Basis Swin Transformer to generate spatially varying coefficients and content-aware bases, enabling accurate reconstruction via a coordinated sampling and reconstruction pipeline. By applying this representation to both Single-Image Super-Resolution and Learned Image Compression, and extending it to Multi-Image Compression with a Basis Merging Transformer, the method achieves state-of-the-art rate–distortion performance and substantial PSNR gains, while preserving structural detail such as repetitive patterns. The framework demonstrates the value of a frequency-aware, shared-basis representation for robust image reconstruction and compression, with potential extension to broader low-level vision tasks and video. Key ideas include $f(\mathbf{x}) = \sum_{i=1}^N c_i(\mathbf{x})\, b_i(\mathbf{x})$, learned bases $b_i$, coordinate transforms $\gamma$, and multi-frequency modulation $\{\alpha_j, \psi(\cdot)\}$ that jointly capture high- and low-frequency content.

Abstract

In this work, we propose using a unified representation, termed Factorized Features, for low-level vision tasks, where we test on Single Image Super-Resolution (SISR) and \textbf{Image Compression}. Motivated by the shared principles between these tasks, they require recovering and preserving fine image details, whether by enhancing resolution for SISR or reconstructing compressed data for Image Compression. Unlike previous methods that mainly focus on network architecture, our proposed approach utilizes a basis-coefficient decomposition as well as an explicit formulation of frequencies to capture structural components and multi-scale visual features in images, which addresses the core challenges of both tasks. We replace the representation of prior models from simple feature maps with Factorized Features to validate the potential for broad generalizability. In addition, we further optimize the compression pipeline by leveraging the mergeable-basis property of our Factorized Features, which consolidates shared structures on multi-frame compression. Extensive experiments show that our unified representation delivers state-of-the-art performance, achieving an average relative improvement of 204.4% in PSNR over the baseline in Super-Resolution (SR) and 9.35% BD-rate reduction in Image Compression compared to the previous SOTA. Project page: https://jayisaking.github.io/FIPER/

FIPER: Factorized Features for Robust Image Super-Resolution and Compression

TL;DR

Abstract

FIPER: Factorized Features for Robust Image Super-Resolution and Compression

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)