CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution
Jikai Wang, Huan Zheng, Jianbing Shen
TL;DR
CubeFormer addresses limited feature diversity in lightweight image super-resolution by introducing cube attention, which extends 2D attention into 3D cubes to enable exhaustive spatial–channel interactions. It employs cascaded Cube Transformer Groups with intra- and inter-cube transformer blocks to model local and global information, and offers CubeFormer-lite for efficiency via channel splitting. The model is trained with a joint spatial and frequency reconstruction loss to preserve texture and high-frequency details. Across standard benchmarks, CubeFormer achieves state-of-the-art performance for lightweight SR and demonstrates improved texture detail over prior methods, with CubeFormer-lite offering strong efficiency gains.
Abstract
Lightweight image super-resolution (SR) methods aim at increasing the resolution and restoring the details of an image using a lightweight neural network. However, current lightweight SR methods still suffer from inferior performance and unpleasant details. Our analysis reveals that these methods are hindered by constrained feature diversity, which adversely impacts feature representation and detail recovery. To respond this issue, we propose a simple yet effective baseline called CubeFormer, designed to enhance feature richness by completing holistic information aggregation. To be specific, we introduce cube attention, which expands 2D attention to 3D space, facilitating exhaustive information interactions, further encouraging comprehensive information extraction and promoting feature variety. In addition, we inject block and grid sampling strategies to construct intra-cube transformer blocks (Intra-CTB) and inter-cube transformer blocks (Inter-CTB), which perform local and global modeling, respectively. Extensive experiments show that our CubeFormer achieves state-of-the-art performance on commonly used SR benchmarks. Our source code and models will be publicly available.
