Learning Single-Image Super-Resolution in the JPEG Compressed Domain
Sruthi Srinivasan, Elham Shakibapour, Rajy Rawther, Mehdi Saeedi
TL;DR
This work targets the data-loading bottleneck in deep learning for image restoration by prototyping an end-to-end single-image super-resolution pipeline that operates directly on JPEG DCT coefficients, bypassing full JPEG decoding. The authors introduce FreqSR, a lightweight frequency-domain architecture that processes 64 DCT channels (one per 8×8 block) with depth-wise residual blocks, while focusing on the luminance Y channel and upsampling chrominance in post-processing. Empirical results show substantial data-loading and end-to-end training speedups (approximately 2.6× and 2.5×, respectively) with competitive or acceptable visual quality compared to RGB-based baselines, highlighting the practicality of compressed-domain learning for resource-constrained settings. The work suggests promising directions for edge AI, including extensions to video SR and other codecs, and emphasizes the need to further address misalignment artifacts and robustness in compressed-domain restoration.
Abstract
Deep learning models have grown increasingly complex, with input data sizes scaling accordingly. Despite substantial advances in specialized deep learning hardware, data loading continues to be a major bottleneck that limits training and inference speed. To address this challenge, we propose training models directly on encoded JPEG features, reducing the computational overhead associated with full JPEG decoding and significantly improving data loading efficiency. While prior works have focused on recognition tasks, we investigate the effectiveness of this approach for the restoration task of single-image super-resolution (SISR). We present a lightweight super-resolution pipeline that operates on JPEG discrete cosine transform (DCT) coefficients in the frequency domain. Our pipeline achieves a 2.6x speedup in data loading and a 2.5x speedup in training, while preserving visual quality comparable to standard SISR approaches.
