NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results
Longguang Wang, Yulan Guo, Juncheng Li, Hongda Liu, Yang Zhao, Yingqian Wang, Zhi Jin, Shuhang Gu, Radu Timofte
TL;DR
The NTIRE 2024 Challenge on Stereo Image SR investigates x4 super-resolution for LR stereo pairs under tight computational budgets, emphasizing cross-view information for stereo-consistent reconstruction. It surveys related single-image SR and stereo SR work, details a Flickr1024-based dataset with two degradation tracks, and reports finalist performances that predominantly fuse Transformer backbones with parallax-attention-based cross-view fusion under strict size and MAC constraints. Top solutions leverage efficient cross-view interaction (PAM, SCAM, MCAM) and lightweight convolutional blocks (depthwise/NAF variants, Mamba-inspired ideas) to balance accuracy and efficiency, establishing a new benchmark for resource-constrained stereo SR. The results guide practical design choices for real-world stereo SR systems and highlight the continued importance of cross-view cues and efficient architectures.
Abstract
This paper summarizes the 3rd NTIRE challenge on stereo image super-resolution (SR) with a focus on new solutions and results. The task of this challenge is to super-resolve a low-resolution stereo image pair to a high-resolution one with a magnification factor of x4 under a limited computational budget. Compared with single image SR, the major challenge of this challenge lies in how to exploit additional information in another viewpoint and how to maintain stereo consistency in the results. This challenge has 2 tracks, including one track on bicubic degradation and one track on real degradations. In total, 108 and 70 participants were successfully registered for each track, respectively. In the test phase, 14 and 13 teams successfully submitted valid results with PSNR (RGB) scores better than the baseline. This challenge establishes a new benchmark for stereo image SR.
