QUIET-SR: Quantum Image Enhancement Transformer for Single Image Super-Resolution
Siddhant Dutta, Nouhaila Innan, Khadijeh Najafi, Sadok Ben Yahia, Muhammad Shafique
TL;DR
The paper tackles single-image super-resolution under the constraints of near-term quantum hardware by introducing QUIET-SR, a hybrid quantum-classical transformer that integrates Shifted Quantum Window attention built on variational quantum circuits. It extends the Swin Transformer with quantum attention to capture non-local, high-dimensional feature interactions while keeping circuit depth and qubit requirements suitable for NISQ devices. Empirical results on MNIST, FashionMNIST, and MedMNIST show competitive PSNR/SSIM with a compact model (~1.55 MB) and demonstrate robustness to realistic quantum noise, supported by analyses of long-range dependencies (Distance Correlation and HSIC). The paper also proposes scalable batching for multi-QPU–GPU quantum computing and offers a concrete resource and performance framework that informs future quantum vision research.
Abstract
Recent advancements in Single-Image Super-Resolution (SISR) using deep learning have significantly improved image restoration quality. However, the high computational cost of processing high-resolution images due to the large number of parameters in classical models, along with the scalability challenges of quantum algorithms for image processing, remains a major obstacle. In this paper, we propose the Quantum Image Enhancement Transformer for Super-Resolution (QUIET-SR), a hybrid framework that extends the Swin transformer architecture with a novel shifted quantum window attention mechanism, built upon variational quantum neural networks. QUIET-SR effectively captures complex residual mappings between low-resolution and high-resolution images, leveraging quantum attention mechanisms to enhance feature extraction and image restoration while requiring a minimal number of qubits, making it suitable for the Noisy Intermediate-Scale Quantum (NISQ) era. We evaluate our framework in MNIST (30.24 PSNR, 0.989 SSIM), FashionMNIST (29.76 PSNR, 0.976 SSIM) and the MedMNIST dataset collection, demonstrating that QUIET-SR achieves PSNR and SSIM scores comparable to state-of-the-art methods while using fewer parameters. Our efficient batching strategy directly enables massive parallelization on multiple QPU's paving the way for practical quantum-enhanced image super-resolution through coordinated QPU-GPU quantum supercomputing.
