Universal Robustness via Median Randomized Smoothing for Real-World Super-Resolution
Zakariya Chaouai, Mohamed Tamaazousti
TL;DR
This work tackles real-world robust single-image super-resolution by introducing CertSR, a universal robustness framework built on Median Randomized Smoothing (MRS). By applying Gaussian noise and per-pixel medians, CertSR provides certifiable $l_2$-robustness for SR outputs and demonstrates superior generalization across diverse adversarial attacks and real-world corruptions. The method extends adversarial attacks to SR (FGSM, BIM, PGD, CW) and shows that MRS-based certification yields better perceptual fidelity (LPIPS) while maintaining competitive PSNR/SSIM compared to prior RSR approaches. The practical impact lies in a robust SR pipeline that performs well without fine-tuning on specific real-world degradations, offering a transferable and scalable solution for real-world imaging tasks.
Abstract
Most of the recent literature on image Super-Resolution (SR) can be classified into two main approaches. The first one involves learning a corruption model tailored to a specific dataset, aiming to mimic the noise and corruption in low-resolution images, such as sensor noise. However, this approach is data-specific, tends to lack adaptability, and its accuracy diminishes when faced with unseen types of image corruptions. A second and more recent approach, referred to as Robust Super-Resolution (RSR), proposes to improve real-world SR by harnessing the generalization capabilities of a model by making it robust to adversarial attacks. To delve further into this second approach, our paper explores the universality of various methods for enhancing the robustness of deep learning SR models. In other words, we inquire: "Which robustness method exhibits the highest degree of adaptability when dealing with a wide range of adversarial attacks ?". Our extensive experimentation on both synthetic and real-world images empirically demonstrates that median randomized smoothing (MRS) is more general in terms of robustness compared to adversarial learning techniques, which tend to focus on specific types of attacks. Furthermore, as expected, we also illustrate that the proposed universal robust method enables the SR model to handle standard corruptions more effectively, such as blur and Gaussian noise, and notably, corruptions naturally present in real-world images. These results support the significance of shifting the paradigm in the development of real-world SR methods towards RSR, especially via MRS.
