The Perception-Robustness Tradeoff in Deterministic Image Restoration
Guy Ohayon, Tomer Michaeli, Michael Elad
TL;DR
This work proves a fundamental limitation: for non-invertible degradations, any deterministic image-restoration estimator achieving high joint perceptual quality must have a large Lipschitz constant, making it vulnerable to adversarial perturbations. The authors formalize a bound Lip$\left(\hat{X}\right) \ge \frac{m_1}{\sqrt{W_p(p_{X,Y},p_{hat{X},Y})}}-m_2$, connect it to the Wasserstein-based joint perceptual index, and validate the tradeoff through toy and real single-image super-resolution experiments across multiple degradations. They show that smaller joint perceptual distance (better perceptual quality and consistency) correlates with increased instability, but also demonstrate how this instability can be exploited to imitate stochastic posterior sampling via input perturbations (FPS-style exploration). The results highlight a practical tension between perceptual fidelity and robustness, with implications for attack surfaces and uncertainty quantification in restoration systems, and point to a path for posterior sampling using deterministic models. The work thus provides both a cautionary perspective on deterministic restorers and a tool for posterior-like exploration in imaging pipelines.
Abstract
We study the behavior of deterministic methods for solving inverse problems in imaging. These methods are commonly designed to achieve two goals: (1) attaining high perceptual quality, and (2) generating reconstructions that are consistent with the measurements. We provide a rigorous proof that the better a predictor satisfies these two requirements, the larger its Lipschitz constant must be, regardless of the nature of the degradation involved. In particular, to approach perfect perceptual quality and perfect consistency, the Lipschitz constant of the model must grow to infinity. This implies that such methods are necessarily more susceptible to adversarial attacks. We demonstrate our theory on single image super-resolution algorithms, addressing both noisy and noiseless settings. We also show how this undesired behavior can be leveraged to explore the posterior distribution, thereby allowing the deterministic model to imitate stochastic methods.
