Image Super-Resolution with Guarantees via Conformalized Generative Models
Eduardo Adame, Daniel Csillag, Guilherme Tegoni Goedert
TL;DR
The paper tackles trustworthy uncertainty quantification in diffusion-based image super-resolution by introducing a conformal-prediction framework that yields a trust mask M_α(X) calibrated on unlabeled high-resolution data. It defines a model-indecision map σ to generate score masks and supports multiple fidelity metrics D_p, including pointwise, neighborhood-averaged, and semantic variants, all with probabilistic guarantees. The method provably controls fidelity error and PSNR, is robust to data leakage, and remains model-agnostic, applicable even to APIs, with efficient calibration. Empirical results on LIU4K with SinSR demonstrate accurate, interpretable confidence regions and favorable comparisons to prior uncertainty methods, suggesting practical utility for deploying high-resolution generative models in real-world settings.
Abstract
The increasing use of generative ML foundation models for image restoration tasks such as super-resolution calls for robust and interpretable uncertainty quantification methods. We address this need by presenting a novel approach based on conformal prediction techniques to create a 'confidence mask' capable of reliably and intuitively communicating where the generated image can be trusted. Our method is adaptable to any black-box generative model, including those locked behind an opaque API, requires only easily attainable data for calibration, and is highly customizable via the choice of a local image similarity metric. We prove strong theoretical guarantees for our method that span fidelity error control (according to our local image similarity metric), reconstruction quality, and robustness in the face of data leakage. Finally, we empirically evaluate these results and establish our method's solid performance.
