Uncertainty Estimation for Super-Resolution using ESRGAN
Maniraj Sai Adapa, Marco Zullich, Matias Valdenegro-Toro
TL;DR
This work addresses the lack of principled predictive uncertainty in deep SR by integrating Monte Carlo Dropout and Deep Ensembles with state-of-the-art GAN-based models (SRGAN/ESRGAN) to produce per-pixel uncertainty maps alongside SR outputs. The authors demonstrate that uncertainty estimates are decently calibrated and do not degrade SR performance, with ensembles often achieving the best PSNR and reliable error–uncertainty correlations. The approach provides practical value by highlighting uncertain regions where SR may be inaccurate, potentially guiding human users and downstream systems. This enhances the interpretability and responsible deployment of SR in real-world applications where out-of-distribution inputs or fine textures pose challenges.
Abstract
Deep Learning-based image super-resolution (SR) has been gaining traction with the aid of Generative Adversarial Networks. Models like SRGAN and ESRGAN are constantly ranked between the best image SR tools. However, they lack principled ways for estimating predictive uncertainty. In the present work, we enhance these models using Monte Carlo-Dropout and Deep Ensemble, allowing the computation of predictive uncertainty. When coupled with a prediction, uncertainty estimates can provide more information to the model users, highlighting pixels where the SR output might be uncertain, hence potentially inaccurate, if these estimates were to be reliable. Our findings suggest that these uncertainty estimates are decently calibrated and can hence fulfill this goal, while providing no performance drop with respect to the corresponding models without uncertainty estimation.
