Adaptive Blind All-in-One Image Restoration
David Serrano-Lozano, Luis Herranz, Shaolin Su, Javier Vazquez-Corral
TL;DR
This paper addresses blind all-in-one image restoration by proposing ABAIR, a three-phase framework that generalizes to unseen and composite degradations while allowing efficient updates. It combines large-scale synthetic pretraining with Degradation CutMix and a per-pixel degradation segmentation head, task-specific LoRA adapters, and a lightweight degradation estimator to blend adapters on a per-image basis. The approach outperforms state-of-the-art methods on five-task and three-task IR benchmarks and demonstrates robust performance on unseen datasets and mixed degradations, with minimal additional training when adding new degradations. The results indicate strong practical potential for versatile IR in real-world conditions, where degradations are diverse and evolving.
Abstract
Blind all-in-one image restoration models aim to recover a high-quality image from an input degraded with unknown distortions. However, these models require all the possible degradation types to be defined during the training stage while showing limited generalization to unseen degradations, which limits their practical application in complex cases. In this paper, we introduce ABAIR, a simple yet effective adaptive blind all-in-one restoration model that not only handles multiple degradations and generalizes well to unseen distortions but also efficiently integrates new degradations by training only a small subset of parameters. We first train our baseline model on a large dataset of natural images with multiple synthetic degradations. To enhance its ability to recognize distortions, we incorporate a segmentation head that estimates per-pixel degradation types. Second, we adapt our initial model to varying image restoration tasks using independent low-rank adapters. Third, we learn to adaptively combine adapters to versatile images via a flexible and lightweight degradation estimator. This specialize-then-merge approach is both powerful in addressing specific distortions and flexible in adapting to complex tasks. Moreover, our model not only surpasses state-of-the-art performance on five- and three-task IR setups but also demonstrates superior generalization to unseen degradations and composite distortions.
