FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

Hao Li; Xiang Chen; Jiangxin Dong; Jinhui Tang; Jinshan Pan

FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

Hao Li, Xiang Chen, Jiangxin Dong, Jinhui Tang, Jinshan Pan

TL;DR

This work confronts the limited real-world generalization of universal image restoration by introducing a million-scale real-world paired dataset gathered with a robotic shooting system and a robust two-tier model, FoundIR. FoundIR combines a diffusion-based degradation-agnostic generalist with degradation-aware specialists in an ensemble, supported by an incremental learning strategy to scale with data while mitigating forgetting. Empirical results across 24 benchmarks and public datasets show state-of-the-art performance and strong generalization, highlighting the dataset’s value and the method’s effectiveness in handling diverse real-world degradations. The approach has practical implications for building more reliable foundation-like models in image restoration and sets a new direction for dataset scale and training strategies in this domain.

Abstract

Despite the significant progress made by all-in-one models in universal image restoration, existing methods suffer from a generalization bottleneck in real-world scenarios, as they are mostly trained on small-scale synthetic datasets with limited degradations. Therefore, large-scale high-quality real-world training data is urgently needed to facilitate the emergence of foundational models for image restoration. To advance this field, we spare no effort in contributing a million-scale dataset with two notable advantages over existing training data: real-world samples with larger-scale, and degradation types with higher diversity. By adjusting internal camera settings and external imaging conditions, we can capture aligned image pairs using our well-designed data acquisition system over multiple rounds and our data alignment criterion. Moreover, we propose a robust model, FoundIR, to better address a broader range of restoration tasks in real-world scenarios, taking a further step toward foundation models. Specifically, we first utilize a diffusion-based generalist model to remove degradations by learning the degradation-agnostic common representations from diverse inputs, where incremental learning strategy is adopted to better guide model training. To refine the model's restoration capability in complex scenarios, we introduce degradation-aware specialist models for achieving final high-quality results. Extensive experiments show the value of our dataset and the effectiveness of our method.

FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

TL;DR

Abstract

FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)