FoundIR-v2: Optimizing Pre-Training Data Mixtures for Image Restoration Foundation Model
Xiang Chen, Jinshan Pan, Jiangxin Dong, Jian Yang, Jinhui Tang
TL;DR
The paper tackles the challenge of universal image restoration by optimizing how training data is mixture-mixed across tasks. It proposes FoundIR-v2, which combines data equilibrium scheduling to balance multi-task learning with a Mixture-of-Experts diffusion scheduler to allocate task-specific priors in a latent diffusion framework. Key contributions include formalizing data mixing laws for restoration, integrating MoE-guided priors, and demonstrating strong performance across 50+ subtasks and numerous benchmarks. The results indicate that dynamic data and model scheduling yield superior generalization and practical applicability in real-world restoration scenarios.
Abstract
Recent studies have witnessed significant advances in image restoration foundation models driven by improvements in the scale and quality of pre-training data. In this work, we find that the data mixture proportions from different restoration tasks are also a critical factor directly determining the overall performance of all-in-one image restoration models. To this end, we propose a high-capacity diffusion-based image restoration foundation model, FoundIR-v2, which adopts a data equilibrium scheduling paradigm to dynamically optimize the proportions of mixed training datasets from different tasks. By leveraging the data mixing law, our method ensures a balanced dataset composition, enabling the model to achieve consistent generalization and comprehensive performance across diverse tasks. Furthermore, we introduce an effective Mixture-of-Experts (MoE)-driven scheduler into generative pre-training to flexibly allocate task-adaptive diffusion priors for each restoration task, accounting for the distinct degradation forms and levels exhibited by different tasks. Extensive experiments demonstrate that our method can address over 50 sub-tasks across a broader scope of real-world scenarios and achieves favorable performance against state-of-the-art approaches.
