Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration
Chu-Jie Qin, Rui-Qi Wu, Zikun Liu, Xin Lin, Chun-Le Guo, Hyun Hee Park, Chongyi Li
TL;DR
This work tackles all-in-one blind image restoration by reframing learning around intrinsic image content rather than degradation priors. It introduces RAM, a two-stage pipeline with MIM-based pretraining on masked degraded images and MAC-guided fine-tuning of a small subset of layers to bridge input integrity gaps while preserving learned priors. The approach yields state-of-the-art or competitive results across multiple degradation tasks and architectures, with robust ablations supporting the effectiveness of 1×1 masking, 50% masking, paired pretraining data, and MAC-driven layer selection. The method offers a scalable, plug-and-play solution for unified restoration and has practical implications for real-world imaging systems and downstream tasks.
Abstract
All-in-one image restoration aims to handle multiple degradation types using one model. This paper proposes a simple pipeline for all-in-one blind image restoration to Restore Anything with Masks (RAM). We focus on the image content by utilizing Mask Image Modeling to extract intrinsic image information rather than distinguishing degradation types like other methods. Our pipeline consists of two stages: masked image pre-training and fine-tuning with mask attribute conductance. We design a straightforward masking pre-training approach specifically tailored for all-in-one image restoration. This approach enhances networks to prioritize the extraction of image content priors from various degradations, resulting in a more balanced performance across different restoration tasks and achieving stronger overall results. To bridge the gap of input integrity while preserving learned image priors as much as possible, we selectively fine-tuned a small portion of the layers. Specifically, the importance of each layer is ranked by the proposed Mask Attribute Conductance (MAC), and the layers with higher contributions are selected for finetuning. Extensive experiments demonstrate that our method achieves state-of-the-art performance. Our code and model will be released at \href{https://github.com/Dragonisss/RAM}{https://github.com/Dragonisss/RAM}.
