Assessing Image Inpainting via Re-Inpainting Self-Consistency Evaluation
Tianyi Chen, Jianfu Zhang, Yan Hong, Yiyi Zhang, Liqing Zhang
TL;DR
The paper tackles biases in inpainting evaluation that arise when ground-truth unmasked references are required. It introduces a self-supervised, multi-pass re-inpainting framework that measures self-consistency across re-inpainted variants by applying a second mask and a second inpainting network, yielding the metric $D(F_1) = (1/K) \sum_{k=1}^K d(\hat{X}_1, \hat{X}_2^k)$. By using patch masks and LPIPS as the sub-metric, the method remains robust to different second-network choices and mask configurations, and it does not rely on the original unmasked image. Extensive experiments on Places2 with five diverse inpainting methods show that the proposed framework correlates well with human judgments and NR-IQA baselines while mitigating biases associated with traditional evaluation metrics.
Abstract
Image inpainting, the task of reconstructing missing segments in corrupted images using available data, faces challenges in ensuring consistency and fidelity, especially under information-scarce conditions. Traditional evaluation methods, heavily dependent on the existence of unmasked reference images, inherently favor certain inpainting outcomes, introducing biases. Addressing this issue, we introduce an innovative evaluation paradigm that utilizes a self-supervised metric based on multiple re-inpainting passes. This approach, diverging from conventional reliance on direct comparisons in pixel or feature space with original images, emphasizes the principle of self-consistency to enable the exploration of various viable inpainting solutions, effectively reducing biases. Our extensive experiments across numerous benchmarks validate the alignment of our evaluation method with human judgment.
