Noise2Noise: Learning Image Restoration without Clean Data
Jaakko Lehtinen, Jacob Munkberg, Jon Hasselgren, Samuli Laine, Tero Karras, Miika Aittala, Timo Aila
TL;DR
Noise2Noise shows that high-quality image restoration can be learned from corrupted data alone by leveraging the fact that corrupted targets preserve the underlying clean signal in expectation under common losses. The authors provide a theoretical justification and validate it across Gaussian, Poisson, Bernoulli, Monte Carlo rendering, and MRI undersampling tasks, demonstrating performance on par with clean-target training in many cases. Empirically, training with noisy targets often matches or surpasses clean-target training and offers substantial practical advantages, such as faster data generation (e.g., Monte Carlo rendering) and reduced data-collection burden. This work broadens the data-efficiency frontier for learned restoration and suggests new directions for combining forward-model information with corrupted-data training.
Abstract
We apply basic statistical reasoning to signal reconstruction by machine learning -- learning to map corrupted observations to clean signals -- with a simple and powerful conclusion: it is possible to learn to restore images by only looking at corrupted examples, at performance at and sometimes exceeding training using clean data, without explicit image priors or likelihood models of the corruption. In practice, we show that a single model learns photographic noise removal, denoising synthetic Monte Carlo images, and reconstruction of undersampled MRI scans -- all corrupted by different processes -- based on noisy data only.
