Deep Internal Learning: Deep Learning from a Single Input
Tom Tirer, Raja Giryes, Se Young Chun, Yonina C. Eldar
TL;DR
This survey addresses deep internal learning, where a DNN is trained from a single input $x_0$ or adapted at inference time, to restore or generate signals when external data are scarce or mismatched. It surveys architecture-based approaches (e.g., Deep Image Prior, Deep Decoder, ZSSR, SinGAN) and optimization-based variants (e.g., DIP-SGLD, Self2Self, SURE/GSURE, BP-TV, PnP/RED), and then discusses test-time adaptation of pretrained models (IDBP-CNN-IA, IAGAN, diffusion-based ADIR) alongside meta-learning strategies (MZSR, MLSR) to reduce fine-tuning costs. A central theme is exploiting self-similarity and the implicit bias of overparameterized networks as priors, enabling high-quality restoration with minimal or no external data. The paper highlights practical trade-offs, such as the need for early stopping in pure internal learning and the computational burden of test-time adaptation, and points to open questions in theoretical guarantees and blind settings. Altogether, the survey frames a bridge between traditional signal-processing priors and modern deep-learning techniques to enable robust, data-efficient reconstruction and editing across imaging modalities and signals.
Abstract
Deep learning, in general, focuses on training a neural network from large labeled datasets. Yet, in many cases there is value in training a network just from the input at hand. This is particularly relevant in many signal and image processing problems where training data is scarce and diversity is large on the one hand, and on the other, there is a lot of structure in the data that can be exploited. Using this information is the key to deep internal-learning strategies, which may involve training a network from scratch using a single input or adapting an already trained network to a provided input example at inference time. This survey paper aims at covering deep internal-learning techniques that have been proposed in the past few years for these two important directions. While our main focus will be on image processing problems, most of the approaches that we survey are derived for general signals (vectors with recurring patterns that can be distinguished from noise) and are therefore applicable to other modalities.
