Deep Image Prior for Computed Tomography Reconstruction
Simon Arridge, Riccardo Barbano, Alexander Denker, Zeljko Kereta
TL;DR
The paper surveys the Deep Image Prior (DIP) framework for CT reconstruction, emphasizing its unsupervised, data-efficient nature that relies on a CNN’s architectural bias rather than large paired datasets. It reviews the vanilla DIP formulation and a broad set of extensions to mitigate overfitting and reduce computation, including explicit TV regularisation, learned denoisers (RED), self-guidance, warm-starts, subspace constraints, and stochastic optimization. The authors connect DIP behavior to theoretical analyses (Analytic Deep Prior and neural tangent kernel perspectives) and discuss under-/over-parametrised regimes, highlighting the spectral bias that governs reconstruction quality. They provide a comprehensive experimental comparison on real μCT walnut data, showing that regularised and denoising variants can outperform vanilla DIP and classical TV, with tradeoffs in speed and robustness. The work positions DIP as a data-efficient alternative for inverse CT and outlines future directions such as 3D extensions, diffusion-based priors, and deeper theoretical grounding of architectural priors.
Abstract
We present a comprehensive overview of the Deep Image Prior (DIP) framework and its applications to image reconstruction in computed tomography. Unlike conventional deep learning methods that rely on large, supervised datasets, the DIP exploits the implicit bias of convolutional neural networks and operates in a fully unsupervised setting, requiring only a single measurement, even in the presence of noise. We describe the standard DIP formulation, outline key algorithmic design choices, and review several strategies to mitigate overfitting, including early stopping, explicit regularisation, and self-guided methods that adapt the network input. In addition, we examine computational improvements such as warm-start and stochastic optimisation methods to reduce the reconstruction time. The discussed methods are tested on real $μ$CT measurements, which allows examination of trade-offs among the different modifications and extensions.
