Learned iterative networks: An operator learning perspective
Andreas Hauptmann, Ozan Öktem
TL;DR
This work presents a unified operator-learning perspective for learned iterative networks in inverse problems, separating the learned reconstruction operator from the learning problem to reveal structural commonalities across gradient-based, proximal, variational, and primal–dual unrolled methods. It argues for structured learning—through symmetries, simulator-informed design, and multi-scale architectures—to enable robust, scalable solutions to linear and nonlinear inverse problems. The chapter surveys neural-operator architectures (convolutional operators, transformers, FNOs), discusses training paradigms (supervised, self-supervised, unpaired, greedy), and details a spectrum of learned iterative schemes, including learned gradient, proximal, variational, and primal–dual networks, plus higher-order Newton-type methods. Numerical examples illustrate that, in linear settings, different update directions yield similar performance, whereas in nonlinear problems the choice of update direction substantially affects results. Overall, the work provides a cohesive framework to design and analyze learned reconstructions, highlighting practical considerations around training, domain adaptation, and computational feasibility.
Abstract
Learned image reconstruction has become a pillar in computational imaging and inverse problems. Among the most successful approaches are learned iterative networks, which are formulated by unrolling classical iterative optimisation algorithms for solving variational problems. While the underlying algorithm is usually formulated in the functional analytic setting, learned approaches are often viewed as purely discrete. In this chapter we present a unified operator view for learned iterative networks. Specifically, we formulate a learned reconstruction operator, defining how to compute, and separately the learning problem, which defines what to compute. In this setting we present common approaches and show that many approaches are closely related in their core. We review linear as well as nonlinear inverse problems in this framework and present a short numerical study to conclude.
