Inverse Problems with Learned Forward Operators
Simon Arridge, Andreas Hauptmann, Yury Korolev
TL;DR
This work analyzes two data-driven paradigms for inverse problems with learned forward operators: (i) regularisation by projection, which learns the forward operator restricted to a training subspace and uses projection-based minimisation, and (ii) data-driven model correction, which learns a correction to a cheaper approximate forward model. It develops the mathematical framework for both approaches, including convergence results for injective and non-injective cases, and extends to explicit forward–adjoint corrections with convergence guarantees under gradient-alignment conditions. The theory highlights that successful reconstructions benefit from training data not only for the forward operator but also for its adjoint, a recurring theme across sections. The chapter substantiates these ideas with PAT experiments, showing that learned forward/normal operators perform well under projection-based regularisation, while explicit model corrections can yield strong improvements when trained adequately (including trajectory-aware strategies). Overall, the work clarifies how data-driven corrections and subspace projections can yield computationally efficient yet accurate reconstructions, and it points to practical training strategies that leverage adjoint information and manifold-based approaches for scalability.
Abstract
Solving inverse problems requires the knowledge of the forward operator, but accurate models can be computationally expensive and hence cheaper variants that do not compromise the reconstruction quality are desired. This chapter reviews reconstruction methods in inverse problems with learned forward operators that follow two different paradigms. The first one is completely agnostic to the forward operator and learns its restriction to the subspace spanned by the training data. The framework of regularisation by projection is then used to find a reconstruction. The second one uses a simplified model of the physics of the measurement process and only relies on the training data to learn a model correction. We present the theory of these two approaches and compare them numerically. A common theme emerges: both methods require, or at least benefit from, training data not only for the forward operator, but also for its adjoint.
