Estimating and using information in inverse problems
Wolfgang Bangerth, Chris R. Johnson, Dennis K. Njeru, Bart van Bloemen Waanders
TL;DR
This work introduces information density as a spatially varying measure of how well coefficients in inverse problems can be recovered from indirect measurements, grounded in Bayesian Fisher information and the Cramér–Rao bound. It develops finite- and infinite-dimensional formulations, defines per-cell information j_k and density j(x), and shows how these quantities can inform regularization, discretization, and experimental design. Through a two-dimensional advection–diffusion test problem, the authors demonstrate that meshes refined by information content yield better-posed discretizations and more informative reconstructions than conventional refinement criteria. The approach provides a pre-measurement, physics-based tool to allocate computational and data-collection resources efficiently, with extensions to nonlinear problems and scalable computation discussed for future work.
Abstract
In inverse problems, one attempts to infer spatially variable functions from indirect measurements of a system. To practitioners of inverse problems, the concept of "information" is familiar when discussing key questions such as which parts of the function can be inferred accurately and which cannot. For example, it is generally understood that we can identify system parameters accurately only close to detectors, or along ray paths between sources and detectors, because we have "the most information" for these places. Although referenced in many publications, the "information" that is invoked in such contexts is not a well understood and clearly defined quantity. Herein, we present a definition of information density that is based on the variance of coefficients as derived from a Bayesian reformulation of the inverse problem. We then discuss three areas in which this information density can be useful in practical algorithms for the solution of inverse problems, and illustrate the usefulness in one of these areas -- how to choose the discretization mesh for the function to be reconstructed -- using numerical experiments.
