Efficient Deconvolution in Populational Inverse Problems
Arnaud Vadeboncoeur, Mark Girolami, Andrew M. Stuart
TL;DR
This work develops a framework for distributional inversion across populations of physical systems where the observational noise is unknown. It jointly learns a parameter distribution for PDEs and the noise distribution by minimizing a Wasserstein-based discrepancy between the data and model distributions, while employing a robust cut-gradient flow and an active-learning surrogate to handle expensive, potentially non-differentiable PDE solvers. The methodology is validated on porous media flow, elastodynamics, and time-averaged chaotic dynamics (Lorenz-96), demonstrating accurate recovery of noise characteristics and model parameters under both uncorrelated and correlated noise and across varied data regimes. The approach promises scalable, data-efficient inference for complex, population-based inverse problems in physics and climate modelling, with practical implications for uncertainty quantification and model calibration in engineering and geoscience contexts.
Abstract
This work is focussed on the inversion task of inferring the distribution over parameters of interest leading to multiple sets of observations. The potential to solve such distributional inversion problems is driven by increasing availability of data, but a major roadblock is blind deconvolution, arising when the observational noise distribution is unknown. However, when data originates from collections of physical systems, a population, it is possible to leverage this information to perform deconvolution. To this end, we propose a methodology leveraging large data sets of observations, collected from different instantiations of the same physical processes, to simultaneously deconvolve the data corrupting noise distribution, and to identify the distribution over model parameters defining the physical processes. A parameter-dependent mathematical model of the physical process is employed. A loss function characterizing the match between the observed data and the output of the mathematical model is defined; it is minimized as a function of the both the parameter inputs to the model of the physics and the parameterized observational noise. This coupled problem is addressed with a modified gradient descent algorithm that leverages specific structure in the noise model. Furthermore, a new active learning scheme is proposed, based on adaptive empirical measures, to train a surrogate model to be accurate in parameter regions of interest; this approach accelerates computation and enables automatic differentiation of black-box, potentially nondifferentiable, code computing parameter-to-solution maps. The proposed methodology is demonstrated on porous medium flow, damped elastodynamics, and simplified models of atmospheric dynamics.
