Machine Learning for Inverse Problems and Data Assimilation
Eviatar Bach, Ricardo Baptista, Daniel Sanz-Alonso, Andrew Stuart
TL;DR
The notes articulate a unified framework for applying machine learning to inverse problems and data assimilation within a Bayesian setting. They develop variational and transport-based methods to learn priors, surrogates, and posterior maps, and prove stability and approximation guarantees, including posterior closeness under forward-model or likelihood perturbations. A central theme is learning and amortizing computation via surrogate forward models, pushforward priors, and transport maps to accelerate inference and sampling, with explicit attention to model error and data dependence. The data-assimilation portion then situates these ideas in sequential settings, detailing classical filters (Kalman variants) and particle methods, along with practical enhancements like inflation and localization. Overall, the work provides both theoretical foundations and algorithmic strategies for integrating ML into Bayesian inverse problems and data assimilation, with broad implications for efficient, robust, and scalable inference in complex systems.
Abstract
The aim of these notes is to demonstrate the potential for ideas in machine learning to impact on the fields of inverse problems and data assimilation. The perspective is one that is primarily aimed at researchers from inverse problems and/or data assimilation who wish to see a mathematical presentation of machine learning as it pertains to their fields. As a by-product, we include a succinct mathematical treatment of various fundamental underpinning topics in machine learning, and adjacent areas of (computational) mathematics.
