Adaptive Gradient Enhanced Gaussian Process Surrogates for Inverse Problems
Phillip Semler, Martin Weiser
TL;DR
This work tackles the computational burden of constructing accurate surrogates for inverse problems with expensive forward models by employing gradient-enhanced Gaussian process regression (GEGPR) in a fully adaptive, budget-aware design. It introduces a two-part framework: (i) a gradient-inclusive GP surrogate that models both forward values and derivatives, and (ii) a greedy, two-stage design strategy that first selects promising evaluation points and then optimizes forward-model tolerances under a fixed work budget. The authors derive an accuracy model linking surrogate error to parameter reconstruction error and couple it with a power-law work model to guide adaptive data generation. Numerical experiments on an analytical 2D problem and a PDE-based scatterometry test demonstrate that including gradient information yields substantial efficiency gains (often two orders of magnitude) over value-only designs and over fixed-position approaches, with reconstruction achieving prescribed tolerances. The methodology enables more efficient offline data generation for surrogate-based inverse problem solvers, with potential impact on real-time parameter identification in computationally intensive applications.
Abstract
Generating simulated training data needed for constructing sufficiently accurate surrogate models to be used for efficient optimization or parameter identification can incur a huge computational effort in the offline phase. We consider a fully adaptive greedy approach to the computational design of experiments problem using gradient-enhanced Gaussian process regression as surrogates. Designs are incrementally defined by solving an optimization problem for accuracy given a certain computational budget. We address not only the choice of evaluation points but also of required simulation accuracy, both of values and gradients of the forward model. Numerical results show a significant reduction of the computational effort compared to just position-adaptive and static designs as well as a clear benefit of including gradient information into the surrogate training.
