Table of Contents
Fetching ...

Bayesian learning with Gaussian processes for low-dimensional representations of time-dependent nonlinear systems

Shane A. McQuarrie, Anirban Chaudhuri, Karen E. Willcox, Mengwu Guo

TL;DR

This work develops GP-BayesOpInf, a physics-aware probabilistic surrogate for time-dependent nonlinear systems that couples Gaussian process regression with operator inference to produce reduced-order models with quantified uncertainty. By modeling reduced states with GPs and enforcing the reduced dynamics through a Bayesian residual-based likelihood, the method yields a closed-form posterior for the reduced operators and propagates data uncertainty to ROM predictions via Monte Carlo sampling. The approach is demonstrated on compressible Euler and nonlinear diffusion-reaction PDEs, including scenarios with noisy and sparse data and multiple trajectories, and extended to Bayesian parameter estimation for ODEs such as SEIRD. Results show accurate training fits and meaningful predictive uncertainty, with robustness to data quality and the ability to extrapolate to new parameter regimes. The framework enables efficient, many-query simulations with embedded uncertainty quantification for complex physics-based systems.

Abstract

This work presents a data-driven method for learning low-dimensional time-dependent physics-based surrogate models whose predictions are endowed with uncertainty estimates. We use the operator inference approach to model reduction that poses the problem of learning low-dimensional model terms as a regression of state space data and corresponding time derivatives by minimizing the residual of reduced system equations. Standard operator inference models perform well with accurate training data that are dense in time, but producing stable and accurate models when the state data are noisy and/or sparse in time remains a challenge. Another challenge is the lack of uncertainty estimation for the predictions from the operator inference models. Our approach addresses these challenges by incorporating Gaussian process surrogates into the operator inference framework to (1) probabilistically describe uncertainties in the state predictions and (2) procure analytical time derivative estimates with quantified uncertainties. The formulation leads to a generalized least-squares regression and, ultimately, reduced-order models that are described probabilistically with a closed-form expression for the posterior distribution of the operators. The resulting probabilistic surrogate model propagates uncertainties from the observed state data to reduced-order predictions. We demonstrate the method is effective for constructing low-dimensional models of two nonlinear partial differential equations representing a compressible flow and a nonlinear diffusion-reaction process, as well as for estimating the parameters of a low-dimensional system of nonlinear ordinary differential equations representing compartmental models in epidemiology.

Bayesian learning with Gaussian processes for low-dimensional representations of time-dependent nonlinear systems

TL;DR

This work develops GP-BayesOpInf, a physics-aware probabilistic surrogate for time-dependent nonlinear systems that couples Gaussian process regression with operator inference to produce reduced-order models with quantified uncertainty. By modeling reduced states with GPs and enforcing the reduced dynamics through a Bayesian residual-based likelihood, the method yields a closed-form posterior for the reduced operators and propagates data uncertainty to ROM predictions via Monte Carlo sampling. The approach is demonstrated on compressible Euler and nonlinear diffusion-reaction PDEs, including scenarios with noisy and sparse data and multiple trajectories, and extended to Bayesian parameter estimation for ODEs such as SEIRD. Results show accurate training fits and meaningful predictive uncertainty, with robustness to data quality and the ability to extrapolate to new parameter regimes. The framework enables efficient, many-query simulations with embedded uncertainty quantification for complex physics-based systems.

Abstract

This work presents a data-driven method for learning low-dimensional time-dependent physics-based surrogate models whose predictions are endowed with uncertainty estimates. We use the operator inference approach to model reduction that poses the problem of learning low-dimensional model terms as a regression of state space data and corresponding time derivatives by minimizing the residual of reduced system equations. Standard operator inference models perform well with accurate training data that are dense in time, but producing stable and accurate models when the state data are noisy and/or sparse in time remains a challenge. Another challenge is the lack of uncertainty estimation for the predictions from the operator inference models. Our approach addresses these challenges by incorporating Gaussian process surrogates into the operator inference framework to (1) probabilistically describe uncertainties in the state predictions and (2) procure analytical time derivative estimates with quantified uncertainties. The formulation leads to a generalized least-squares regression and, ultimately, reduced-order models that are described probabilistically with a closed-form expression for the posterior distribution of the operators. The resulting probabilistic surrogate model propagates uncertainties from the observed state data to reduced-order predictions. We demonstrate the method is effective for constructing low-dimensional models of two nonlinear partial differential equations representing a compressible flow and a nonlinear diffusion-reaction process, as well as for estimating the parameters of a low-dimensional system of nonlinear ordinary differential equations representing compartmental models in epidemiology.
Paper Structure (35 sections, 69 equations, 19 figures, 1 table, 8 algorithms)

This paper contains 35 sections, 69 equations, 19 figures, 1 table, 8 algorithms.

Figures (19)

  • Figure 1: Flow diagram of GP-BayesOpInf via \ref{['alg:GP-BayesOpInf']}.
  • Figure 2: Initial conditions for the Euler system \ref{['eq:euler-conservative']}--\ref{['eq:periodicBCs']} (left) and noisy state observations (right) expressed in the specific volume variables $(v,p,\zeta)$. The initial conditions are constructed with periodic splines interpolating the marked points. On the right, the dashed lines show the numerical solution without noise as a function of time at four fixed points in space; the markers indicate $m = 50$ sparsely observed states with $\xi = 1\%$ relative noise. Though data are only shown here at select spatial locations, they are observed at every point in the spatial discretization.
  • Figure 3: Singular value decay of $m=200$ snapshots with $\xi=3\%$ relative noise (left) and trained GPs for reduced modes $5$, $6$, and $7$, compressed via centered POD (right). The shaded regions are within three standard deviations of each GP mean. The POD compression filters much of the noise in the dominant modes, hence each GP exhibits a tight fit to the data until the spectral gap after $r=6$ modes.
  • Figure 4: True reduced state time derivatives, as well as GP and finite difference estimates of the time derivatives, using $m = 200$ observations with $\xi = 3\%$ relative noise. The time derivative GPs have small variances and produce highly accurate estimates (left), unlike the finite difference estimates, which in some instances differ from the truth by an order of magnitude (right).
  • Figure 5: The dominant mode of the Bayesian ROM predictions for the Euler system \ref{['eq:euler-conservative']}--\ref{['eq:periodicBCs']} using $m=200$ observations over the time domain $[0, 0.06]$ with $\xi=3\%$ relative noise. Fifty samples are taken of the posterior operator distribution and the corresponding ROM predictions are displayed as a function of time (left). The uncertainty of the prediction distribution is characterized by computing the mean and the $95\%$ interquantile range (IQR) of the sample predictions at each point in time (right).
  • ...and 14 more figures

Theorems & Definitions (6)

  • Remark 3.1: Choosing $m'$
  • Remark 3.2: Interpretation of the likelihood
  • Remark 3.3: Gaussianity preserved by affine reconstruction
  • Remark 3.4: Regularization of the covariance computation
  • Remark 3.5: Parametric problems
  • Remark 6.1: Equation fitting vs. trajectory fitting