Table of Contents
Fetching ...

Efficient sampling approaches based on generalized Golub-Kahan methods for large-scale hierarchical Bayesian inverse problems

Elle Buser, Julianne Chung

TL;DR

This work tackles uncertainty quantification for large-scale hierarchical Bayesian inverse problems by developing Metropolis-Hastings independence samplers within a Gibbs framework. It introduces two genGK-based proposal strategies: a low-rank genGK approximation to the conditional covariance and a preconditioned Lanczos method for exact square-root sampling of the conditional covariance, both designed to circumvent expensive high-dimensional Gaussian draws when the prior covariance %5Cmathbf{Q}%5C is large or ill-conditioned. The authors provide explicit formulations for acceptance ratios and show how genGK subspaces can be reused across hyperparameters, enabling scalable sampling across dynamic, seismic, and atmospheric inverse problems. Numerical results on seismic tomography, atmospheric transport, and dynamic photoacoustic tomography demonstrate competitive acceptance rates, effective sample sizes, and good equilibrium behavior, validating the methods’ applicability to real-world, large-scale UQ tasks.

Abstract

Uncertainty quantification for large-scale inverse problems remains a challenging task. For linear inverse problems with additive Gaussian noise and Gaussian priors, the posterior is Gaussian but sampling can be challenging, especially for problems with a very large number of unknown parameters (e.g., dynamic inverse problems) and for problems where computation of the square root and inverse of the prior covariance matrix are not feasible. Moreover, for hierarchical problems where several hyperparameters that define the prior and the noise model must be estimated from the data, the posterior distribution may no longer be Gaussian, even if the forward operator is linear. Performing large-scale uncertainty quantification for these hierarchical settings requires new computational techniques. In this work, we consider a hierarchical Bayesian framework where both the noise and prior variance are modeled as hyperparameters. Our approach uses Metropolis-Hastings independence sampling within Gibbs where the proposal distribution is based on generalized Golub-Kahan based methods. We consider two proposal samplers, one that uses a low rank approximation to the conditional covariance matrix and another that uses a preconditioned Lanczos method. Numerical examples from seismic imaging, dynamic photoacoustic tomography, and atmospheric inverse modeling demonstrate the effectiveness of the described approaches.

Efficient sampling approaches based on generalized Golub-Kahan methods for large-scale hierarchical Bayesian inverse problems

TL;DR

This work tackles uncertainty quantification for large-scale hierarchical Bayesian inverse problems by developing Metropolis-Hastings independence samplers within a Gibbs framework. It introduces two genGK-based proposal strategies: a low-rank genGK approximation to the conditional covariance and a preconditioned Lanczos method for exact square-root sampling of the conditional covariance, both designed to circumvent expensive high-dimensional Gaussian draws when the prior covariance %5Cmathbf{Q}%5C is large or ill-conditioned. The authors provide explicit formulations for acceptance ratios and show how genGK subspaces can be reused across hyperparameters, enabling scalable sampling across dynamic, seismic, and atmospheric inverse problems. Numerical results on seismic tomography, atmospheric transport, and dynamic photoacoustic tomography demonstrate competitive acceptance rates, effective sample sizes, and good equilibrium behavior, validating the methods’ applicability to real-world, large-scale UQ tasks.

Abstract

Uncertainty quantification for large-scale inverse problems remains a challenging task. For linear inverse problems with additive Gaussian noise and Gaussian priors, the posterior is Gaussian but sampling can be challenging, especially for problems with a very large number of unknown parameters (e.g., dynamic inverse problems) and for problems where computation of the square root and inverse of the prior covariance matrix are not feasible. Moreover, for hierarchical problems where several hyperparameters that define the prior and the noise model must be estimated from the data, the posterior distribution may no longer be Gaussian, even if the forward operator is linear. Performing large-scale uncertainty quantification for these hierarchical settings requires new computational techniques. In this work, we consider a hierarchical Bayesian framework where both the noise and prior variance are modeled as hyperparameters. Our approach uses Metropolis-Hastings independence sampling within Gibbs where the proposal distribution is based on generalized Golub-Kahan based methods. We consider two proposal samplers, one that uses a low rank approximation to the conditional covariance matrix and another that uses a preconditioned Lanczos method. Numerical examples from seismic imaging, dynamic photoacoustic tomography, and atmospheric inverse modeling demonstrate the effectiveness of the described approaches.

Paper Structure

This paper contains 16 sections, 57 equations, 5 figures, 7 tables, 4 algorithms.

Figures (5)

  • Figure 1: True reconstruction for $36\times36$PRspherical seismic tomography test problem
  • Figure 2: True atmospheric image and results for $T=500$ samples from Algorithm \ref{['alg:gibbs_genGK']} for the atmospheric transport problem with an approximate covariance matrix of rank $k=750$. The mean and variance are taken over the 315/500 accepted samples after burn-in. The $\lambda$ and $\delta$ distributions are over all samples after burn-in.
  • Figure 3: True $64\times 64$ images for the dynamic photoacoustic tomography problem at time points $i=1,5,10,15, \rm and \ 20$.
  • Figure 4: Results for 500/500 accepted samples from Algorithm \ref{['alg:gibbs_precond']} for the dynamic photoacoustic tomography test problem with an approximate mean with $k=1,000$ genGK iterations for time points $i=1,5,10,15,$ and $20$. Top: Random sample from the prior. Middle: Means of accepted samples after burn-in. Bottom: Variances of accepted samples after burn-in.
  • Figure 5: $\lambda$ and $\delta$ distributions of $T=500$ samples from Algorithm \ref{['alg:gibbs_precond']} for the dynamic photoacoustic tomography test problem with an approximate mean with $k=1,000$ genGK iterations