Diffusion Model-Based Posterior Sampling in Full Waveform Inversion
Mohammad H. Taufik, Tariq Alkhalifah
TL;DR
This work addresses uncertainty quantification in large-scale full waveform inversion by coupling an unconditional diffusion prior with encoded-shot likelihoods to enable posterior sampling with substantially fewer wave-equation solves. It introduces a decoupled sampling workflow that, at each diffusion level, first predicts a clean model, then performs a short stochastic Langevin refinement under the wave-equation data term, and finally re-noises to the next level, preserving mixing across levels. The method leverages encoded simultaneous-source data to reduce forward/adjoint solves while using a diffusion prior to suppress source-related artefacts, and it trains priors in 2D patches and 3D cubes for scalability. Compared to a strong SVGD baseline, the approach yields lower model and data misfits, better posterior calibration, and markedly lower computational cost, with demonstrated robustness across 2D synthetics and 3D upscaling. The framework enables calibrated posterior inference on shot records for large-scale 2D and 3D FWI and suggests avenues for efficiency gains and broader guidance integration.
Abstract
Bayesian full waveform inversion (FWI) offers uncertainty-aware subsurface models; however, posterior sampling directly on observed seismic shot records is rarely practical at the field scale because each sample requires numerous wave-equation solves. We aim to make such sampling feasible for large surveys while preserving calibration, that is, high uncertainty in less illuminated areas. Our approach couples diffusion-based posterior sampling with simultaneous-source FWI data. At each diffusion noise level, a network predicts a clean velocity model. We then apply a stochastic refinement step in model space using Langevin dynamics under the wave-equation likelihood and reintroduce noise to decouple successive levels before proceeding. Simultaneous-source batches reduce forward and adjoint solves approximately in proportion to the supergather size, while an unconditional diffusion prior trained on velocity patches and volumes helps suppress source-related numerical artefacts. We evaluate the method on three 2D synthetic datasets (SEG/EAGE Overthrust, SEG/EAGE Salt, SEAM Arid), a 2D field line, and a 3D upscaling study. Relative to a particle-based variational baseline, namely Stein variational gradient descent without a learned prior and with single-source (non-simultaneous-source) FWI, our sampler achieves lower model error and better data fit at a substantially reduced computational cost. By aligning encoded-shot likelihoods with diffusion-based sampling and exploiting straightforward parallelization over samples and source batches, the method provides a practical path to calibrated posterior inference on observed shot records that scales to large 2D and 3D problems.
