Real-time Bayesian inference at extreme scale: A digital twin for tsunami early warning applied to the Cascadia subduction zone
Stefan Henneking, Sreeram Venkat, Veselin Dobrev, John Camier, Tzanio Kolev, Milinda Fernando, Alice-Agnes Gabriel, Omar Ghattas
TL;DR
This work develops a real-time Bayesian digital twin for tsunami early warning in the Cascadia subduction zone by coupling seafloor acoustic data with a 3D acoustic–gravity PDE. It achieves an offline-online decomposition and FFT-based Hessian matvecs that convert an intractable billion-parameter inverse problem into real-time inference and forecasting, scaling efficiently on up to 43,520 GPUs. The online phase resolves the MAP estimate and QoI predictions in under 0.2 seconds, after an offline precomputation of adjoint solutions and data-space transforms that would otherwise require prohibitive PDE solves. The approach delivers quantified uncertainty in tsunami forecasts and holds promise for real-world deployment, with potential extensions to broader geophysical and inverse-scattering problems.
Abstract
We present a Bayesian inversion-based digital twin that employs acoustic pressure data from seafloor sensors, along with 3D coupled acoustic-gravity wave equations, to infer earthquake-induced spatiotemporal seafloor motion in real time and forecast tsunami propagation toward coastlines for early warning with quantified uncertainties. Our target is the Cascadia subduction zone, with one billion parameters. Computing the posterior mean alone would require 50 years on a 512 GPU machine. Instead, exploiting the shift invariance of the parameter-to-observable map and devising novel parallel algorithms, we induce a fast offline-online decomposition. The offline component requires just one adjoint wave propagation per sensor; using MFEM, we scale this part of the computation to the full El Capitan system (43,520 GPUs) with 92% weak parallel efficiency. Moreover, given real-time data, the online component exactly solves the Bayesian inverse and forecasting problems in 0.2 seconds on a modest GPU system, a ten-billion-fold speedup.
