Adaptive Physics Transformer with Fused Global-Local Attention for Subsurface Energy Systems
Xin Ju, Nok Hei, Fung, Yuyan Zhang, Carl Jacquemyn, Matthew Jackson, Randolph Settgast, Sally M. Benson, Gege Wen
TL;DR
The paper tackles the computational cost and rigidity of full-physics subsurface simulations by introducing the Adaptive Physics Transformer (APT), a mesh-agnostic neural operator that fuses global attention with local graph-based encoding to capture multi-scale subsurface dynamics. By directly learning from adaptive meshes and enabling cross-dataset pretraining, APT achieves state-of-the-art accuracy across irregular, Cartesian, and adaptive grids, while delivering substantial speedups over traditional simulators. Key contributions include the fused Global Perceiver and Local GNO encoder design, a gated fusion mechanism, direct learning from dynamic meshes, and demonstrated robustness in out-of-distribution and cross-dataset settings, positioning APT as a scalable backbone for subsurface foundation models. The practical impact lies in enabling fast, accurate, and transferable surrogate models for carbon storage, geothermal energy, and hydrocarbon systems, thereby accelerating decision-making and energy-transition deployment.
Abstract
The Earth's subsurface is a cornerstone of modern society, providing essential energy resources like hydrocarbons, geothermal, and minerals while serving as the primary reservoir for $CO_2$ sequestration. However, full physics numerical simulations of these systems are notoriously computationally expensive due to geological heterogeneity, high resolution requirements, and the tight coupling of physical processes with distinct propagation time scales. Here we propose the \textbf{Adaptive Physics Transformer} (APT), a geometry-, mesh-, and physics-agnostic neural operator that explicitly addresses these challenges. APT fuses a graph-based encoder to extract high-resolution local heterogeneous features with a global attention mechanism to resolve long-range physical impacts. Our results demonstrate that APT outperforms state-of-the-art architectures in subsurface tasks across both regular and irregular grids with robust super-resolution capabilities. Notably, APT is the first architecture that directly learns from adaptive mesh refinement simulations. We also demonstrate APT's capability for cross-dataset learning, positioning it as a robust and scalable backbone for large-scale subsurface foundation model development.
