Principal Component Flow Map Learning of PDEs from Incomplete, Limited, and Noisy Data
Victor Churchill
TL;DR
The paper tackles learning the evolution of high-dimensional PDEs from partially observed, noisy data by introducing PC-FML, a reduced-basis flow-map framework. It jointly learns a linear reduced representation and a residual flow map in that reduced space, allowing large reductions in parameter count and training data needs while preserving predictive accuracy. The method provides fixed, constrained, or unconstrained options for the reduction and demonstrates superior stability and denoising performance across six challenging 1D and 2D PDEs, including Burgers, shallow water, wave, and Navier–Stokes equations. The approach enables rapid, locally executable high-resolution simulations on incomplete grids, with potential extensions to enforce conservation laws and physics-based constraints for further improvements.
Abstract
We present a computational technique for modeling the evolution of dynamical systems in a reduced basis, with a focus on the challenging problem of modeling partially-observed partial differential equations (PDEs) on high-dimensional non-uniform grids. We address limitations of previous work on data-driven flow map learning in the sense that we focus on noisy and limited data to move toward data collection scenarios in real-world applications. Leveraging recent work on modeling PDEs in modal and nodal spaces, we present a neural network structure that is suitable for PDE modeling with noisy and limited data available only on a subset of the state variables or computational domain. In particular, spatial grid-point measurements are reduced using a learned linear transformation, after which the dynamics are learned in this reduced basis before being transformed back out to the nodal space. This approach yields a drastically reduced parameterization of the neural network compared with previous flow map models for nodal space learning. This allows for rapid high-resolution simulations, enabled by smaller training data sets and reduced training times.
