Physics-informed neural networks for the shallow-water equations on the sphere
Alex Bihlo, Roman O. Popovych
TL;DR
The paper demonstrates that physics-informed neural networks can solve the shallow-water equations on the sphere in a meteorological setting, addressing long-time integration challenges with a practical multi-model time-splitting approach and hard-boundary enforcement. By partitioning the integration interval into consecutive subintervals and training a sequence of networks, the method achieves accurate representations of advection, geostrophic balance, mountain-induced flows, and Rossby–Haurwitz waves using far fewer collocation points than traditional solvers. Results on Williamson et al. benchmarks show promising accuracy and meshless evaluation benefits, while discussions highlight limitations related to conservation properties and the need for parameterizations in realistic forecasting. The work points to future enhancements, including enforcing conservation laws as hard constraints and exploring parallel-in-time strategies to further reduce training times for operational-scale models.
Abstract
We propose the use of physics-informed neural networks for solving the shallow-water equations on the sphere in the meteorological context. Physics-informed neural networks are trained to satisfy the differential equations along with the prescribed initial and boundary data, and thus can be seen as an alternative approach to solving differential equations compared to traditional numerical approaches such as finite difference, finite volume or spectral methods. We discuss the training difficulties of physics-informed neural networks for the shallow-water equations on the sphere and propose a simple multi-model approach to tackle test cases of comparatively long time intervals. Here we train a sequence of neural networks instead of a single neural network for the entire integration interval. We also avoid the use of a boundary value loss by encoding the boundary conditions in a custom neural network layer. We illustrate the abilities of the method by solving the most prominent test cases proposed by Williamson et al. [J. Comput. Phys. 102 (1992), 211-224].
