Learning of discrete models of variational PDEs from data
Christian Offen, Sina Ober-Blöbaum
TL;DR
This work presents a framework for learning discrete field theories directly on space-time lattices: a neural network is trained to represent a discrete Lagrangian density $L_d$ whose discrete Euler–Lagrange (DEL) equations reproduce observed field data. A data-consistency loss combined with numerically informed regularisers guides the model toward nondegenerate, stable DEL systems, enabling accurate forward propagation and robust extrapolation. The approach preserves variational structure and, via Palais' principle of symmetric criticality, naturally captures highly symmetric solutions such as travelling waves, even when they are absent from the training data. This contrasts with traditional model order reduction (MOR), which projects to latent variables. Demonstrations on the wave and Schrödinger equations show effective data fitting, accurate travelling-wave identification, and competitive performance against MOR baselines, with potential impact on structure-preserving surrogates for PDEs and conservation-law discovery.
Abstract
We show how to learn discrete field theories from observational data of fields on a space-time lattice. For this, we train a neural network model of a discrete Lagrangian density such that the discrete Euler–Lagrange equations are consistent with the given training data. We thus obtain a structure-preserving machine learning architecture. Lagrangian densities are not uniquely defined by the solutions of a field theory. We introduce a technique to derive regularisers for the training process which optimise the numerical regularity of the discrete field theory. Minimisation of the regularisers guarantees that, close to the training data, the discrete field theory behaves robustly and efficiently when used in numerical simulations. Further, we show how to identify structurally simple solutions of the underlying continuous field theory, such as travelling waves. This is possible even when travelling waves are not present in the training data. We compare this with approaches based on data-driven model order reduction, which struggle to identify suitable latent spaces containing structurally simple solutions when these are not present in the training data. The ideas are demonstrated on examples based on the wave equation and the Schrödinger equation.
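To make the notion of data consistency concrete, the following is a minimal sketch in Python with NumPy of how discrete Euler–Lagrange equations can be evaluated on lattice data. It rests on assumptions that are not stated in the abstract: a hand-written three-point discrete Lagrangian for the wave equation $u_{tt} = u_{xx}$ stands in for the neural network model, space is taken to be periodic, and the names `L_d`, `grad_L_d`, `del_residual` and `data_loss` are hypothetical. In a learned model the partial derivatives would come from automatic differentiation rather than the closed-form expressions used here.

```python
import numpy as np

def L_d(a, b, c, dt, dx):
    """Hypothetical three-point discrete Lagrangian for u_tt = u_xx.
    a = u[i, j], b = u[i+1, j], c = u[i, j+1] (i: time, j: space)."""
    return 0.5 * ((b - a) / dt) ** 2 - 0.5 * ((c - a) / dx) ** 2

def grad_L_d(a, b, c, dt, dx):
    """Closed-form partial derivatives of L_d with respect to a, b and c."""
    d1 = -(b - a) / dt**2 + (c - a) / dx**2
    d2 = (b - a) / dt**2
    d3 = -(c - a) / dx**2
    return d1, d2, d3

def del_residual(u, dt, dx):
    """Discrete Euler-Lagrange residual at all interior time steps.
    u has shape (n_time, n_space); space is assumed periodic.
    The value u[i, j] enters three stencils of the discrete action,
    once in each argument slot of L_d, and the residual sums the
    corresponding partial derivatives."""
    right = np.roll(u, -1, axis=1)  # u[i, j+1]
    left = np.roll(u, 1, axis=1)    # u[i, j-1]
    d1, _, _ = grad_L_d(u[1:-1], u[2:], right[1:-1], dt, dx)
    _, d2, _ = grad_L_d(u[:-2], u[1:-1], right[:-2], dt, dx)
    _, _, d3 = grad_L_d(left[1:-1], left[2:], u[1:-1], dt, dx)
    return d1 + d2 + d3

def data_loss(u, dt, dx):
    """Mean squared DEL residual: small when the data is consistent with L_d."""
    return np.mean(del_residual(u, dt, dx) ** 2)

# Lattice samples of the travelling wave u(t, x) = sin(2*pi*(x - t)).
n_space, n_time = 32, 10
dx = dt = 1.0 / n_space
t = np.arange(n_time) * dt
x = np.arange(n_space) * dx
u_wave = np.sin(2 * np.pi * (x[None, :] - t[:, None]))

# The same samples with small noise added, which breaks consistency.
rng = np.random.default_rng(0)
u_noisy = u_wave + 0.01 * rng.standard_normal(u_wave.shape)

loss_wave = data_loss(u_wave, dt, dx)
loss_noisy = data_loss(u_noisy, dt, dx)
print(loss_wave, loss_noisy)
```

For this particular Lagrangian the residual reduces to the familiar five-point leapfrog stencil, so the travelling-wave samples give a loss at the level of rounding error, while the perturbed samples do not. The sketch covers only the data-consistency term; the regularisers that select a numerically well-behaved Lagrangian among the many consistent ones are not represented.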
