Pseudo-Hamiltonian system identification
Sigurd Holmsen, Sølve Eidnes, Signe Riemer-Sørensen
TL;DR
The paper tackles the challenge of identifying dynamical systems from data when external disturbances and damping obscure the internal dynamics. It introduces pseudo-Hamiltonian system identification (PHSI), learning the internal Hamiltonian $H$, damping $R$, and external forces $F$ (with a neural network for $F$ in a hybrid model) under a known structure matrix $S$ and a loss based on a fourth-order symmetric integrator: $\hat{g}_{\theta}(x,t) = (\hat{S}_{\theta}(x) - \hat{R}_{\theta}(x)) \nabla \hat{H}_{\theta}(x) + \hat{F}_{\theta}(x,t)$. Key contributions include pruning and $L_1$ regularization to promote sparsity and identifiability, robust performance on noisy data, and the ability to separate internal dynamics from external forces across separable, non-separable, and pseudo-Hamiltonian systems, with demonstrations on mass–spring, Hénon–Heiles, nonlinear Schrödinger, and tank-leak networks. The approach yields improved extrapolation and interpretability over baseline system identification methods and is supported by open-source code, enabling broader application to realistic dynamical problems. Overall, PHSI broadens the applicability of physics-informed structure learning to complex, real-world systems where disturbances cannot be neglected.
Abstract
Identifying the underlying dynamics of physical systems can be challenging when only provided with observational data. In this work, we consider systems that can be modelled as first-order ordinary differential equations. By assuming a certain pseudo-Hamiltonian formulation, we are able to learn the analytic terms of internal dynamics even if the model is trained on data where the system is affected by unknown damping and external disturbances. In cases where it is difficult to find analytic terms for the disturbances, a hybrid model that uses a neural network to learn these can still accurately identify the dynamics of the system as if under ideal conditions. This makes the models applicable in some situations where other system identification models fail. Furthermore, we propose to use a fourth-order symmetric integration scheme in the loss function and avoid actual integration in the training, and demonstrate on varied examples how this leads to increased performance on noisy data.
