Stable Port-Hamiltonian Neural Networks
Fabian J. Roth, Dominik K. Klein, Maximilian Kannapinn, Jan Peters, Oliver Weeger
TL;DR
This work addresses the fragility of purely data-driven dynamic models by embedding physical priors into learning via stable port-Hamiltonian neural networks (sPHNN). By enforcing a convex, positive-definite Hamiltonian and energy-dissipative structure, the approach guarantees global Lyapunov stability (global asymptotic stability when dissipation is present) without relying on projection-based constraints. Across spinning rigid body dynamics, cascaded tanks, thermal food-processing surrogates, and additive-manufacturing simulations, sPHNNs achieve superior stability, accuracy, and generalization from sparse data, with a learnable-equilibrium variant (-LM) capable of inferring the equilibrium location when unknown. The results demonstrate that energy-based priors enable safe, data-efficient surrogate modeling for multi-physics systems and provide interpretable components by separating conservative, dissipative, and input-driven dynamics.
Abstract
In recent years, nonlinear dynamic system identification using artificial neural networks has garnered attention due to its broad potential applications across science and engineering. However, purely data-driven approaches often struggle with extrapolation and may yield physically implausible forecasts. Furthermore, the learned dynamics can exhibit instabilities, making it difficult to apply such models safely and robustly. This article introduces stable port-Hamiltonian neural networks, a machine learning architecture that incorporates physical biases of energy conservation and dissipation while ensuring global Lyapunov stability of the learned dynamics. Through illustrative and real-world examples, we demonstrate that these strong inductive biases facilitate robust learning of stable dynamics from sparse data, while avoiding instability and surpassing purely data-driven approaches in accuracy and physically meaningful generalization. Furthermore, the model's applicability and potential for data-driven surrogate modeling are showcased on multi-physics simulation data.
