Neural DAEs: Constrained neural networks

Tue Boesen; Eldad Haber; Uri Michael Ascher

Neural DAEs: Constrained neural networks

Tue Boesen, Eldad Haber, Uri Michael Ascher

TL;DR

This work addresses how to preserve algebraic constraint information in neural network models of dynamical systems by embedding constraint data into architectures rather than relying on data alone. It introduces four approaches—auxiliary regularization, end constraints, smooth constraints, and stabilization by penalty—to enforce constraints expressed as $c(K z(tau)) = 0$, demonstrating their effectiveness on multi-body pendulums, water MD simulations, and divergence-free vector-field denoising. Across these domains, projection-based methods, especially when applied smoothly through the network, yield the strongest improvements in constraint satisfaction and often improve predictive accuracy, highlighting the practical impact of constraint-aware learning in physics-informed contexts. The results suggest a robust, architecture-agnostic strategy for integrating hard physical constraints into neural models, with tradeoffs in computational cost and implementation complexity and clear directions for future refinement, such as advanced solvers, analytic backpropagation for projections, and higher-order constraint information.

Abstract

This article investigates the effect of explicitly adding auxiliary algebraic trajectory information to neural networks for dynamical systems. We draw inspiration from the field of differential-algebraic equations and differential equations on manifolds and implement related methods in residual neural networks, despite some fundamental scenario differences. Constraint or auxiliary information effects are incorporated through stabilization as well as projection methods, and we show when to use which method based on experiments involving simulations of multi-body pendulums and molecular dynamics scenarios. Several of our methods are easy to implement in existing code and have limited impact on training performance while giving significant boosts in terms of inference.

Neural DAEs: Constrained neural networks

TL;DR

, demonstrating their effectiveness on multi-body pendulums, water MD simulations, and divergence-free vector-field denoising. Across these domains, projection-based methods, especially when applied smoothly through the network, yield the strongest improvements in constraint satisfaction and often improve predictive accuracy, highlighting the practical impact of constraint-aware learning in physics-informed contexts. The results suggest a robust, architecture-agnostic strategy for integrating hard physical constraints into neural models, with tradeoffs in computational cost and implementation complexity and clear directions for future refinement, such as advanced solvers, analytic backpropagation for projections, and higher-order constraint information.

Abstract

Paper Structure (23 sections, 26 equations, 22 figures, 4 tables)

This paper contains 23 sections, 26 equations, 22 figures, 4 tables.

Introduction
Residual Networks and Constraints
Adding auxiliary constraint information
Auxiliary regularization
Projecting the final state onto the constraint manifold
Projecting the state onto the constraint manifold throughout the network
Stabilization by penalty
Model Problems
The multi-body pendulum
Water molecules simulation
Vector field denoising
Experiments
Multi-body pendulum
Molecular dynamics simulations - water molecules
Vector field denoising
...and 8 more sections

Figures (22)

Figure 1: A multi-body pendulum system with four pendulums.
Figure 1: Snapshots of a five-body pendulum. These should be viewed as snapshots of a single pendulum animation. Each snapshot is 100 steps after the previous one (0.1). The arrows indicate the velocity of each individual pendulum.
Figure 1: Comparison of neural networks using penalty stabilization with different strengths $\gamma$. All neural networks are trained on 100 samples with $k=100$ on the multi-body pendulum system described in Section \ref{['sec:experiment_pendulum']}. Each run has been repeated three times, the solid/dashed lines are the average based on validation/training data. Note that the blue line is mostly hidden beneath the orange line
Figure 1: Comparison of neural networks using auxiliary regularization with different strengths $\eta$. All neural networks are trained on 100 samples with $k=100$ on the five-body pendulum system described in Section \ref{['sec:experiment_pendulum']}. Each run has been repeated three times, the solid/dashed lines are the average based on validation/training data. Note that the blue line is mostly hidden beneath the orange line.
Figure 1: Comparison of the different ways of adding constraints to neural network trained on 100 multi-body pendulum samples with $k=100$. Left: Mean absolute error. Right: Maximum constraint violation. Each run has been repeated three times, the solid/dashed lines are the average based on validation/training data.
...and 17 more figures

Neural DAEs: Constrained neural networks

TL;DR

Abstract

Neural DAEs: Constrained neural networks

Authors

TL;DR

Abstract

Table of Contents

Figures (22)