Jacobian Regularization Stabilizes Long-Term Integration of Neural Differential Equations

Maya Janvier; Julien Salomon; Etienne Meunier

Jacobian Regularization Stabilizes Long-Term Integration of Neural Differential Equations

Maya Janvier, Julien Salomon, Etienne Meunier

TL;DR

This paper tackles the stability challenge of long-term integration with Neural Differential Equations by introducing Jacobian-based regularizations that align the learned dynamics with the true system through directional derivatives. It presents two cost-effective losses: an exact directional-derivative loss for known dynamics ($\mathcal{L}_{AD}$) and a finite-difference, unsupervised loss for unknown dynamics ($\mathcal{L}_{FD}$), both leveraging a Hutchinson trace estimator to avoid full Jacobian computations. The approach demonstrates improved long-term stability across two ODE problems (Two-Body and Rigid Body) and one PDE (Kuramoto-Sivashinsky), with distinct strengths: AD excels when Jacobians are tractable, while FD offers robust performance with unknown dynamics and easier tuning. This work enables stable, long-range simulations using neural approximations of dynamical systems without resorting to expensive long training rollouts, thereby broadening applicability to large-scale physical models. $L_F$ and $L_{F_\theta}$ bounds are used to motivate the regularization of $J_F$ toward $J_{F_\theta}$, linking Jacobian accuracy to trajectory stability.$

Abstract

Hybrid models and Neural Differential Equations (NDE) are getting increasingly important for the modeling of physical systems, however they often encounter stability and accuracy issues during long-term integration. Training on unrolled trajectories is known to limit these divergences but quickly becomes too expensive due to the need for computing gradients over an iterative process. In this paper, we demonstrate that regularizing the Jacobian of the NDE model via its directional derivatives during training stabilizes long-term integration in the challenging context of short training rollouts. We design two regularizations, one for the case of known dynamics where we can directly derive the directional derivatives of the dynamic and one for the case of unknown dynamics where they are approximated using finite differences. Both methods, while having a far lower cost compared to long rollouts during training, are successful in improving the stability of long-term simulations for several ordinary and partial differential equations, opening up the door to training NDE methods for long-term integration of large scale systems.

Jacobian Regularization Stabilizes Long-Term Integration of Neural Differential Equations

TL;DR

) and a finite-difference, unsupervised loss for unknown dynamics (

), both leveraging a Hutchinson trace estimator to avoid full Jacobian computations. The approach demonstrates improved long-term stability across two ODE problems (Two-Body and Rigid Body) and one PDE (Kuramoto-Sivashinsky), with distinct strengths: AD excels when Jacobians are tractable, while FD offers robust performance with unknown dynamics and easier tuning. This work enables stable, long-range simulations using neural approximations of dynamical systems without resorting to expensive long training rollouts, thereby broadening applicability to large-scale physical models.

and

bounds are used to motivate the regularization of

toward

, linking Jacobian accuracy to trajectory stability.$

Abstract

Paper Structure (21 sections, 5 theorems, 32 equations, 3 figures, 3 tables)

This paper contains 21 sections, 5 theorems, 32 equations, 3 figures, 3 tables.

Introduction
Challenges of Accurate Long Rollouts for Neural Differential Equations
Learning through solvers: the influence of rollout
Jacobian and long-term stability
Methods
Estimating the norm of Jacobians efficiently
A family of Jacobian derived regularizations
Known dynamics
Unknown dynamics
Results
Two-Body Problem
Rigid Body Problem
Kuramoto-Sivashinsky
Impact on learned $F_\theta$ and $J_{F_\theta}$
Conclusion
...and 6 more sections

Key Result

Proposition 2.2

Trajectory error bound For $t>0$, with $L_{F_\theta}$ the Lipschitz constant of $F_\theta$.

Figures (3)

Figure 1: From the left to the right we have for each row: an example of a trajectory, the evolution of the relative error of the state over time and the distribution final values of $R_e(\theta,T_{\text{test}})$. Top row: Two-Body Problem. Middle row: Rigid Body Problem. $N=5\Delta t$ and $N=10\Delta t$ trajectories are cut at 2000 steps for visualization purposes as they drift further away in time. Bottom row: Kuramoto-Sivashinksy.
Figure 2: Mean MSE on validation set for the optimization of the $\lambda$ parameter, for the Two-Body Problem (left) and the Rigid Body Problem (right). The best outcome is circled.
Figure 3: Mean Relative Error of the conserved properties over time (left) and distribution of final values (right). Top row: Two-Body Problem. Middle row: Rigid Body Problem. Bottom row: Kuramoto-Sivashinsky

Theorems & Definitions (9)

Definition 2.1
Proposition 2.2
proof
Proposition 2.3
proof
Proposition 3.1
Proposition 1.1
proof
Lemma 1.2

Jacobian Regularization Stabilizes Long-Term Integration of Neural Differential Equations

TL;DR

Abstract

Jacobian Regularization Stabilizes Long-Term Integration of Neural Differential Equations

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (9)