Stability-Informed Initialization of Neural Ordinary Differential Equations
Theodor Westny, Arman Mohammadi, Daniel Jung, Erik Frisk
TL;DR
This work analyzes how the stability properties of neural ODE solvers influence training and learned dynamics, linking continuous stability with the solver's stability region for fixed-step Runge-Kutta methods. It introduces a stability-informed initialization that uses linearization, eigenvalue placement, and rejection sampling to place the Jacobian within the solver's stable region, improving training efficiency and predictive accuracy. The authors demonstrate the approach through a teacher–student regression study and diverse experiments in pixel-level classification, latent dynamics, and multivariate time-series forecasting, showing faster convergence and better generalization. The results suggest that aligning model initialization with solver stability constraints can yield robust performance in real-world dynamic learning tasks and motivate broader adoption of stability-aware design in neural ODEs.
Abstract
This paper addresses the training of Neural Ordinary Differential Equations (neural ODEs), and in particular explores the interplay between numerical integration techniques, stability regions, step size, and initialization techniques. It is shown how the choice of integration technique implicitly regularizes the learned model, and how the solver's corresponding stability region affects training and prediction performance. From this analysis, a stability-informed parameter initialization technique is introduced. The effectiveness of the initialization method is displayed across several learning benchmarks and industrial applications.
