Semi-Implicit Neural Ordinary Differential Equations
Hong Zhang, Ying Liu, Romit Maulik
TL;DR
The paper addresses the instability and inefficiency of training stiff neural ODEs by introducing SINODE, a semi-implicit framework that partitions the dynamics into a nonlinear part $\mathcal{G}(u)$ and a linear part $\mathcal{H}(u)=\mathcal{J}u$. It leverages implicit-explicit (IMEX) Runge–Kutta integration for forward passes and a discrete, reverse-accurate adjoint for backward passes, enabling stable large time steps and efficient linear solves with LU reuse or matrix-free methods. Empirically, SINODE demonstrates superior stability and speed on graph diffusion tasks (GRAND) and nonlinear/time-series PDE dynamics (Kuramoto–Sivashinsky, Burgers), achieving fewer right-hand side evaluations and faster training than explicit or fully implicit baselines. The approach broadens the applicability of neural ODEs to stiff problems in graph learning and scientific ML while preserving memory efficiency through checkpointing and Jacobian-vector products. Overall, SINODE provides a practical and scalable route to training stiff neural ODEs with strong stability properties and compatibility with existing high-performance linear solvers.
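To make the partitioning idea concrete, the sketch below applies a first-order IMEX Euler step to $\dot{u} = \mathcal{G}(u) + \mathcal{J}u$: the nonlinear part is treated explicitly, the linear part implicitly, and the matrix $I - h\mathcal{J}$ is LU-factored once and reused at every step. This is an illustration only, not the paper's implementation: the paper uses IMEX Runge–Kutta schemes and learned dynamics, whereas here `G`, `J`, the grid size, the step size, and the helper names `imex_euler` and `explicit_euler` are all hypothetical toy choices (a 1D diffusion operator with a cubic damping term).

```python
import numpy as np
from scipy.linalg import lu_factor, lu_solve


def imex_euler(G, J, u0, h, n_steps):
    """First-order IMEX Euler for du/dt = G(u) + J u (toy stand-in, not SINODE itself).

    Each step solves (I - h J) u_next = u + h G(u).
    """
    identity = np.eye(J.shape[0])
    lu_piv = lu_factor(identity - h * J)  # factor once; J and h are fixed
    u = u0.copy()
    for _ in range(n_steps):
        u = lu_solve(lu_piv, u + h * G(u))  # reuse the LU factors every step
    return u


def explicit_euler(G, J, u0, h, n_steps):
    """Fully explicit Euler on the same right-hand side, for comparison."""
    u = u0.copy()
    for _ in range(n_steps):
        u = u + h * (G(u) + J @ u)
    return u


# Assumed toy problem: stiff linear part (1D Laplacian) plus a mild nonlinearity.
n = 32
dx = 1.0 / (n + 1)
J = (
    np.diag(-2.0 * np.ones(n))
    + np.diag(np.ones(n - 1), 1)
    + np.diag(np.ones(n - 1), -1)
) / dx**2


def G(u):
    return -u**3  # hypothetical nonlinear part, handled explicitly


x = np.linspace(dx, 1.0 - dx, n)
u0 = np.sin(np.pi * x)

h = 1e-2  # far above the explicit Euler stability limit (about dx**2 / 2)
u_imex = imex_euler(G, J, u0, h, 50)
with np.errstate(over="ignore", invalid="ignore"):
    u_expl = explicit_euler(G, J, u0, h, 50)

print("IMEX max |u|:    ", np.max(np.abs(u_imex)))
print("explicit finite? ", bool(np.all(np.isfinite(u_expl))))
```

At this step size the explicit update amplifies the high-frequency modes of `J` at every step, while the semi-implicit update stays bounded at the cost of one triangular solve pair per step. For large or matrix-free operators, the direct factorization would presumably be swapped for an iterative solver, which the summary above mentions as the alternative to LU reuse.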
Abstract
Classical neural ODEs trained with explicit methods are intrinsically limited by stability, crippling their efficiency and robustness for stiff learning problems that are common in graph learning and scientific machine learning. We present a semi-implicit neural ODE approach that exploits the partitionable structure of the underlying dynamics. Our technique leads to an implicit neural network with significant computational advantages over existing approaches because of enhanced stability and efficient linear solves during time integration. We show that our approach outperforms existing approaches on a variety of applications including graph classification and learning complex dynamical systems. We also demonstrate that our approach can train challenging neural ODEs where both explicit methods and fully implicit methods are intractable.
