Neural SDEs as a Unified Approach to Continuous-Domain Sequence Modeling
Macheng Shen, Chen Cheng
TL;DR
This work presents Neural SDEs as a unified framework for continuous-time sequence modeling, treating sequences as trajectories of an underlying stochastic dynamical system and learning both drift and diffusion via maximum likelihood. By employing a time-invariant, diagonally diffusive SDE and a simulation-free training objective, the approach directly models state-to-state transitions without unrolling from a noise prior. Empirical results demonstrate accurate multi-modal trajectory generation, robustness to sharp dynamics, and efficient inference with few steps in tasks ranging from imitation learning to video prediction, with added capability for temporal interpolation. The method offers a principled bridge between continuous-time dynamics and data-driven sequence modeling, enabling scalable, interpretable modeling of complex temporal processes in embodied and generative AI settings.
Abstract
Inspired by the ubiquitous use of differential equations to model continuous dynamics across diverse scientific and engineering domains, we propose a novel and intuitive approach to continuous sequence modeling. Our method interprets time-series data as \textit{discrete samples from an underlying continuous dynamical system}, and models its time evolution using Neural Stochastic Differential Equation (Neural SDE), where both the flow (drift) and diffusion terms are parameterized by neural networks. We derive a principled maximum likelihood objective and a \textit{simulation-free} scheme for efficient training of our Neural SDE model. We demonstrate the versatility of our approach through experiments on sequence modeling tasks across both embodied and generative AI. Notably, to the best of our knowledge, this is the first work to show that SDE-based continuous-time modeling also excels in such complex scenarios, and we hope that our work opens up new avenues for research of SDE models in high-dimensional and temporally intricate domains.
