Deep Neural Networks Inspired by Differential Equations
Yongshuai Liu, Lianfang Wang, Kuilin Qin, Qinghua Zhang, Faqiang Wang, Li Cui, Jun Liu, Yuping Duan, Tieyong Zeng
TL;DR
This survey surveys how deep learning can be viewed through the lens of differential equations, unifying deterministic (ODE/PDE) and stochastic (SDE) dynamics to guide architecture and training. It categorizes approaches into DDE-driven architectures (e.g., ResNets, LM-ResNets, RevNets, PDE-CNNs) and deterministic dynamics (Neural ODEs, flow-based models), as well as stochastic methods (stochastic regularization and Neural SDEs) that inform robust learning and generative modeling. The paper provides a structured synthesis of theoretical foundations, representative models, and empirical evidence (e.g., CIFAR-10/100 performance) and discusses future directions toward higher-order, delayed, and fractional DEs, non-Gaussian noise, and domain-specific constraints. By linking continuous-time dynamics, regularization, and diffusion-based generative processes, it underscores the potential for more interpretable, stable, and data-efficient deep learning systems with rigorous mathematical underpinnings.
Abstract
Deep learning has become a pivotal technology in fields such as computer vision, scientific computing, and dynamical systems, significantly advancing these disciplines. However, neural Networks persistently face challenges related to theoretical understanding, interpretability, and generalization. To address these issues, researchers are increasingly adopting a differential equations perspective to propose a unified theoretical framework and systematic design methodologies for neural networks. In this paper, we provide an extensive review of deep neural network architectures and dynamic modeling methods inspired by differential equations. We specifically examine deep neural network models and deterministic dynamical network constructs based on ordinary differential equations (ODEs), as well as regularization techniques and stochastic dynamical network models informed by stochastic differential equations (SDEs). We present numerical comparisons of these models to illustrate their characteristics and performance. Finally, we explore promising research directions in integrating differential equations with deep learning to offer new insights for developing intelligent computational methods that boast enhanced interpretability and generalization capabilities.
