Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes
Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh
TL;DR
This work extends diffusion-based modeling by integrating distributional dependence through MV-SDEs and introduces three neural mean-field architectures (EM, IM, ML) to parameterize drift with respect to the state law $p_t$. It develops MLE-based, Brownian-bridge, and PDE-consistent estimation schemes, and reveals implicit regularization and attention-like connections arising from distributional terms. Empirical results on synthetic MV-SDEs, real-world time-series data, and generative tasks demonstrate that explicit distributional dependence can improve modeling of interacting systems and enable richer probability flows, without sacrificing performance on standard Itô-SDE tasks. The findings suggest MV-SDEs offer a flexible framework for modeling temporal data with interactions, with practical impact on time-series forecasting and density-based generative modeling.
Abstract
McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, and corresponding estimators for inferring parameters from data based on the properties of the MV-SDE. We analyze the characteristics of the different architectures and estimators, and consider their applicability in relevant machine learning problems. We empirically compare the performance of the different architectures and estimators on real and synthetic datasets for time series and probabilistic modeling. The results suggest that explicitly including distributional dependence in the parameterization of the SDE is effective in modeling temporal data with interaction under an exchangeability assumption while maintaining strong performance for standard Itô-SDEs due to the richer class of probability flows associated with MV-SDEs.
