Recent Developments in Machine Learning Methods for Stochastic Control and Games
Ruimeng Hu, Mathieu Laurière
TL;DR
This survey surveys neural network–based methods for solving stochastic control and differential games, emphasizing high-dimensional and complex dynamics including delays and common noise. It connects deep BSDEs, PDE-based deep learning, and dynamic-programming–inspired direct parameterization, highlighting model-based and model-free approaches as well as mean-field formulations. Key contributions include (i) systematic distillation of Deep BSDE, DBDP, and Deep Galerkin methodologies, (ii) extended treatments for delay, mean-field control, N-player games, and mean-field games with common noise, and (iii) demonstrations in price-impact, epidemic control, and systemic-risk contexts. The practical impact lies in providing scalable, architecture-informed frameworks to approximate Nash equilibria and optimal controls in high-dimensional stochastic systems, with emphasis on learning-driven, data-efficient approaches and master-equation perspectives.
Abstract
Stochastic optimal control and games have a wide range of applications, from finance and economics to social sciences, robotics, and energy management. Many real-world applications involve complex models that have driven the development of sophisticated numerical methods. Recently, computational methods based on machine learning have been developed for solving stochastic control problems and games. In this review, we focus on deep learning methods that have unlocked the possibility of solving such problems, even in high dimensions or when the structure is very complex, beyond what traditional numerical methods can achieve. We consider mostly the continuous time and continuous space setting. Many of the new approaches build on recent neural-network-based methods for solving high-dimensional partial differential equations or backward stochastic differential equations, or on model-free reinforcement learning for Markov decision processes that have led to breakthrough results. This paper provides an introduction to these methods and summarizes the state-of-the-art works at the crossroad of machine learning and stochastic control and games.
