Multi-Agent Relative Investment Games in a Jump Diffusion Market with Deep Reinforcement Learning Algorithm
Liwei Lu, Ruimeng Hu, Xu Yang, Yi Zhu
TL;DR
The authors address multi-agent investment decisions in jump-diffusion markets by deriving semi-explicit constant Nash equilibria under CARA and CRRA utilities and by developing a deep reinforcement learning framework to solve stochastic control problems with jumps, extended to differential games via fictitious play. The RL approach uses an actor-critic architecture with Itô-Lévy dynamics, stabilized rewards, and parallelized fictitious-play iterations to learn both value and policy functions, handling control in both diffusion and jump terms. Theoretical results establish existence and uniqueness of constant Nash equilibria for the exponential, power, and logarithmic cases, while numerical experiments (Merton with jumps, high-dimensional LQR, and multi-agent portfolio games) demonstrate accurate convergence to equilibria, scalability to higher dimensions, and substantial speedups from parallel computation. These contributions offer a model-free, data-friendly toolkit for solving complex stochastic games in markets with jumps, with potential extensions to data-driven calibration and convergence analysis.
Abstract
This paper focuses on multi-agent stochastic differential games for jump-diffusion systems. On one hand, we study the multi-agent game for optimal investment in a jump-diffusion market. We derive constant Nash equilibria and provide sufficient conditions for their existence and uniqueness for exponential, power, and logarithmic utilities, respectively. On the other hand, we introduce a computational framework based on the actor-critic method in deep reinforcement learning to solve the stochastic control problem with jumps. We extend this algorithm to address the multi-agent game with jumps and utilize parallel computing to enhance computational efficiency. We present numerical examples of the Merton problem with jumps, linear quadratic regulators, and the optimal investment game under various settings to demonstrate the accuracy, efficiency, and robustness of the proposed method. In particular, neural network solutions numerically converge to the derived constant Nash equilibrium for the multi-agent game.
