DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems
Michael Ludkovski, Changgen Xie, Zimu Zhu
TL;DR
This work tackles numerical resolution of continuous-time Principal-Agent problems by introducing DeepPAAC, a deep learning variant of the Deep Galerkin Method organized as an Actor-Critic algorithm to handle implicit Hamiltonians. The method uses separate neural networks for the value function and the optimal feedback control, minimizing PDE residuals and embedding a terminal constraint through a correction term. The authors validate DeepPAAC across five multidimensional case studies, including constrained contracts and explicit-solution benchmarks, demonstrating stability, accuracy, and faster convergence relative to prior DGM-based approaches. This approach broadens the computational toolkit for complex PA models and highlights the potential of neural PDE solvers for high-dimensional stochastic control problems.
Abstract
We consider numerical resolution of principal-agent (PA) problems in continuous time. We formulate a generic PA model with continuous and lump payments and a multi-dimensional strategy of the agent. To tackle the resulting Hamilton-Jacobi-Bellman equation with an implicit Hamiltonian we develop a novel deep learning method: the Deep Principal-Agent Actor Critic (DeepPAAC) Actor-Critic algorithm. DeepPAAC is able to handle multi-dimensional states and controls, as well as constraints. We investigate the role of the neural network architecture, training designs, loss functions, etc. on the convergence of the solver, presenting five different case studies.
