A Policy Iteration Method for Inverse Mean Field Games
Kui Ren, Nathan Soedjak, Shanyin Tong
TL;DR
The paper tackles reconstructing the obstacle function $b(x)$ in inverse mean-field games from partial observations of the value function. It introduces a policy-iteration scheme that decouples the nonlinear forward-backward MFG system into linear FP solves and linear inverse problems, updating the policy via $q^{(k+1)}=H_p( abla u^{(k)})$. The authors prove uniform convergence and a linear rate of convergence for small time horizons under quadratic Hamiltonians, and demonstrate through 1D and 2D numerics that the method is more efficient and accurate than direct PDE-constrained least-squares, including robustness to noise. The work offers a scalable, theoretically grounded approach for calibrating MFG models using observed value data with practical implications for economics, crowd dynamics, and related fields.
Abstract
We propose a policy iteration method to solve an inverse problem for a mean-field game (MFG) model, specifically to reconstruct the obstacle function in the game from the partial observation data of value functions, which represent the optimal costs for agents. The proposed approach decouples this complex inverse problem, which is an optimization problem constrained by a coupled nonlinear forward and backward PDE system in the MFG, into several iterations of solving linear PDEs and linear inverse problems. This method can also be viewed as a fixed-point iteration that simultaneously solves the MFG system and inversion. We prove its linear rate of convergence. In addition, numerical examples in 1D and 2D, along with performance comparisons to a direct least-squares method, demonstrate the superior efficiency and accuracy of the proposed method for solving inverse MFGs.
