Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, Qiaomin Xie
TL;DR
The paper introduces a formal framework for minimally modifying a two-player zero-sum Markov game to install a target MPE with a specified value range, while minimizing a cost of modification. It provides necessary and sufficient conditions (SIISOW and INV) for the target NE to be unique and reformulates the problem into a convex linear program augmented with spectral constraints, then remedies nonconvexity via a Relax and Perturb (RAP) approach that adds a small eRPS-based perturbation to guarantee invertibility with probability one. The authors extend the method from normal-form games to Markov games, using stage-game Q-functions and Bellman consistency, and propose RAP-MG with analogous feasibility and asymptotic optimality guarantees. Through toy experiments and scale benchmarks, they demonstrate the algorithm’s ability to install mixed-strategy equilibria, control game value, and scale to large action spaces and horizons, with public code available for replication. The work advances understanding of strategic game modification, offering practical tools for both benign design and defensive modeling in multi-agent settings, with several directions for future extension to more general settings and constraints.
Abstract
We study the game modification problem, where a benevolent game designer or a malevolent adversary modifies the reward function of a zero-sum Markov game so that a target deterministic or stochastic policy profile becomes the unique Markov perfect Nash equilibrium and has a value within a target range, in a way that minimizes the modification cost. We characterize the set of policy profiles that can be installed as the unique equilibrium of a game and establish sufficient and necessary conditions for successful installation. We propose an efficient algorithm that solves a convex optimization problem with linear constraints and then performs random perturbation to obtain a modification plan with a near-optimal cost. The code for our algorithm is available at https://github.com/YoungWu559/game-modification .
