Table of Contents
Fetching ...

Optimally Installing Strict Equilibria

Jeremy McMahan, Young Wu, Yudong Chen, Xiaojin Zhu, Qiaomin Xie

TL;DR

This work develops a reward-design framework to install a target behavior as a strict equilibrium across DSE, NE, CE, and CCE, including their Markov-perfect variants, in known-transition Markov games. It provides exact feasibility characterizations for strictly installable policies in normal-form games and extends these to stage-wise and Markov-game settings, yielding polynomial-time algorithms to verify installability and construct feasible rewards. The authors introduce LP-based methods to solve the optimal reward-design problem under a reward bound and discuss robustness under bounded rationality via epsilon-strict equilibria, including detailed deviation-class analyses. Key contributions include complete CE/CCE installability characterizations, epsilon-strictness bounds, and scalable algorithms for both normal-form and Markov-game settings, enabling practical deployment of strong, predictable equilibria in sequential multi-agent environments. The results have practical impact for safe traffic control, resource allocation, and other domains where enforcing a desired compliant behavior is critical while keeping changes within realistic constraints.

Abstract

In this work, we develop a reward design framework for installing a desired behavior as a strict equilibrium across standard solution concepts: dominant strategy equilibrium, Nash equilibrium, correlated equilibrium, and coarse correlated equilibrium. We also extend our framework to capture the Markov-perfect equivalents of each solution concept. Central to our framework is a comprehensive mathematical characterization of strictly installable, based on the desired solution concept and the behavior's structure. These characterizations lead to efficient iterative algorithms, which we generalize to handle optimization objectives through linear programming. Finally, we explore how our results generalize to bounded rational agents.

Optimally Installing Strict Equilibria

TL;DR

This work develops a reward-design framework to install a target behavior as a strict equilibrium across DSE, NE, CE, and CCE, including their Markov-perfect variants, in known-transition Markov games. It provides exact feasibility characterizations for strictly installable policies in normal-form games and extends these to stage-wise and Markov-game settings, yielding polynomial-time algorithms to verify installability and construct feasible rewards. The authors introduce LP-based methods to solve the optimal reward-design problem under a reward bound and discuss robustness under bounded rationality via epsilon-strict equilibria, including detailed deviation-class analyses. Key contributions include complete CE/CCE installability characterizations, epsilon-strictness bounds, and scalable algorithms for both normal-form and Markov-game settings, enabling practical deployment of strong, predictable equilibria in sequential multi-agent environments. The results have practical impact for safe traffic control, resource allocation, and other domains where enforcing a desired compliant behavior is critical while keeping changes within realistic constraints.

Abstract

In this work, we develop a reward design framework for installing a desired behavior as a strict equilibrium across standard solution concepts: dominant strategy equilibrium, Nash equilibrium, correlated equilibrium, and coarse correlated equilibrium. We also extend our framework to capture the Markov-perfect equivalents of each solution concept. Central to our framework is a comprehensive mathematical characterization of strictly installable, based on the desired solution concept and the behavior's structure. These characterizations lead to efficient iterative algorithms, which we generalize to handle optimization objectives through linear programming. Finally, we explore how our results generalize to bounded rational agents.

Paper Structure

This paper contains 27 sections, 14 theorems, 21 equations, 2 algorithms.

Key Result

Proposition 1

Given any mixed product strategy $\sigma \in \Delta(\mathcal{A})$, there exists a utility function $u$ for which $\sigma$ is an sNE for $(\mathcal{A}, u)$ if and only if for all players $i \in [n]$ we have that $p_{ij} = 1$ for some $j \in \mathcal{A}_i$.

Theorems & Definitions (26)

  • Definition 1: Strict Equilibria
  • Remark 1: Strictness Trade-off
  • Definition 2: Strict Installability
  • Definition 3: Optimal Reward Design
  • Example 1: Costs
  • Definition 4: Conditionals
  • Proposition 1: sNE installability
  • Theorem 1: sCE installability
  • proof
  • Corollary 1
  • ...and 16 more