On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria
Brian Hu Zhang, Tuomas Sandholm
TL;DR
This paper proves that extensive-form correlated equilibria (EFCE) and behavioral correlated equilibria (BCE) are outcome-equivalent: any $\\varepsilon$-EFCE can be transformed into an $\\varepsilon$-BCE that induces the same distribution over terminal nodes. It provides a constructive, polynomial-time method to perform this transformation, enabling the first polynomial-time algorithm for computing a BCE in extensive-form games. The authors introduce counterfactual-regret-based BCE deviations, define a transformation via counterfactual best-response strategies $x_i^{Ia}$, and show the resulting profile is a BCE with the same outcome distribution as the original EFCE. They also discuss implications for optimizing equilibria, learning dynamics, and the limits of stronger notions of outcome equivalence, outlining future directions in polynomial-time computation and dynamics for BCE.
Abstract
We investigate two notions of correlated equilibrium for extensive-form games: extensive-form correlated equilibrium (EFCE) and behavioral correlated equilibrium (BCE). We show that the two are outcome-equivalent, in the sense that every outcome distribution achievable under one notion is achievable under the other. Our result implies, to our knowledge, the first polynomial-time algorithm for computing a BCE.
