Modeling Other Players with Bayesian Beliefs for Games with Incomplete Information
Zuyuan Zhang, Mahdi Imani, Tian Lan
TL;DR
The paper tackles decision-making in Bayesian games with incomplete information by combining Bayesian belief updates with counterfactual regret minimization (CFR). It introduces Bayesian-CFR, which uses a kernel-density posterior to estimate players' type distributions and defines Bayesian regret to drive equilibrium computation, with a theoretical regret bound that includes a time-varying term $\Delta_{\Theta}^T$. Two extensions, Bayesian-CFR+ and Deep Bayesian CFR, scale the approach using type-aware neural architectures and accumulated Bayesian regret. Empirical evaluation in Texas Hold'em shows substantial exploitability reductions versus traditional CFR baselines, supporting the method's use for reasoning about other agents' hidden types. Overall, the work provides a framework with a regret guarantee for computing Bayesian Nash equilibria in extensive-form games under partial information.
Abstract
Bayesian games model interactive decision-making in which players have incomplete information (for example, about payoffs, or about other players' private strategies and preferences) and must actively reason about and update their beliefs regarding that information using observation and interaction history. Existing work on counterfactual regret minimization (CFR) has shown great success for games with complete or imperfect information, but not for Bayesian games. To this end, we introduce a new CFR algorithm, Bayesian-CFR, and analyze its regret bound with respect to Bayesian Nash equilibria in Bayesian games. First, we present a method for updating the posterior distribution of beliefs about the game and other players' types. The method uses a kernel-density estimate and is shown to converge to the true distribution. Second, we define Bayesian regret and present a Bayesian-CFR minimization algorithm for computing the Bayesian Nash equilibrium. Finally, we extend this approach to other existing CFR algorithms, yielding Bayesian-CFR+ and Deep Bayesian CFR. Experimental results show that our proposed solutions significantly outperform existing methods in classical Texas Hold'em games.
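To make the two ingredients named in the abstract more concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a finite set of candidate opponent types, each summarized by a single scalar feature (a hypothetical simplification), forms a Gaussian-kernel-weighted belief over those types from observed play, and then applies standard regret matching to belief-weighted regrets. The names `kernel_belief`, `belief_weighted_regrets`, the `bandwidth` parameter, and the example numbers are all assumptions introduced for illustration; the paper's actual posterior update and its definition of Bayesian regret may differ.

```python
import math


def kernel_belief(observations, type_centers, bandwidth=0.5):
    """Belief over candidate types from Gaussian-kernel scores.

    observations: scalar features seen in the opponent's play (hypothetical).
    type_centers: dict mapping a type name to the feature value that type
                  is assumed to produce (hypothetical).
    """
    scores = {}
    for name, center in type_centers.items():
        scores[name] = sum(
            math.exp(-0.5 * ((x - center) / bandwidth) ** 2)
            for x in observations
        )
    total = sum(scores.values())
    if total == 0.0:
        # No informative observations: fall back to a uniform belief.
        return {name: 1.0 / len(type_centers) for name in type_centers}
    return {name: s / total for name, s in scores.items()}


def belief_weighted_regrets(belief, regrets_by_type):
    """Average per-action regrets over types, weighted by the belief."""
    n_actions = len(next(iter(regrets_by_type.values())))
    return [
        sum(belief[t] * regrets_by_type[t][a] for t in belief)
        for a in range(n_actions)
    ]


def regret_matching(regrets):
    """Standard regret matching: play in proportion to positive regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total <= 0.0:
        return [1.0 / len(regrets)] * len(regrets)
    return [p / total for p in positive]


# Illustrative numbers only: two opponent types and two actions.
type_centers = {"tight": 0.2, "loose": 0.8}
observations = [0.75, 0.8, 0.9]
belief = kernel_belief(observations, type_centers)

regrets_by_type = {"tight": [1.0, -1.0], "loose": [-2.0, 3.0]}
strategy = regret_matching(belief_weighted_regrets(belief, regrets_by_type))
```

In this toy setting the observations sit close to the "loose" center, so the belief favors that type and the resulting strategy follows the action with positive regret against it. A CFR-style solver would repeat this update at every information set and accumulate regrets over iterations.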
