No-Regret Strategy Solving in Imperfect-Information Games via Pre-Trained Embedding
Yanchang Fu, Shengda Liu, Pei Xu, Kaiqi Huang
TL;DR
The paper tackles the challenge of solving large-scale imperfect-information extensive-form games under limited resources, where traditional pre-trained clustering-based abstractions lose fine-grained distinctions between information sets.It introduces Embedding CFR, a framework that pre-trains information-set embeddings in a low-dimensional space using an advisor-based extended abstraction and an embedding matrix, enabling regret-based strategy solving in the embedding space with memory-efficient updates.The authors provide an approximate convergence analysis showing regret can decrease within the embedding space when advisors act in isolation, and they design a poker-specific embedding pipeline (HandEbdNet) to generate embedding coordinates on the fly.Empirical results in Numeral211 Hold'em demonstrate that Embedding CFR achieves substantially faster exploitability convergence than state-of-the-art clustering-based abstractions under the same resource constraints, validating the effectiveness of embedding-based information-set abstraction for poker AI.
Abstract
High-quality information set abstraction remains a core challenge in solving large-scale imperfect-information extensive-form games (IIEFGs)--such as no-limit Texas Hold'em--where the finite nature of spatial resources hinders solving strategies for the full game. State-of-the-art AI methods rely on pre-trained discrete clustering for abstraction, yet their hard classification irreversibly discards critical information: specifically, the quantifiable subtle differences between information sets--vital for strategy solving--thus compromising the quality of such solving. Inspired by the word embedding paradigm in natural language processing, this paper proposes the Embedding CFR algorithm, a novel approach for solving strategies in IIEFGs within an embedding space. The algorithm pre-trains and embeds the features of individual information sets into an interconnected low-dimensional continuous space, where the resulting vectors more precisely capture both the distinctions and connections between information sets. Embedding CFR introduces a strategy-solving process driven by regret accumulation and strategy updates in this embedding space, with supporting theoretical analysis verifying its ability to reduce cumulative regret. Experiments on poker show that with the same spatial overhead, Embedding CFR achieves significantly faster exploitability convergence compared to cluster-based abstraction algorithms, confirming its effectiveness. Furthermore, to our knowledge, it is the first algorithm in poker AI that pre-trains information set abstractions via low-dimensional embedding for strategy solving.
