A Survey on Game Theory Optimal Poker
Prathamesh Sonawane, Arav Chheda
TL;DR
The paper surveys game-theoretic approaches to poker in imperfect-information settings, contrasting Game Theory Optimal (GTO) play with exploitative strategies and emphasizing practical abstractions. It reviews abstraction methods (notably bucketing and LP-based solutions) and betting models (discretized and LokiLoki) that make large games tractable, alongside result-oriented strategies such as CFR+. The discussion extends to multi-player settings (e.g., Pluribus) where purely symbolic GTO solutions are computationally intractable, highlighting ML-based exploitative approaches. The authors conclude with limitations and future directions, recommending more automated parameter tuning and hybrid ML-symbolic methods to scale to real-world, multi-agent poker with broader opponent modeling.
Abstract
Poker is in the family of imperfect information games unlike other games such as chess, connect four, etc which are perfect information game instead. While many perfect information games have been solved, no non-trivial imperfect information game has been solved to date. This makes poker a great test bed for Artificial Intelligence research. In this paper we firstly compare Game theory optimal poker to Exploitative poker. Secondly, we discuss the intricacies of abstraction techniques, betting models, and specific strategies employed by successful poker bots like Tartanian[1] and Pluribus[6]. Thirdly, we also explore 2-player vs multi-player games and the limitations that come when playing with more players. Finally, this paper discusses the role of machine learning and theoretical approaches in developing winning strategies and suggests future directions for this rapidly evolving field.
