DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisý, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, Michael Bowling
TL;DR
DeepStack addresses the challenge of expert-level play in imperfect-information games, specifically heads-up no-limit Texas Hold'em, by integrating recursive CFR-based reasoning with continual re-solving and a learned counterfactual-value function to cap depth. It replaces offline full-game abstraction with online, situation-specific solving, aided by deep neural networks (flop, turn, and auxiliary) that estimate subgame values. The approach yields near-Nash-equilibrium strategies, demonstrated by statistically significant superiority over professional players and strong resistance to exploitation as shown by Local Best Response analyses. This work signals a paradigm shift in handling large, sequential imperfect-information problems by coupling online solving with learned value approximations, enabling practical play and broader applicability.
Abstract
Artificial intelligence has seen several breakthroughs in recent years, with games often serving as milestones. A common feature of these games is that players have perfect information. Poker is the quintessential game of imperfect information, and a longstanding challenge problem in artificial intelligence. We introduce DeepStack, an algorithm for imperfect information settings. It combines recursive reasoning to handle information asymmetry, decomposition to focus computation on the relevant decision, and a form of intuition that is automatically learned from self-play using deep learning. In a study involving 44,000 hands of poker, DeepStack defeated with statistical significance professional poker players in heads-up no-limit Texas hold'em. The approach is theoretically sound and is shown to produce more difficult to exploit strategies than prior approaches.
