In-Context Exploiter for Extensive-Form Games

Shuxin Li; Chang Yang; Youzhi Zhang; Pengdeng Li; Xinrun Wang; Xiao Huang; Hau Chan; Bo An

In-Context Exploiter for Extensive-Form Games

Shuxin Li, Chang Yang, Youzhi Zhang, Pengdeng Li, Xinrun Wang, Xiao Huang, Hau Chan, Bo An

TL;DR

This work introduces a novel method, In-Context Exploiter (ICE), to train a single model that can act as any player in the game and adaptively exploit opponents entirely by in-context learning.

Abstract

Nash equilibrium (NE) is a widely adopted solution concept in game theory due to its stability property. However, we observe that the NE strategy might not always yield the best results, especially against opponents who do not adhere to NE strategies. Based on this observation, we pose a new game-solving question: Can we learn a model that can exploit any, even NE, opponent to maximize their own utility? In this work, we make the first attempt to investigate this problem through in-context learning. Specifically, we introduce a novel method, In-Context Exploiter (ICE), to train a single model that can act as any player in the game and adaptively exploit opponents entirely by in-context learning. Our ICE algorithm involves generating diverse opponent strategies, collecting interactive history training data by a reinforcement learning algorithm, and training a transformer-based agent within a well-designed curriculum learning framework. Finally, comprehensive experimental results validate the effectiveness of our ICE algorithm, showcasing its in-context learning ability to exploit any unknown opponent, thereby positively answering our initial game-solving question.

In-Context Exploiter for Extensive-Form Games

TL;DR

This work introduces a novel method, In-Context Exploiter (ICE), to train a single model that can act as any player in the game and adaptively exploit opponents entirely by in-context learning.

Abstract

Paper Structure (23 sections, 1 equation, 11 figures, 2 tables, 2 algorithms)

This paper contains 23 sections, 1 equation, 11 figures, 2 tables, 2 algorithms.

Introduction
Background and Related Work
Problem Statement
In-Context Exploiter
Opponent Generation
Interactive History Collection
Curriculum Learning
Curriculum Generation
Transformer Training
Step-by-step Training
Empirical Evaluation
Experimental Setting
Experimental Results
Conclusion
Discussion
...and 8 more sections

Figures (11)

Figure 1: Comparison between equilibrium finding and our method
Figure 2: Overview of ICE
Figure 3: In-distribution results when acting as any player
Figure 4: Results against various opponents (left-Kuhn, middle-Leduc, right-Goofspiel)
Figure 5: Results against NE opponent
...and 6 more figures

In-Context Exploiter for Extensive-Form Games

TL;DR

Abstract

In-Context Exploiter for Extensive-Form Games

Authors

TL;DR

Abstract

Table of Contents

Figures (11)