Quantifying Feature Importance of Games and Strategies via Shapley Values
Satoru Fujii
TL;DR
The paper addresses the explainability gap in game AI by introducing two Shapley-based metrics: Shapley Game Feature Importance (SGFI) for global game-level interpretation and Shapley Strategy Feature Importance (SSFI) for local strategy explanations. SGFI quantifies how obscuring subsets of game features affects the target player's expected return, while SSFI decomposes a given AI's strategy into per-feature contributions via permutation-based attribution. The authors ground the methods in extensive-form game theory, exploitability, and abstraction, and validate them empirically on the Goofspiel benchmark, showing that features like Center and Deck are typically more influential and that SSFI decompositions align with intuitive strategic reasoning. Overall, these explainability tools aim to enhance human understanding and collaboration with AI in complex strategic settings, while providing principled, algorithmic insights into feature importance.
Abstract
Recent advances in game informatics have enabled us to find strong strategies across a diverse range of games. However, these strategies are usually difficult for humans to interpret. On the other hand, research in Explainable Artificial Intelligence (XAI) has seen a notable surge in scholarly activity. Interpreting strong or near-optimal strategies or the game itself can provide valuable insights. In this paper, we propose two methods to quantify the feature importance using Shapley values: one for the game itself and another for individual AIs. We empirically show that our proposed methods yield intuitive explanations that resonate with and augment human understanding.
