Open-Ended Wargames with Large Language Models
Daniel P. Hogan, Andrea Brennen
TL;DR
The paper tackles automating open-ended qualitative wargames, which have been difficult to automate with prior quantitative-focused work. It introduces Snow Globe, an open-source Python-based multi-agent framework where a control moderator, AI/human players, and teams interact through a history object and LLM-powered prompts to produce coherent narratives and adjudications. The authors detail the architecture, a zero-shot, locally runnable LLM pipeline, and an asynchronous human-in-the-loop design, plus two demonstrations: an AI incident-response tabletop exercise and a geopolitical crisis simulation with persona-based dynamics. The work argues that unconstrained, text-based moves become tractable with LLMs, enabling rapid, diverse scenario exploration and human-AI collaboration across domains.
Abstract
Wargames are a powerful tool for understanding and rehearsing real-world decision making. Automated play of wargames using artificial intelligence (AI) enables possibilities beyond those of human-conducted games, such as playing the game many times over to see a range of possible outcomes. There are two categories of wargames: quantitative games, with discrete types of moves, and qualitative games, which revolve around open-ended responses. Historically, automation efforts have focused on quantitative games, but large language models (LLMs) make it possible to automate qualitative wargames. We introduce "Snow Globe," an LLM-powered multi-agent system for playing qualitative wargames. With Snow Globe, every stage of a text-based qualitative wargame from scenario preparation to post-game analysis can be optionally carried out by AI, humans, or a combination thereof. We describe its software architecture conceptually and release an open-source implementation alongside this publication. As case studies, we simulate a tabletop exercise about an AI incident response and a political wargame about a geopolitical crisis. We discuss potential applications of the approach and how it fits into the broader wargaming ecosystem.
