Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

Xuanfa Jin; Ziyan Wang; Yali Du; Meng Fang; Haifeng Zhang; Jun Wang

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang

TL;DR

This work addresses the challenge of strategic discussion in multi-agent, uncertain games by formulating ONUW as a Multi-Phase Extensive-Form Bayesian Game and proving PBEs with and without Day-phase discussion. It then introduces an RL-instructed LLM-based agent framework that learns a discrete set of discussion tactics via offline RL (Conservative Q-Learning) to influence beliefs and public speech, aiming to better approximate PBEs in ONUW. Empirical results in three- and five-player ONUW show that the learned discussion policy improves alignment with equilibria and enhances agent performance across GPT-4 and Gemini backends, with RL-trained policies outperforming direct LLM prompting. The findings highlight the importance of controllable discussion strategies in complex communication games and offer a scalable pathway for robust, belief-grounded LLM agents in uncertain, strategic environments.

Abstract

Communication is a fundamental aspect of human society, facilitating the exchange of information and beliefs among people. Despite the advancements in large language models (LLMs), recent agents built with these often neglect the control over discussion tactics, which are essential in communication scenarios and games. As a variant of the famous communication game Werewolf, One Night Ultimate Werewolf (ONUW) requires players to develop strategic discussion policies due to the potential role changes that increase the uncertainty and complexity of the game. In this work, we first present the existence of the Perfect Bayesian Equilibria (PBEs) in two scenarios of the ONUW game: one with discussion and one without. The results showcase that the discussion greatly changes players' utilities by affecting their beliefs, emphasizing the significance of discussion tactics. Based on the insights obtained from the analyses, we propose an RL-instructed language agent framework, where a discussion policy trained by reinforcement learning (RL) is employed to determine appropriate discussion tactics to adopt. Our experimental results on several ONUW game settings demonstrate the effectiveness and generalizability of our proposed framework. The project page of our paper: $\href{https://one-night-ultimate-werewolf.github.io}{one-night-ultimate-werewolf.github.io}$.

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

TL;DR

Abstract

Paper Structure (39 sections, 4 theorems, 28 equations, 7 figures, 5 tables)

This paper contains 39 sections, 4 theorems, 28 equations, 7 figures, 5 tables.

Introduction
Related Work
One Night Ultimate Werewolf Benchmark
Problem Formulation
Analyses on Three-Player ONUW
Notation
Game without Discussion
Game with Discussion
Learning to Discuss Strategically
Learning Discussion Policy by RL
RL-instructed LLM-based Agent Framework
Experiments
Setup
Experiment on Three-Player ONUW
Effectiveness of the Discussion Policy
...and 24 more sections

Key Result

Theorem 4.1

For the ONUW game with two Werewolves and one Robber, in the case where discussion is not allowed, there exist PBEs $(\bm{\pi}^*, \bm{b}^*)$: the Robber switches with any Werewolves with a probability of $1/2$ and votes for the player it switches with; the two Werewolves directly vote for each other

Figures (7)

Figure 1: The game process of the ONUW game. Initially, roles are randomly dealt to players. Then three phases: Night (abilities performed in order), Day (discussion in three rounds), and Voting (suspicious player voted out) proceed sequentially. The winner is decided by the voting result.
Figure 2: Game tree of the game with discussion. P1, P2, and P3 represent Player 1, Player 2, and Player 3, respectively. The dot lines in the Day phase represent Player 3's potential speeches. Those decision nodes on the same dash lines are in the same information sets for corresponding players. The utilities on leaf nodes are organized by the index of players.
Figure 3: Overview of the RL-instructed LLM-based agent framework. (1) Belief Modeling: form beliefs on players' roles based on the observation. (2) Discussion Tactic Selection: utilize a discussion policy trained by RL to choose a discussion tactic from the candidates. (3) Decision Making: take action based on the observation (also belief and discussion tactic, according to the game phase).
Figure 4: The NashConv value of different agents playing in the three-player ONUW game.
Figure 5: The matrices of Team Village's win rates in different settings. It is clear that the hard setting weakens the advantages of our agent.
...and 2 more figures

Theorems & Definitions (6)

Theorem 4.1
Theorem 4.2
Theorem D.2
proof
Theorem D.3
proof

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

TL;DR

Abstract

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (6)