Social World Models

Xuhui Zhou; Jiarui Liu; Akhila Yerukola; Hyunwoo Kim; Maarten Sap

Social World Models

Xuhui Zhou, Jiarui Liu, Akhila Yerukola, Hyunwoo Kim, Maarten Sap

TL;DR

This work defines Social World Models (SWMs) and introduces S^3AP, a structured representation for social world states that encodes environment state, agent observations, actions, and mental states. By parsing free-text narratives into S^3AP and inducing SWMs, the approach achieves state-of-the-art performance on static ToM benchmarks and improves interactive social reasoning in SOTOPIA-hard tasks, including clear gains when integrating SWMs with first-person decision making. The key findings show that explicit modeling of hidden mental states and the use of structured representations yield substantial performance gains across diverse, multi-agent scenarios, while revealing that parser quality and action integration are critical for success. The work argues for a foundational, general-purpose representation to enable more socially-aware AI, while acknowledging challenges in parsing faithfulness, long-horizon dynamics, computation, and safety considerations for real-world deployment.

Abstract

Humans intuitively navigate social interactions by simulating unspoken dynamics and reasoning about others' perspectives, even with limited information. In contrast, AI systems struggle to structure and reason about implicit social contexts, as they lack explicit representations for unobserved dynamics such as intentions, beliefs, and evolving social states. In this paper, we introduce the concept of social world models (SWMs) to characterize the complex social dynamics. To operationalize SWMs, we introduce a novel structured social world representation formalism (S3AP), which captures the evolving states, actions, and mental states of agents, addressing the lack of explicit structure in traditional free-text-based inputs. Through comprehensive experiments across five social reasoning benchmarks, we show that S3AP significantly enhances LLM performance-achieving a +51% improvement on FANToM over OpenAI's o1. Our ablations further reveal that these gains are driven by the explicit modeling of hidden mental states, which proves more effective than a wide range of baseline methods. Finally, we introduce an algorithm for social world models using S3AP, which enables AI agents to build models of their interlocutors and predict their next actions and mental states. Empirically, S3AP-enabled social world models yield up to +18% improvement on the SOTOPIA multi-turn social interaction benchmark. Our findings highlight the promise of S3AP as a powerful, general-purpose representation for social world states, enabling the development of more socially-aware systems that better navigate social interactions.

Social World Models

TL;DR

Abstract

Social World Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)