Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)
Ruaridh Mon-Williams, Max Taylor-Davies, Elizabeth Mieczkowski, Natalia Velez, Neil R. Bramley, Yanwei Wang, Thomas L. Griffiths, Christopher G. Lucas
TL;DR
The study asks whether partner modelling can spontaneously emerge in model-free recurrent agents engaged in cooperative tasks. By training GRU-based ego agents with diverse partners in Overcooked-AI, the authors show that internal representations of partners' abilities arise without explicit modelling incentives, and that these representations support rapid adaptation to unseen and online-changing partners. The emergence hinges on environmental social pressure, notably when agents can influence partner task allocation, and even blind agents can develop partner models. Collectively, the results demonstrate that social predictive representations can arise from cooperative pressures and general learning mechanisms, highlighting a path toward more flexible, human-like collaboration in AI agents.
Abstract
Humans are remarkably adept at collaboration, able to infer the strengths and weaknesses of new partners in order to work successfully towards shared goals. To build AI systems with this capability, we must first understand its building blocks: does such flexibility require explicit, dedicated mechanisms for modelling others -- or can it emerge spontaneously from the pressures of open-ended cooperative interaction? To investigate this question, we train simple model-free RNN agents to collaborate with a population of diverse partners. Using the `Overcooked-AI' environment, we collect data from thousands of collaborative teams, and analyse agents' internal hidden states. Despite a lack of additional architectural features, inductive biases, or auxiliary objectives, the agents nevertheless develop structured internal representations of their partners' task abilities, enabling rapid adaptation and generalisation to novel collaborators. We investigated these internal models through probing techniques, and large-scale behavioural analysis. Notably, we find that structured partner modelling emerges when agents can influence partner behaviour by controlling task allocation. Our results show that partner modelling can arise spontaneously in model-free agents -- but only under environmental conditions that impose the right kind of social pressure.
