Language and Experience: A Computational Model of Social Learning in Complex Tasks
Cédric Colas, Tracey Mills, Ben Prystawski, Michael Henry Tessler, Noah Goodman, Jacob Andreas, Joshua Tenenbaum
TL;DR
This work presents a Bayesian framework that unifies linguistic guidance and direct experience to learn executable world models in complex tasks. By treating language as evidence via a probabilistic speaker model and using LM-enabled guidance to shape inference and proposals, the approach achieves sample-efficient learning and robust knowledge transfer across humans and models. Across 10 VGDL-based video games, linguistic guidance accelerates exploration, reduces dangerous interactions, and enables generational and bidirectional transfer of knowledge, illustrating a path toward effective human–AI collaborative learning. The findings highlight the potential of language-augmented, theory-based agents to teach, adapt, and co-evolve within social learning networks, with implications for scalable, collaborative intelligence.
Abstract
The ability to combine linguistic guidance from others with direct experience is central to human development, enabling safe and rapid learning in new environments. How do people integrate these two sources of knowledge, and how might AI systems? We present a computational framework that models social learning as joint probabilistic inference over structured, executable world models given sensorimotor and linguistic data. We make this possible by turning a pretrained language model into a probabilistic model of how humans share advice conditioned on their beliefs, allowing our agents both to generate advice for others and to interpret linguistic input as evidence during Bayesian inference. Using behavioral experiments and simulations across 10 video games, we show how linguistic guidance can shape exploration and accelerate learning by reducing risky interactions and speeding up key discoveries in both humans and models. We further explore how knowledge can accumulate across generations through iterated learning experiments and demonstrate successful knowledge transfer between humans and models -- revealing how structured, language-compatible representations might enable human-machine collaborative learning.
