Visual Theory of Mind Enables the Invention of Proto-Writing
Benjamin A. Spiegel, Lucas Gelfond, George Konidaris
TL;DR
This work investigates how proto-writing could emerge by combining naturalistic, multi-agent reinforcement learning with inferential communication grounded in visual theory of mind. It introduces Signification Games to study the invention and evolution of pictographs, revealing a signification gap that limits purely behaviorist signaling. A two-part model shows that incorporating a Bayesian-like inferential mechanism and a referent-sensitivity term $P_{ref}$ enables communication of visually complex referents with crude pictographs and explains the observed shift from iconic to abstract signs over time. The results suggest that visual theory of mind is a crucial cognitive mechanism enabling the invention of the first writing-like signs and establish Signification Games as a general tool for exploring communication across media and species.
Abstract
Symbolic writing systems are graphical semiotic codes that are ubiquitous in modern society but are otherwise absent in the animal kingdom. Anthropological evidence suggests that the earliest forms of some writing systems originally consisted of iconic pictographs, which signify their referent via visual resemblance. While previous studies have examined the emergence and, separately, the evolution of pictographic systems through a computational lens, most employ non-naturalistic methodologies that make it difficult to draw clear analogies to human and animal cognition. We develop a multi-agent reinforcement learning testbed for emergent communication called a Signification Game, and formulate a model of inferential communication that enables agents to leverage visual theory of mind to communicate actions using pictographs. Our model, which is situated within a broader formalism for animal communication, sheds light on the cognitive and cultural processes underlying the emergence of proto-writing.
