Mimicry and the Emergence of Cooperative Communication
Dylan Cope, Peter McBurney
TL;DR
This work tackles the problem of how cooperative communication can emerge among co-evolving agents by leveraging mimicry of externally generated, useful signals. It combines theoretical analysis of independent versus centralised optimisation with empirical tests in a gridworld, using deep neuroevolution and MAPPO to compare dynamics with and without mimicable signals. The key finding is that mimicry can alter optimisation trajectories, helping systems escape non-communicative local optima and fostering the emergence of communication, though the benefits depend on signal source disambiguation and can entail trade-offs in later refinement. The results offer a principled mechanism to bootstrap communication in multi-agent systems, with implications for designing cooperative AI and understanding language emergence, while outlining directions for handling initialisation effects and potential negative consequences of signal mimicry.
Abstract
In many situations, communication between agents is a critical component of cooperative multi-agent systems, however, it can be difficult to learn or evolve. In this paper, we investigate a simple way in which the emergence of communication may be facilitated. Namely, we explore the effects of when agents can mimic preexisting, externally generated useful signals. The key idea here is that these signals incentivise listeners to develop positive responses, that can then also be invoked by speakers mimicking those signals. This investigation starts with formalising this problem, and demonstrating that this form of mimicry changes optimisation dynamics and may provide the opportunity to escape non-communicative local optima. We then explore the problem empirically with a simulation in which spatially situated agents must communicate to collect resources. Our results show that both evolutionary optimisation and reinforcement learning may benefit from this intervention.
