Wanting to be Understood
Chrisantha Fernando, Dylan Banarse, Simon Osindero
TL;DR
This work investigates whether intrinsic motivations for mutual understanding, rather than extrinsic rewards alone, can drive social interaction in multi-agent systems. Using a 1D testbed based on the perceptual crossing paradigm (PCP), agents are trained with PPO and an LSTM predictor, and artificial curiosity is compared with three reciprocal drives: imitation/imitation-by, influence/impressionability, and sub-reaction-time anticipation. The results show that reciprocal drives, which reward both understanding others and being understood, yield sustained self-other coordination and can enable cooperation in tasks where only one agent receives external reward. This suggests a plausible computational account of early intersubjectivity and has implications for designing cooperative AI that can bootstrap joint behavior without explicitly shared task incentives.
Abstract
This paper explores an intrinsic motivation for mutual awareness, hypothesizing that humans possess a fundamental drive to understand and to be understood even in the absence of extrinsic rewards. Through simulations of the perceptual crossing paradigm, we examine the effect of various internal reward functions in reinforcement learning agents. The drive to understand is implemented as an active-inference-style artificial curiosity reward, whereas the drive to be understood is implemented through intrinsic rewards for imitation, influence/impressionability, and sub-reaction-time anticipation of the other. Results indicate that while artificial curiosity alone does not lead to a preference for social interaction, rewards emphasizing reciprocal understanding successfully drive agents to prioritize interaction. We demonstrate that this intrinsic motivation can facilitate cooperation in tasks where only one agent receives extrinsic reward for the behaviour of the other.
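To make the contrast between the two kinds of drive concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a curiosity reward equal to the predictor's mean squared error, and a reciprocal imitation reward that scores both imitating the other and being imitated by the other at a fixed lag. The function names, the `lag` parameter, and the distance measures are all hypothetical choices made for illustration.

```python
import numpy as np


def curiosity_reward(predicted_obs, actual_obs):
    """Assumed form: mean squared prediction error of the agent's own predictor.

    Larger values mean the observation was more surprising.
    """
    predicted = np.asarray(predicted_obs, dtype=float)
    actual = np.asarray(actual_obs, dtype=float)
    return float(np.mean((predicted - actual) ** 2))


def lagged_similarity(follower, leader, lag):
    """Negative mean absolute difference between follower[t] and leader[t - lag]."""
    follower = np.asarray(follower, dtype=float)
    leader = np.asarray(leader, dtype=float)
    return -float(np.mean(np.abs(follower[lag:] - leader[:-lag])))


def reciprocal_imitation_reward(own_actions, other_actions, lag=1):
    """Assumed form: reward for imitating plus reward for being imitated.

    imitating:      own actions now resemble the other's earlier actions
    being imitated: the other's actions now resemble own earlier actions
    """
    imitating = lagged_similarity(own_actions, other_actions, lag)
    imitated_by = lagged_similarity(other_actions, own_actions, lag)
    return imitating + imitated_by


if __name__ == "__main__":
    own = [0.0, 1.0, 2.0, 3.0]
    other = [1.0, 2.0, 3.0, 4.0]
    print(curiosity_reward([0.0, 0.0], [1.0, 1.0]))
    print(reciprocal_imitation_reward(own, other, lag=1))
```

In this sketch the curiosity term depends only on the agent's own prediction error, so any sufficiently unpredictable signal can satisfy it, whereas the reciprocal term is high only when each agent's behaviour tracks the other's. That is one way to read the abstract's finding that curiosity alone does not produce a preference for social interaction.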
