Intrinsically-Motivated Humans and Agents in Open-World Exploration
Aly Lidayan, Yuqing Du, Eliza Kosoy, Maria Rufova, Pieter Abbeel, Alison Gopnik
TL;DR
This study investigates intrinsic motivation in open-ended exploration by directly comparing adults, children, and RL agents in Crafter, a Minecraft-like environment, across entropy, information gain, and empowerment objectives. It finds that Entropy and Empowerment reliably correlate with human exploration progress, while Information Gain shows weaker associations; Entropy tends to rise quickly and plateau, whereas Empowerment grows steadily, implying a staged exploration strategy. Agents with intrinsic rewards underperform compared to extrinsic baselines, highlighting the need for better intrinsic reward designs that approximate the human-aligned objectives. The work also suggests that private speech, especially goal verbalizations, may aid children's exploration, pointing to a potential role of language in guiding exploration. The dataset and code are publicly available to spur further cross-disciplinary research in cognitive science and AI.
Abstract
What drives exploration? Understanding intrinsic motivation is a long-standing challenge in both cognitive science and artificial intelligence; numerous objectives have been proposed and used to train agents, yet there remains a gap between human and agent exploration. We directly compare adults, children, and AI agents in a complex open-ended environment, Crafter, and study how common intrinsic objectives: Entropy, Information Gain, and Empowerment, relate to their behavior. We find that only Entropy and Empowerment are consistently positively correlated with human exploration progress, indicating that these objectives may better inform intrinsic reward design for agents. Furthermore, across agents and humans we observe that Entropy initially increases rapidly, then plateaus, while Empowerment increases continuously, suggesting that state diversity may provide more signal in early exploration, while advanced exploration should prioritize control. Finally, we find preliminary evidence that private speech utterances, and particularly goal verbalizations, may aid exploration in children. Our data is available at https://github.com/alyd/humans_in_crafter_data.
