MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Cultural Learning
Mircea Lică, Ojas Shirekar, Baptiste Colle, Chirag Raman
TL;DR
MindForge tackles the limitations of open-weight LLMs in embodied, open-ended environments by endowing agents with explicit Theory of Mind, a natural-language communication channel, and a triadic memory system. By integrating a structured Belief-Desire-Intention framework through the BigToM causal template, agents model both self and partner perspectives and exchange information to refine their beliefs during test-time dialogue. Empirical results in Minecraft show that open-weight LLMs, when collaborating with ToM-enabled peers or experts, close much of the gap to GPT-4 on basic tasks and achieve substantial gains on lifelong learning milestones, with performance scaling with the number of communication rounds. The work also demonstrates a Condorcet-like population boost, suggesting that richer social interaction can yield emergent improvements without increasing model size, aligning with Green AI goals for scalable, accessible embodied AI.
Abstract
Embodied agents powered by large language models (LLMs), such as Voyager, promise open-ended competence in worlds such as Minecraft. However, when powered by open-weight LLMs they still falter on elementary tasks after domain-specific fine-tuning. We propose MindForge, a generative-agent framework for cultural lifelong learning through explicit perspective taking. We introduce three key innovations: (1) a structured theory of mind representation linking percepts, beliefs, desires, and actions; (2) natural inter-agent communication; and (3) a multi-component memory system. Following the cultural learning framework, we test MindForge in both instructive and collaborative settings within Minecraft. In an instructive setting with GPT-4, MindForge agents powered by open-weight LLMs significantly outperform their Voyager counterparts in basic tasks yielding $3\times$ more tech-tree milestones and collecting $2.3\times$ more unique items than the Voyager baseline. Furthermore, in fully \textit{collaborative} settings, we find that the performance of two underachieving agents improves with more communication rounds, echoing the Condorcet Jury Theorem. MindForge agents demonstrate sophisticated behaviors, including expert-novice knowledge transfer, collaborative problem solving, and adaptation to out-of-distribution tasks through accumulated cultural experiences.
