Learning with AMIGo: Adversarially Motivated Intrinsic Goals
Andres Campero, Roberta Raileanu, Heinrich Küttler, Joshua B. Tenenbaum, Tim Rocktäschel, Edward Grefenstette
TL;DR
AMIGo introduces a meta-learning framework in which a goal-generating teacher provides adversarially motivated intrinsic goals to a goal-conditioned student, creating an automatic curriculum that enhances exploration under sparse rewards. The teacher and student optimize an adversarial yet constructive objective, enabling the agent to progressively tackle harder tasks in procedurally generated MiniGrid environments. Across 114 experiments and six tasks, AMIGo delivers state-of-the-art results on challenging environments and demonstrates improved sample efficiency relative to prior intrinsic-motivation methods. The work provides a model-agnostic approach to improving exploration in RL with potential extensions to language goals, partial observability, and continuous-control settings.
Abstract
A key challenge for reinforcement learning (RL) consists of learning in environments with sparse extrinsic rewards. In contrast to current RL methods, humans are able to learn new skills with little or no reward by using various forms of intrinsic motivation. We propose AMIGo, a novel agent incorporating -- as form of meta-learning -- a goal-generating teacher that proposes Adversarially Motivated Intrinsic Goals to train a goal-conditioned "student" policy in the absence of (or alongside) environment reward. Specifically, through a simple but effective "constructively adversarial" objective, the teacher learns to propose increasingly challenging -- yet achievable -- goals that allow the student to learn general skills for acting in a new environment, independent of the task to be solved. We show that our method generates a natural curriculum of self-proposed goals which ultimately allows the agent to solve challenging procedurally-generated tasks where other forms of intrinsic motivation and state-of-the-art RL methods fail.
