IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning
Ryan Hoque, Ajay Mandlekar, Caelan Garrett, Ken Goldberg, Dieter Fox
TL;DR
IntervenGen (I-Gen) addresses distribution shift in robot imitation learning by autonomously generating a large, diverse set of corrective interventional data from a small number of human interventions. The framework combines closed-loop mistake generation with open-loop recovery replay to synthesize interventional trajectories, greatly expanding state-space coverage while reducing human labeling burden. Evaluations across four simulated tasks and one physical task show up to 39x robustness gains with only 10 interventions, and strong sim-to-real transfer capabilities, highlighting significant improvements in data efficiency and robustness for high-precision manipulation under perception noise.
Abstract
Imitation learning is a promising paradigm for training robot control policies, but these policies can suffer from distribution shift, where the conditions at evaluation time differ from those in the training data. A popular approach for increasing policy robustness to distribution shift is interactive imitation learning (i.e., DAgger and variants), where a human operator provides corrective interventions during policy rollouts. However, collecting a sufficient amount of interventions to cover the distribution of policy mistakes can be burdensome for human operators. We propose IntervenGen (I-Gen), a novel data generation system that can autonomously produce a large set of corrective interventions with rich coverage of the state space from a small number of human interventions. We apply I-Gen to 4 simulated environments and 1 physical environment with object pose estimation error and show that it can increase policy robustness by up to 39x with only 10 human interventions. Videos and more results are available at https://sites.google.com/view/intervengen2024.
