RoboCopilot: Human-in-the-loop Interactive Imitation Learning for Robot Manipulation
Philipp Wu, Yide Shentu, Qiayuan Liao, Ding Jin, Menglong Guo, Koushil Sreenath, Xingyu Lin, Pieter Abbeel
TL;DR
RoboCopilot tackles the inefficiency of passive imitation learning by introducing a human-in-the-loop interactive imitation learning framework for bi-manual manipulation. It combines HG-DAgger with a compliant, bilateral teleoperation system and a continual learning loop, enabling seamless handovers and targeted corrective demonstrations. Through simulation and real-world experiments across picking, transport, and long-horizon tasks, the approach demonstrates improved data quality and higher task success with fewer human interventions, especially when using Batched DAgger. The work highlights a practical, cost-conscious path to scalable interactive learning for contact-rich robotics with long-horizon goals.
Abstract
Learning from human demonstration is an effective approach for learning complex manipulation skills. However, existing approaches heavily focus on learning from passive human demonstration data for its simplicity in data collection. Interactive human teaching has appealing theoretical and practical properties, but they are not well supported by existing human-robot interfaces. This paper proposes a novel system that enables seamless control switching between human and an autonomous policy for bi-manual manipulation tasks, enabling more efficient learning of new tasks. This is achieved through a compliant, bilateral teleoperation system. Through simulation and hardware experiments, we demonstrate the value of our system in an interactive human teaching for learning complex bi-manual manipulation skills.
