PATO: Policy Assisted TeleOperation for Scalable Robot Data Collection
Shivin Dass, Karl Pertsch, Hejia Zhang, Youngwoon Lee, Joseph J. Lim, Stefanos Nikolaidis
TL;DR
The paper addresses the problem of inefficiently slow and costly large-scale robotic data collection by introducing Policy Assisted TeleOperation (PATO), a hierarchical, uncertainty-aware assistive system. PATO learns from multi-modal, diverse demonstrations to automate repetitive subtasks via a high-level subgoal predictor and a low-level subgoal-reaching policy, while actively deciding when human input is needed using task and policy uncertainty measures. The key contributions include a conditional-VAE based subgoal predictor, an LSTM-based low-level controller, an ensemble-based uncertainty mechanism, and empirical validation showing reduced operator workload and improved throughput in both real-robot and multi-robot simulated settings. This work demonstrates a feasible pathway toward scalable robotic data collection, enabling a single operator to supervise multiple robots and potentially accelerating downstream robot learning pipelines.
Abstract
Large-scale data is an essential component of machine learning as demonstrated in recent advances in natural language processing and computer vision research. However, collecting large-scale robotic data is much more expensive and slower as each operator can control only a single robot at a time. To make this costly data collection process efficient and scalable, we propose Policy Assisted TeleOperation (PATO), a system which automates part of the demonstration collection process using a learned assistive policy. PATO autonomously executes repetitive behaviors in data collection and asks for human input only when it is uncertain about which subtask or behavior to execute. We conduct teleoperation user studies both with a real robot and a simulated robot fleet and demonstrate that our assisted teleoperation system reduces human operators' mental load while improving data collection efficiency. Further, it enables a single operator to control multiple robots in parallel, which is a first step towards scalable robotic data collection. For code and video results, see https://clvrai.com/pato
