Learning a Thousand Tasks in a Day

Kamil Dreczkowski; Pietro Vitiello; Vitalis Vosylius; Edward Johns

Learning a Thousand Tasks in a Day

Kamil Dreczkowski, Pietro Vitiello, Vitalis Vosylius, Edward Johns

TL;DR

Learning a Thousand Tasks in a Day tackles data-inefficiency in robotic imitation by proposing two priors: trajectory decomposition into alignment and interaction, and retrieval-based generalisation. The authors introduce MT3, a fully retrieval-based decomposition method, and validate it across 3,450 real-world rollouts and a large-scale 1,000-task evaluation with single demonstrations, revealing strong data efficiency and meaningful generalisation, along with limitations of open-loop interaction. In controlled tests, MT3 outperforms monolithic behavioural cloning in the few-shot regime, while decomposition provides rapid early gains that may plateau with abundant data. The work offers practical guidance for scalable robot learning, highlighting when retrieval-based decomposition is advantageous and outlining avenues to address open-loop and perception-based limitations in real-world manipulation.

Abstract

Humans are remarkably efficient at learning tasks from demonstrations, but today's imitation learning methods for robot manipulation often require hundreds or thousands of demonstrations per task. We investigate two fundamental priors for improving learning efficiency: decomposing manipulation trajectories into sequential alignment and interaction phases, and retrieval-based generalisation. Through 3,450 real-world rollouts, we systematically study this decomposition. We compare different design choices for the alignment and interaction phases, and examine generalisation and scaling trends relative to today's dominant paradigm of behavioural cloning with a single-phase monolithic policy. In the few-demonstrations-per-task regime (<10 demonstrations), decomposition achieves an order of magnitude improvement in data efficiency over single-phase learning, with retrieval consistently outperforming behavioural cloning for both alignment and interaction. Building on these insights, we develop Multi-Task Trajectory Transfer (MT3), an imitation learning method based on decomposition and retrieval. MT3 learns everyday manipulation tasks from as little as a single demonstration each, whilst also generalising to novel object instances. This efficiency enables us to teach a robot 1,000 distinct everyday tasks in under 24 hours of human demonstrator time. Through 2,200 additional real-world rollouts, we reveal MT3's capabilities and limitations across different task families. Videos of our experiments can be found on at https://www.robot-learning.uk/learning-1000-tasks.

Learning a Thousand Tasks in a Day

TL;DR

Abstract

Learning a Thousand Tasks in a Day

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)