Beyond the Majority: Long-tail Imitation Learning for Robotic Manipulation
Junhong Zhu, Ji Zhang, Jingkuan Song, Lianli Gao, Heng Tao Shen
TL;DR
This work tackles the long-tail data distribution problem in imitation learning for generalist robotic manipulation policies, showing that tail tasks suffer from degraded spatial reasoning due to data scarcity. It identifies phase-specific failure, particularly in the target-approaching phase, as the main bottleneck and proposes Approaching-Phase Augmentation (APA), which transfers head-task approaching-phase knowledge to tail tasks by head-to-tail object grafting and linguistically aligned co-training. The authors validate APA with extensive simulation on LIBERO-Core-LT and real-world experiments (Real-World-LT), demonstrating significant gains for tail tasks without sacrificing head-task performance. The results highlight APA’s practical appeal, its robustness to task-distribution changes, and its potential to make broad, data-efficient generalist policies more reliable in real-world manipulation scenarios.
Abstract
While generalist robot policies hold significant promise for learning diverse manipulation skills through imitation, their performance is often hindered by the long-tail distribution of training demonstrations. Policies learned on such data, which is heavily skewed towards a few data-rich head tasks, frequently exhibit poor generalization when confronted with the vast number of data-scarce tail tasks. In this work, we conduct a comprehensive analysis of the pervasive long-tail challenge inherent in policy learning. Our analysis begins by demonstrating the inefficacy of conventional long-tail learning strategies (e.g., re-sampling) for improving the policy's performance on tail tasks. We then uncover the underlying mechanism for this failure, revealing that data scarcity on tail tasks directly impairs the policy's spatial reasoning capability. To overcome this, we introduce Approaching-Phase Augmentation (APA), a simple yet effective scheme that transfers knowledge from data-rich head tasks to data-scarce tail tasks without requiring external demonstrations. Extensive experiments in both simulation and real-world manipulation tasks demonstrate the effectiveness of APA. Our code and demos are publicly available at: https://mldxy.github.io/Project-VLA-long-tail/.
