Efficient Learning of Object Placement with Intra-Category Transfer
Adrian Röfer, Russell Buchanan, Max Argus, Sethu Vijayakumar, Abhinav Valada
TL;DR
This work tackles efficient, few-shot learning of long-horizon object placement by transferring observed arrangements to novel instances within a canonical class frame. It introduces canonical class mappings, relative pose distributions in learned feature spaces, and an entropy-guided pose-encoding with model minimization to suppress distractors, enabling intra-category transfer. The approach yields strong simulated performance and demonstrates real-world table-setting with unseen objects, achieving 73.3% of human baseline in human evaluations. The authors also provide a perception-driven real-robot pipeline and release code and datasets publicly, underscoring practical viability for autonomous arrangement tasks.
Abstract
Efficient learning from demonstration for long-horizon tasks remains an open challenge in robotics. While significant effort has been directed toward learning trajectories, a recent resurgence of object-centric approaches has demonstrated improved sample efficiency, enabling transferable robotic skills. Such approaches model tasks as a sequence of object poses over time. In this work, we propose a scheme for transferring observed object arrangements to novel object instances by learning these arrangements on canonical class frames. We then employ this scheme to enable a simple yet effective approach for training models from as few as five demonstrations to predict arrangements of a wide range of objects including tableware, cutlery, furniture, and desk spaces. We propose a method for optimizing the learned models to enable efficient learning of tasks such as setting a table or tidying up an office with intra-category transfer, even in the presence of distractors. We present extensive experimental results in simulation and on a real robotic system for table setting which, based on human evaluations, scored 73.3% compared to a human baseline. We make the code and trained models publicly available at https://oplict.cs.uni-freiburg.de.
