MATCH POLICY: A Simple Pipeline from Point Cloud Registration to Manipulation Policies
Haojie Huang, Haotian Liu, Dian Wang, Robin Walters, Robert Platt
TL;DR
MATCH POLICY reframes robotic manipulation as a point-cloud registration problem, transferring policy inference from action prediction to registration of pick/place targets against demonstrations. It stores combined point clouds derived from demonstrations and uses optimization-based registration (RANSAC plus colored ICP) to infer multi-step, open-loop actions via relative transforms, achieving high-precision performance with minimal demonstrations. The method leverages equivariant and bi-equivariant properties to enhance sample efficiency and generalization across unseen configurations, cameras, and tasks, with strong results on RLBench benchmarks and successful real-robot deployments. Practically, this approach offers a training-free, plug-and-play tool for industrial-style pick-and-place, capable of handling long horizons and articulated objects, while highlighting limitations related to segmentation requirements and open-loop execution.
Abstract
Many manipulation tasks require the robot to rearrange objects relative to one another. Such tasks can be described as a sequence of relative poses between parts of a set of rigid bodies. In this work, we propose MATCH POLICY, a simple but novel pipeline for solving high-precision pick and place tasks. Instead of predicting actions directly, our method registers the pick and place targets to the stored demonstrations. This transfers action inference into a point cloud registration task and enables us to realize nontrivial manipulation policies without any training. MATCH POLICY is designed to solve high-precision tasks with a key-frame setting. By leveraging the geometric interaction and the symmetries of the task, it achieves extremely high sample efficiency and generalizability to unseen configurations. We demonstrate its state-of-the-art performance across various tasks on RLBench benchmark compared with several strong baselines and test it on a real robot with six tasks.
