Planar Robot Casting with Real2Sim2Real Self-Supervised Learning
Vincent Lim, Huang Huang, Lawrence Yunliang Chen, Jonathan Wang, Jeffrey Ichnowski, Daniel Seita, Michael Laskey, Ken Goldberg
TL;DR
This work tackles Planar Robot Casting (PRC), where a single dynamic wrist motion moves a cable's free end toward a target beyond the robot's reach. It introduces Real2Sim2Real (R2S2R), a self-supervised pipeline that collects physical trajectories, tunes simulators to real data via Differential Evolution, generates vast simulated data, and trains policies from a weighted mix of real and simulated samples. The approach yields PRC policies with median endpoint errors between 8% and 14% of cable length, outperforming baselines and single-source data policies across three cables and multiple simulators, highlighting the practicality of simulator-tuned data efficiency for deformable-object control. The work demonstrates that modest physical data, when combined with tuned physics simulators, can substantially close the Sim2Real gap for dynamic cable manipulation and offers pathways to extend to 3D, other deformables, and additional manipulation tasks.
Abstract
This paper introduces the task of {\em Planar Robot Casting (PRC)}: where one planar motion of a robot arm holding one end of a cable causes the other end to slide across the plane toward a desired target. PRC allows the cable to reach points beyond the robot workspace and has applications for cable management in homes, warehouses, and factories. To efficiently learn a PRC policy for a given cable, we propose Real2Sim2Real, a self-supervised framework that automatically collects physical trajectory examples to tune parameters of a dynamics simulator using Differential Evolution, generates many simulated examples, and then learns a policy using a weighted combination of simulated and physical data. We evaluate Real2Sim2Real with three simulators, Isaac Gym-segmented, Isaac Gym-hybrid, and PyBullet, two function approximators, Gaussian Processes and Neural Networks (NNs), and three cables with differing stiffness, torsion, and friction. Results with 240 physical trials suggest that the PRC policies can attain median error distance (as % of cable length) ranging from 8% to 14%, outperforming baselines and policies trained on only real or only simulated examples. Code, data, and videos are available at https://tinyurl.com/robotcast.
