TossingBot: Learning to Throw Arbitrary Objects with Residual Physics

Andy Zeng; Shuran Song; Johnny Lee; Alberto Rodriguez; Thomas Funkhouser

TossingBot: Learning to Throw Arbitrary Objects with Residual Physics

Andy Zeng, Shuran Song, Johnny Lee, Alberto Rodriguez, Thomas Funkhouser

TL;DR

This work introduces TossingBot, a robot system that learns to pick arbitrary objects from an unstructured bin and throw them into distant targets outside its reach. By coupling a physics-based ballistic controller with a learned residual velocity per grasp, the authors present Residual Physics, a hybrid end-to-end model that jointly optimizes grasping and throwing from visual input via self-supervised trial-and-error. The approach achieves high throughput (500+ mean picks per hour) and robust generalization to unseen objects and target locations, outperforming purely physics-based or purely data-driven baselines. Analyses reveal that the network implicitly learns meaningful object semantics and that supervising grasps with throwing success yields more stable, effective grasps for accurate throws.

Abstract

We investigate whether a robot arm can learn to pick and throw arbitrary objects into selected boxes quickly and accurately. Throwing has the potential to increase the physical reachability and picking speed of a robot arm. However, precisely throwing arbitrary objects in unstructured settings presents many challenges: from acquiring reliable pre-throw conditions (e.g. initial pose of object in manipulator) to handling varying object-centric properties (e.g. mass distribution, friction, shape) and dynamics (e.g. aerodynamics). In this work, we propose an end-to-end formulation that jointly learns to infer control parameters for grasping and throwing motion primitives from visual observations (images of arbitrary objects in a bin) through trial and error. Within this formulation, we investigate the synergies between grasping and throwing (i.e., learning grasps that enable more accurate throws) and between simulation and deep learning (i.e., using deep networks to predict residuals on top of control parameters predicted by a physics simulator). The resulting system, TossingBot, is able to grasp and throw arbitrary objects into boxes located outside its maximum reach range at 500+ mean picks per hour (600+ grasps per hour with 85% throwing accuracy); and generalizes to new objects and target locations. Videos are available at https://tossingbot.cs.princeton.edu

TossingBot: Learning to Throw Arbitrary Objects with Residual Physics

TL;DR

Abstract

TossingBot: Learning to Throw Arbitrary Objects with Residual Physics

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)