Uncertainty-driven Exploration Strategies for Online Grasp Learning

Yitian Shi; Philipp Schillinger; Miroslav Gabriel; Alexander Qualmann; Zohar Feldman; Hanna Ziesche; Ngo Anh Vien

Uncertainty-driven Exploration Strategies for Online Grasp Learning

Yitian Shi, Philipp Schillinger, Miroslav Gabriel, Alexander Qualmann, Zohar Feldman, Hanna Ziesche, Ngo Anh Vien

TL;DR

The online learning algorithm with an effective exploration strategy can significantly improve its adaptation performance to unseen environment settings and various uncertainty estimation schemes based on Bayesian uncertainty quantification and distributional ensembles are proposed.

Abstract

Existing grasp prediction approaches are mostly based on offline learning, while, ignoring the exploratory grasp learning during online adaptation to new picking scenarios, i.e., objects that are unseen or out-of-domain (OOD), camera and bin settings, etc. In this paper, we present an uncertainty-based approach for online learning of grasp predictions for robotic bin picking. Specifically, the online learning algorithm with an effective exploration strategy can significantly improve its adaptation performance to unseen environment settings. To this end, we first propose to formulate online grasp learning as an RL problem that will allow us to adapt both grasp reward prediction and grasp poses. We propose various uncertainty estimation schemes based on Bayesian uncertainty quantification and distributional ensembles. We carry out evaluations on real-world bin picking scenes of varying difficulty. The objects in the bin have various challenging physical and perceptual characteristics that can be characterized by semi- or total transparency, and irregular or curved surfaces. The results of our experiments demonstrate a notable improvement of grasp performance in comparison to conventional online learning methods which incorporate only naive exploration strategies. Video: https://youtu.be/fPKOrjC2QrU

Uncertainty-driven Exploration Strategies for Online Grasp Learning

TL;DR

Abstract

Paper Structure (26 sections, 5 equations, 6 figures)

This paper contains 26 sections, 5 equations, 6 figures.

Introduction
Related works
Modern Robot Grasping Methods
Deep RL-based Grasping Methods
Uncertainty Estimation in Deep RL
Uncertainty-driven Offline-to-Online Robotic Grasp Learning
Problem Formulation
Ensemble of Probabilistic ConvSAC
Gaussian-based Uncertainty Estimation
Quantile-based Distributional Learning
Data-efficient Online Training
Multi-processed Online Learning
UCB-based Exploration Policy
Online Training with Data Buffer
Experiments
...and 11 more sections

Figures (6)

Figure 1: Given an image of a picking scene (1st column), every 500 training steps our MV-ConvSACs predicts a grasp reward map (2nd column) and a normalized uncertainty map (3rd/4th columns).
Figure 2: Ensemble of probabilistic ConvSAC network architecture: MV-ConvSACs and QR-ConvSACs.
Figure 3: Grasping experiment setup
Figure 4: Ablation on uncertainty exploration strategies for MV-ConvSACs and QR-ConvSACs
Figure 5: Online training steps for MV- and QR-ConvSACs
...and 1 more figures

Uncertainty-driven Exploration Strategies for Online Grasp Learning

TL;DR

Abstract

Uncertainty-driven Exploration Strategies for Online Grasp Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (6)