Bridging the Sim-to-Real Gap with Bayesian Inference

Jonas Rothfuss; Bhavya Sukhija; Lenart Treven; Florian Dörfler; Stelian Coros; Andreas Krause

Bridging the Sim-to-Real Gap with Bayesian Inference

Jonas Rothfuss, Bhavya Sukhija, Lenart Treven, Florian Dörfler, Stelian Coros, Andreas Krause

TL;DR

It is empirically show that learning with implicit physical priors results in accurate mean model estimation as well as precise uncertainty quantification in Sim-FSVGD, and demonstrates the effectiveness of Sim-FSVGD in bridging the sim-to-real gap on a high-performance RC racecar system.

Abstract

We present SIM-FSVGD for learning robot dynamics from data. As opposed to traditional methods, SIM-FSVGD leverages low-fidelity physical priors, e.g., in the form of simulators, to regularize the training of neural network models. While learning accurate dynamics already in the low data regime, SIM-FSVGD scales and excels also when more data is available. We empirically show that learning with implicit physical priors results in accurate mean model estimation as well as precise uncertainty quantification. We demonstrate the effectiveness of SIM-FSVGD in bridging the sim-to-real gap on a high-performance RC racecar system. Using model-based RL, we demonstrate a highly dynamic parking maneuver with drifting, using less than half the data compared to the state of the art.

Bridging the Sim-to-Real Gap with Bayesian Inference

TL;DR

Abstract

Paper Structure (21 sections, 5 equations, 7 figures, 2 algorithms)

This paper contains 21 sections, 5 equations, 7 figures, 2 algorithms.

Introduction
Related Work.
Bridging the sim-to-real gap
Neural Networks for System Identification
Bayesian Neural Networks
Background
Learning Dynamics Models with BNNs
BNN Inference in the Function Space
Sim-FSVGD
Illustrative Example: the Pendulum
Main Algorithm
Domain-model process
Sim-to-real prior
Estimating the Stochastic Process Score
Experiments
...and 6 more sections

Figures (7)

Figure 1: Dynamic RC car from our experiments.
Figure 2: BNN posteriors trained on two data points for a one-dimensional sinusoidal function. GreyBox finds accurate mean, but has large uncertainty estimates. At the same time Sim-FSVGD obtains both accurate mean and uncertainty estimates.
Figure 3: We benchmark NLL of dynamics model on test data for Racecar on hardware and in simulations and Pendulum in simulation. BNNs with simulation priors achieve low NLL already in small data regimes and constantly outperform the standard BNNs when we increase the number of train data points.
Figure 4: Desired reverse parking maneuver which involves rotating the car 180° and parking ca. 2m away.
Figure 5: Realized trajectory for Racecar environment in simulation after episode 3. In violet, we have the trajectory of the model with FSVGD (no prior), and in red the trajectory of Sim-FSVGD using the low-fidelity prior. We observe that, Sim-FSVGD already learns to get close to the target in three episodes, whereas FSVGD (no prior) explores away from it.
...and 2 more figures

Bridging the Sim-to-Real Gap with Bayesian Inference

TL;DR

Abstract

Bridging the Sim-to-Real Gap with Bayesian Inference

Authors

TL;DR

Abstract

Table of Contents

Figures (7)