BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators
Fabio Ramos, Rafael Carvalhaes Possas, Dieter Fox
TL;DR
BayesSim addresses the reality gap in robotics by learning a full posterior over simulator parameters from limited real-world observations using likelihood-free inference. It introduces a flexible, MDN-like model with both neural and quasi-Monte Carlo random Fourier features to approximate $q_oldsymbol{ heta}(m{ heta}|oldsymbol{x})$, and shows how to recover the posterior under mismatched priors. Domain randomization guided by the inferred posterior yields policies that generalize more robustly across parameter variations, outperforming uniform-prior DR in several control tasks. The approach treats simulators as black boxes, enabling principled Bayesian system identification and principled policy training for improved Sim2Real transfer, with future work extending to image-based and end-to-end representations.
Abstract
We introduce BayesSim, a framework for robotics simulations allowing a full Bayesian treatment for the parameters of the simulator. As simulators become more sophisticated and able to represent the dynamics more accurately, fundamental problems in robotics such as motion planning and perception can be solved in simulation and solutions transferred to the physical robot. However, even the most complex simulator might still not be able to represent reality in all its details either due to inaccurate parametrization or simplistic assumptions in the dynamic models. BayesSim provides a principled framework to reason about the uncertainty of simulation parameters. Given a black box simulator (or generative model) that outputs trajectories of state and action pairs from unknown simulation parameters, followed by trajectories obtained with a physical robot, we develop a likelihood-free inference method that computes the posterior distribution of simulation parameters. This posterior can then be used in problems where Sim2Real is critical, for example in policy search. We compare the performance of BayesSim in obtaining accurate posteriors in a number of classical control and robotics problems. Results show that the posterior computed from BayesSim can be used for domain randomization outperforming alternative methods that randomize based on uniform priors.
