A Differentiable Physics Engine for Deep Learning in Robotics
Jonas Degrave, Michiel Hermans, Joni Dambre, Francis wyffels
TL;DR
This work addresses the bottleneck of optimizing robotic controllers with non-differentiable physics by introducing a differentiable 3D rigid-body engine implemented in Theano. By enabling analytic gradients and backpropagation through time, the authors demonstrate gradient-based optimization of controllers, including networks with millions of parameters, and show substantial speedups over derivative-free methods. They validate the approach across tasks such as ball throwing, a 4-DOF robot arm, quadruped gait, and a vision-based pendulum, illustrating scalability, GPU batch efficiency, and end-to-end differentiability through perception. The results suggest a practical path for integrating deep learning with physics-driven robotics, offering an alternative to deep Q-learning and enabling faster hardware-software co-design and learning from complex sensors like cameras.
Abstract
An important field in robotics is the optimization of controllers. Currently, robots are often treated as a black box in this optimization process, which is the reason why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximations for the Jacobian. This method quickly grows expensive with increasing numbers of parameters, such as found in deep learning. We propose the implementation of a modern physics engine, which can differentiate control parameters. This engine is implemented for both CPU and GPU. Firstly, this paper shows how such an engine speeds up the optimization process, even for small problems. Furthermore, it explains why this is an alternative approach to deep Q-learning, for using deep learning in robotics. Finally, we argue that this is a big step for deep learning in robotics, as it opens up new possibilities to optimize robots, both in hardware and software.
