An Introduction to Zero-Order Optimization Techniques for Robotics
Armand Jordana, Jianghan Zhang, Joseph Amigo, Ludovic Righetti
TL;DR
This work addresses the challenge of derivative-free optimization in robotics by presenting a unified mathematical framework for zero-order methods that connect trajectory optimization and reinforcement learning. It develops a random search-based approach, incorporating Gaussian smoothing and the log-sum-exp MPPI transform, and shows how these ideas subsume and relate to CMA-ES, SPSA, and other gradient-free techniques. The authors demonstrate connections between TO and RL within this framework and derive competitive new RL-style algorithms, highlighting the role of stochasticity in escaping local minima. The results suggest practical benefits for non-differentiable, contact-rich robotics tasks and point toward future directions such as constrained zero-order optimization and scalable parallelization for online and offline settings.
Abstract
Zero-order optimization techniques are becoming increasingly popular in robotics due to their ability to handle non-differentiable functions and escape local minima. These advantages make them particularly useful for trajectory optimization and policy optimization. In this work, we propose a mathematical tutorial on random search. It offers a simple and unifying perspective for understanding a wide range of algorithms commonly used in robotics. Leveraging this viewpoint, we classify many trajectory optimization methods under a common framework and derive novel competitive RL algorithms.
