Reference-Free Sampling-Based Model Predictive Control
Fabian Schramm, Pierre Fabre, Nicolas Perrin-Gilbert, Justin Carpentier
TL;DR
The paper addresses the problem of enabling emergent locomotion without gait references or offline pre-training. It proposes a reference-free MPPI framework with a dual-space Hermite spline parameterization and diffusion-inspired noise annealing to explore joint-position and velocity trajectories on CPU hardware. A best-trajectory tracking mechanism and real-time state prediction with warm-starting address robustness under computation delays. Experiments validate emergent gaits, jumping, and balancing on the Go2 quadruped and in simulation on a humanoid, achieving real-time performance with as few as $30$ samples per step and demonstrating cross-platform generality.
Abstract
We present a sampling-based model predictive control (MPC) framework that enables emergent locomotion without relying on handcrafted gait patterns or predefined contact sequences. Our method discovers diverse motion patterns, ranging from trotting to galloping, robust standing policies, jumping, and handstand balancing, purely through the optimization of high-level objectives. Building on model predictive path integral (MPPI), we propose a dual-space spline parameterization that operates on position and velocity control points. Our approach enables contact-making and contact-breaking strategies that adapt automatically to task requirements, requiring only a limited number of sampled trajectories. This sample efficiency allows us to achieve real-time control on standard CPU hardware, eliminating the need for GPU acceleration typically required by other state-of-the-art MPPI methods. We validate our approach on the Go2 quadrupedal robot, demonstrating various emergent gaits and basic jumping capabilities. In simulation, we further showcase more complex behaviors, such as backflips, dynamic handstand balancing and locomotion on a Humanoid, all without requiring reference tracking or offline pre-training.
