Sample-Efficient and Smooth Cross-Entropy Method Model Predictive Control Using Deterministic Samples
Markus Walker, Daniel Frisch, Uwe D. Hanebeck
TL;DR
dsCEM is proposed, a novel framework that replaces the random sampling step with deterministic samples derived from localized cumulative distributions (LCDs) and introduces modular schemes to generate and adapt these sample sets, incorporating temporal correlations to ensure smooth control trajectories.
Abstract
Cross-entropy method model predictive control (CEM--MPC) is a powerful gradient-free technique for nonlinear optimal control, but its performance is often limited by the reliance on random sampling. This conventional approach can lead to inefficient exploration of the solution space and non-smooth control inputs, requiring a large number of samples to achieve satisfactory results. To address these limitations, we propose deterministic sampling CEM (dsCEM), a novel framework that replaces the random sampling step with deterministic samples derived from localized cumulative distributions (LCDs). Our approach introduces modular schemes to generate and adapt these sample sets, incorporating temporal correlations to ensure smooth control trajectories. This method can be used as a drop-in replacement for the sampling step in existing CEM-based controllers. Experimental evaluations on two nonlinear control tasks demonstrate that dsCEM consistently outperforms state-of-the-art iCEM in terms of cumulative cost and control input smoothness, particularly in the critical low-sample regime.
