Constrained Stein Variational Trajectory Optimization
Thomas Power, Dmitry Berenson
TL;DR
CSVTO addresses constrained trajectory optimization by treating trajectories as a distribution and using Stein Variational Gradient Descent to sample multiple, low-cost trajectories that satisfy differentiable equality and inequality constraints. The method extends Orthogonal-Space SVGD with tangent-space projections, slack-variable handling for inequalities, a Gauss-Newton–style constraint correction, a repulsive kernel for diversity, annealing for exploration, and a resampling mechanism to escape local minima. Empirical results on a 12DoF quadrotor, a 7DoF table-surface task, and a wrench manipulation task show improved constraint satisfaction and robustness to disturbances, outperforming penalty-based baselines and competitive with, or superior to, IPOPT under constrained computation budgets. The work demonstrates the practicality of online replanning with diverse constraint-satisfying trajectory sets and outlines future improvements in differentiability, computation time, and kernel design for real-time deployment.
Abstract
We present Constrained Stein Variational Trajectory Optimization (CSVTO), an algorithm for performing trajectory optimization with constraints on a set of trajectories in parallel. We frame constrained trajectory optimization as a novel form of constrained functional minimization over trajectory distributions, which avoids treating the constraints as a penalty in the objective and allows us to generate diverse sets of constraint-satisfying trajectories. Our method uses Stein Variational Gradient Descent (SVGD) to find a set of particles that approximates a distribution over low-cost trajectories while obeying constraints. CSVTO is applicable to problems with differentiable equality and inequality constraints and includes a novel particle re-sampling step to escape local minima. By explicitly generating diverse sets of trajectories, CSVTO is better able to avoid poor local minima and is more robust to initialization. We demonstrate that CSVTO outperforms baselines in challenging highly-constrained tasks, such as a 7DoF wrench manipulation task, where CSVTO outperforms all baselines both in success and constraint satisfaction.
