Uniform Sampling from the Reachable Set Using Optimal Transport
Karthik Elamvazhuthi, Sachin Shivakumar
TL;DR
This paper reframes reachability sampling under bounded controls as an Optimal Transport problem, aiming to push the joint distribution on initial conditions and controls so that the terminal-state distribution matches a uniform target over the reachable set. It introduces a tractable, particle-based approach by relaxing the hard transport constraint with an $L_2$-based penalty and mollifying it to enable kernel-coupled terminal costs, then demonstrates convergence via Gamma-convergence across successive relaxations. Theoretical results show existence of minimizers and convergence of the relaxed and finite-dimensional problems to the original formulation. Numerical experiments on systems with strong attractors, such as the Vander Pol oscillator and a three-arm planar robot, show significantly improved uniformity of reachable-set sampling compared to random-control strategies, underscoring the method’s potential for robust safety verification and planning.
Abstract
Estimating the reachable set of a dynamical system is a fundamental problem in control theory, particularly when control inputs are bounded. Direct simulation using randomly sampled admissible controls often leads to trajectories that cluster near attractors, resulting in poor coverage of the reachable set. To achieve a more uniform distribution of terminal states, we formulate the problem within an Optimal Transport (OT) framework. In this setting, the goal is to steer the system so that the final state distribution, determined by the chosen controls and initial conditions, matches a desired target distribution. Enforcing this condition exactly is not possible since the reachable set is not known. So we introduce an $L_2$-norm based regularization of the terminal distribution that relaxes the constraint while promoting uniform coverage. The resulting formulation can be approximated by a finite-dimensional, particle-based optimal control problem with kernel-coupled terminal cost. We show that this approach converges to the original formulation and demonstrate through numerical examples that it provides significantly more uniform reachable-set sampling than random control strategies.
