Differentiable Particle Optimization for Fast Sequential Manipulation
Lucas Chen, Shrutheesh Raman Iyer, Zachary Kingston
TL;DR
SPaSM introduces a fully GPU-parallelized framework for sequential robotic manipulation that eliminates CPU-GPU coordination bottlenecks by compiling constraint evaluation, sampling, and gradient-based optimization into end-to-end CUDA kernels. The method solves a two-stage problem: first a parallel particle optimization for object placements (Object Variable CSP), then a joint-space trajectory optimization that lifts placements into robot motions, enabling simultaneous grounding and motion feasibility. Experiments on challenging benchmarks (point-to-point, Tetris packing, and tower stacking) show millisecond-scale solution times with 100% success and up to ~4000× speedups over state-of-the-art baselines, demonstrating real-time replanning capabilities in cluttered environments. The work highlights the importance of integrated grounding and motion optimization and points toward extensions to perception-enabled, receding-horizon, and multi-robot sequential planning.
Abstract
Sequential robot manipulation tasks require finding collision-free trajectories that satisfy geometric constraints across multiple object interactions in potentially high-dimensional configuration spaces. Solving these problems in real-time and at large scales has remained out of reach due to computational requirements. Recently, GPU-based acceleration has shown promising results, but prior methods achieve limited performance due to CPU-GPU data transfer overhead and complex logic that prevents full hardware utilization. To this end, we present SPaSM (Sampling Particle optimization for Sequential Manipulation), a fully GPU-parallelized framework that compiles constraint evaluation, sampling, and gradient-based optimization into optimized CUDA kernels for end-to-end trajectory optimization without CPU coordination. The method consists of a two-stage particle optimization strategy: first solving placement constraints through massively parallel sampling, then lifting solutions to full trajectory optimization in joint space. Unlike hierarchical approaches, SPaSM jointly optimizes object placements and robot trajectories to handle scenarios where motion feasibility constrains placement options. Experimental evaluation on challenging benchmarks demonstrates solution times in the realm of $\textbf{milliseconds}$ with a 100% success rate; a $4000\times$ speedup compared to existing approaches. Code and examples are available at $\href{https://commalab.org/papers/spasm}{commalab.org/papers/spasm}$.
