BMP: Bridging the Gap between B-Spline and Movement Primitives
Weiran Liao, Ge Li, Hongyi Zhou, Rudolf Lioutikov, Gerhard Neumann
TL;DR
The paper introduces BMPs, a framework that reinterprets B-splines as probabilistic Movement Primitives to jointly satisfy boundary conditions and model trajectory distributions. By using a clamped uniform B-spline basis and linear-Gaussian weight modeling, BMPs inherit ProMP-like probabilistic capabilities while ensuring exact start/end positions and velocities. The approach is demonstrated to be more expressive than existing MPs and capable of satisfying velocity bounds and boundary constraints, with successful applications in imitation learning and episodic reinforcement learning, including specialized training schemes (log-likelihood-based and time-pair sampling). The findings suggest BMPs broaden the applicability of B-splines in robot learning and offer enhanced expressiveness for IL and RL tasks, potentially improving planning, learning efficiency, and constraint satisfaction in real-world robotic systems.
Abstract
This work introduces B-spline Movement Primitives (BMPs), a new Movement Primitive (MP) variant that leverages B-splines for motion representation. B-splines are a well-known concept in motion planning due to their ability to generate complex, smooth trajectories with only a few control points while satisfying boundary conditions, i.e., passing through a specified desired position with desired velocity. However, current usages of B-splines tend to ignore the higher-order statistics in trajectory distributions, which limits their usage in imitation learning (IL) and reinforcement learning (RL), where modeling trajectory distribution is essential. In contrast, MPs are commonly used in IL and RL for their capacity to capture trajectory likelihoods and correlations. However, MPs are constrained by their abilities to satisfy boundary conditions and usually need extra terms in learning objectives to satisfy velocity constraints. By reformulating B-splines as MPs, represented through basis functions and weight parameters, BMPs combine the strengths of both approaches, allowing B-splines to capture higher-order statistics while retaining their ability to satisfy boundary conditions. Empirical results in IL and RL demonstrate that BMPs broaden the applicability of B-splines in robot learning and offer greater expressiveness compared to existing MP variants.
