Single-Shot Learning of Stable Dynamical Systems for Long-Horizon Manipulation Tasks
Alexandre St-Aubin, Amin Abyaneh, Hsiu-Chin Lin
TL;DR
The paper addresses the challenge of safely learning long-horizon robotic manipulation with guarantees by decomposing demonstrations into subgoals with waypoints and training globally stable dynamical policies for each segment. AWE is used to extract informative waypoints within each segment, and a high-level cascade controller stitches segment policies to reproduce the full trajectory. Each per-segment SNDS is Lyapunov-stable, enabling robust performance under sensory noise and disturbances, and the approach transfers from simulation to real robots with a single demonstration. Experiments in deterministic and perturbed settings, plus zero-shot sim-to-real tests, show substantial improvements over baselines and demonstrate one-shot, data-efficient learning for long-horizon tasks.
Abstract
Mastering complex sequential tasks continues to pose a significant challenge in robotics. While there has been progress in learning long-horizon manipulation tasks, most existing approaches lack rigorous mathematical guarantees for ensuring reliable and successful execution. In this paper, we extend previous work on learning long-horizon tasks and stable policies, focusing on improving task success rates while reducing the amount of training data needed. Our approach introduces a novel method that (1) segments long-horizon demonstrations into discrete steps defined by waypoints and subgoals, and (2) learns globally stable dynamical system policies to guide the robot to each subgoal, even in the face of sensory noise and random disturbances. We validate our approach through both simulation and real-world experiments, demonstrating effective transfer from simulation to physical robotic platforms. Code is available at https://github.com/Alestaubin/stable-imitation-policy-with-waypoints
