WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy, Nirat Saini, Abhinav Shrivastava
TL;DR
The paper tackles the challenge of learning goal-conditioned robotics tasks with minimal supervision by proposing WayEx, a framework that learns from a single demonstration without requiring action-space information. It introduces a proximal waypoint reward mechanism and a knowledge expansion strategy to generalize from the demonstrated trajectory to unseen start and goal states, operating as a wrapper around standard RL algorithms with a sparse reward $R(s,a)=0$ if $s=g$ and $-1$ otherwise. Empirically, WayEx accelerates learning by guiding exploration toward waypoints and demonstrates strong performance across six tasks, often surpassing baselines that rely on many demonstrations, while matching or exceeding results even when baselines receive $100$ demonstrations. The approach offers practical impact by reducing data and computation requirements in robotic learning and improving robustness to sparse rewards, with future work exploring nonlinear state representations and image-based inputs.
Abstract
We propose WayEx, a new method for learning complex goal-conditioned robotics tasks from a single demonstration. Our approach distinguishes itself from existing imitation learning methods by demanding fewer expert examples and eliminating the need for information about the actions taken during the demonstration. This is accomplished by introducing a new reward function and employing a knowledge expansion technique. We demonstrate the effectiveness of WayEx, our waypoint exploration strategy, across six diverse tasks, showcasing its applicability in various environments. Notably, our method significantly reduces training time by 50% as compared to traditional reinforcement learning methods. WayEx obtains a higher reward than existing imitation learning methods given only a single demonstration. Furthermore, we demonstrate its success in tackling complex environments where standard approaches fall short. More information is available at: https://waypoint-ex.github.io.
