Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng, Hao Luan, Kevin Yuchen Ma, Harold Soh
TL;DR
The paper tackles long-horizon trajectory planning with temporally extended goals specified by $LTL$ in offline settings. It introduces Doppler, an offline hierarchical RL framework that models options with diffusion-based policies and employs a product MDP $M_{\Psi}$ to capture non-Markovian LTL rewards. A key contribution is diversity-guided sampling, inspired by determinantal point processes, to generate a rich but dataset-supported set of options within the offline distribution. Empirical results in simulation and real-world robots show Doppler achieves higher $LTL$ satisfaction and robustness to perturbations compared to baselines, illustrating the practicality of closed-loop, temporally-aware planning from offline data.
Abstract
Safe and successful deployment of robots requires not only the ability to generate complex plans but also the capacity to frequently replan and correct execution errors. This paper addresses the challenge of long-horizon trajectory planning under temporally extended objectives in a receding horizon manner. To this end, we propose DOPPLER, a data-driven hierarchical framework that generates and updates plans based on instruction specified by linear temporal logic (LTL). Our method decomposes temporal tasks into chain of options with hierarchical reinforcement learning from offline non-expert datasets. It leverages diffusion models to generate options with low-level actions. We devise a determinantal-guided posterior sampling technique during batch generation, which improves the speed and diversity of diffusion generated options, leading to more efficient querying. Experiments on robot navigation and manipulation tasks demonstrate that DOPPLER can generate sequences of trajectories that progressively satisfy the specified formulae for obstacle avoidance and sequential visitation. Demonstration videos are available online at: https://philiptheother.github.io/doppler/.
