Generative Trajectory Stitching through Diffusion Composition
Yunhao Luo, Utkarsh A. Mishra, Yilun Du, Danfei Xu
TL;DR
This work introduces CompDiffuser, a diffusion-based planner that generates long-horizon trajectories by composing overlapping short-horizon chunks learned from short demonstrations. It models the trajectory distribution p_θ(τ|q_s,q_g) in a compositional form, with a single diffusion model conditioning on neighboring chunks and endpoints to propagate information bidirectionally and ensure dynamic consistency. Training uses a unified denoiser ε_θ with objectives 𝓛_nbr, 𝓛_start, and 𝓛_end to encode start and goal conditioning, while inference supports parallel and autoregressive sampling with exponential blending to merge chunks into a coherent plan. Extensive experiments across PointMaze and high-dimensional benchmarks demonstrate strong improvements over baselines, showing CompDiffuser can solve unseen long-horizon tasks from short-horizon data and even support test-time replanning in complex environments.
Abstract
Effective trajectory stitching for long-horizon planning is a significant challenge in robotic decision-making. While diffusion models have shown promise in planning, they are limited to solving tasks similar to those seen in their training data. We propose CompDiffuser, a novel generative approach that can solve new tasks by learning to compositionally stitch together shorter trajectory chunks from previously seen tasks. Our key insight is modeling the trajectory distribution by subdividing it into overlapping chunks and learning their conditional relationships through a single bidirectional diffusion model. This allows information to propagate between segments during generation, ensuring physically consistent connections. We conduct experiments on benchmark tasks of various difficulties, covering different environment sizes, agent state dimension, trajectory types, training data quality, and show that CompDiffuser significantly outperforms existing methods.
