Planning Using Schrödinger Bridge Diffusion Models
Adarsh Srivastava
TL;DR
This paper investigates speeding up offline planning by incorporating cheap priors through a Schrödinger-bridge diffusion framework. It integrates an image-to-image Schrödinger Bridge (I2SB) with the Diffuser planning approach, introducing three prior types (analytical, learned, random) and evaluating on Maze2D tasks to assess sample and sampling efficiency. The findings show that I2SB can outperform DDPM at very low NFEs due to closed-form sampling and informative priors, but DDPM generally catches up or surpasses I2SB at higher NFEs, with learned priors offering the strongest gains among the priors examined. The work highlights the potential and limitations of prior-guided diffusion for planning, suggesting future exploration of more efficient bridging methods and extensions to higher-dimensional trajectory tasks.
Abstract
Offline planning often struggles with poor sampling efficiency as it tries to learn policies from scratch. Especially with diffusion models, such cold start practices mean that both training and sampling become very expensive. We hypothesize that certain environment constraint priors or cheaply available policies make it unnecessary to learn from scratch, and explore a way to incorporate such priors in the learning process. To achieve that, we borrow a variation of the Schrödinger bridge formulation from the image-to-image setting and apply it to planning tasks. We study the performance on some planning tasks and compare the performance against the DDPM formulation. The code for this work is available at https://github.com/adrshsrvstv/bridge_diffusion_planning.
