DiPPeR: Diffusion-based 2D Path Planner applied on Legged Robots
Jianwei Liu, Maria Stamatopoulou, Dimitrios Kanoulas
TL;DR
DiPPeR presents a diffusion-based 2D path planner for quadrupedal robots, conditioning trajectory generation on map images and inpainting-based start/goal positions. It trains a DDPM with a ResNet-18 visual encoder and FiLM conditioning on a large dataset of $10{,}000$ maps with $100$ trajectories each, using a horizon of $T=180$ and diffusion steps $k=1000$, to produce trajectories $A_t$. Inference achieves about $0.4$ s per trajectory and an average feasibility of ${87\%}$, outperforming $A^*$, Neural $A^*$, and ViT-$A^*$ by roughly ${23\times}$ on varying map sizes and obstacle structures; real-world deployment on Spot and Go1 demonstrates platform-agnostic integration with ROS navigation and local planners. The work highlights both the practical potential and current limits, notably dependence on horizon estimation, with future directions including transformer-based diffusion models and expanded map representations.
Abstract
In this work, we present DiPPeR, a novel and fast 2D path planning framework for quadrupedal locomotion, leveraging diffusion-driven techniques. Our contributions include a scalable dataset generator for map images and corresponding trajectories, an image-conditioned diffusion planner for mobile robots, and a training/inference pipeline employing CNNs. We validate our approach in several mazes, as well as in real-world deployment scenarios on Boston Dynamic's Spot and Unitree's Go1 robots. DiPPeR performs on average 23 times faster for trajectory generation against both search based and data driven path planning algorithms with an average of 87% consistency in producing feasible paths of various length in maps of variable size, and obstacle structure. Website: https://rpl-cs-ucl.github.io/DiPPeR
