Learning from Hallucinating Critical Points for Navigation in Dynamic Environments
Saad Abdul Ghani, Kameron Lee, Xuesu Xiao
TL;DR
The paper addresses the challenge of learning to navigate in dynamic environments by generating rich dynamic obstacle datasets without costly demonstrations. It introduces Learning from Hallucinating Critical Points (LfH-CP), a two-stage, self-supervised framework that first identifies critical obstacle configurations and then samples diverse trajectories through these points, guided by a novel dataset-diversity metric. The approach reframes planning under the Learning from Hallucination paradigm, renders obstacle trajectories consistent with optimal plans, and trains a transformer-based motion planner from the synthetic data. Empirical results in simulation show that LfH-CP yields substantially greater dataset coverage than prior hallucination methods and improves navigation success rates compared to baselines, highlighting its potential for scalable, safe learning in dynamic domains.
Abstract
Generating large and diverse obstacle datasets to learn motion planning in environments with dynamic obstacles is challenging due to the vast space of possible obstacle trajectories. Inspired by hallucination-based data synthesis approaches, we propose Learning from Hallucinating Critical Points (LfH-CP), a self-supervised framework for creating rich dynamic obstacle datasets based on existing optimal motion plans without requiring expensive expert demonstrations or trial-and-error exploration. LfH-CP factorizes hallucination into two stages: first identifying when and where obstacles must appear in order to result in an optimal motion plan, i.e., the critical points, and then procedurally generating diverse trajectories that pass through these points while avoiding collisions. This factorization avoids generative failures such as mode collapse and ensures coverage of diverse dynamic behaviors. We further introduce a diversity metric to quantify dataset richness and show that LfH-CP produces substantially more varied training data than existing baselines. Experiments in simulation demonstrate that planners trained on LfH-CP datasets achieves higher success rates compared to a prior hallucination method.
