Rapid Exploration for Open-World Navigation with Latent Goal Models
Dhruv Shah, Benjamin Eysenbach, Gregory Kahn, Nicholas Rhinehart, Sergey Levine
TL;DR
The paper tackles robust, goal-directed navigation in open-world environments by learning a latent goal representation through an information bottleneck and pairing it with a non-parametric topological memory. RECON samples feasible latent goals to drive frontier-based exploration and incrementally builds a topological map to reach user-specified visual goals. In real-world tests across eight outdoor environments, RECON discovers goals up to 80 meters away in under 20 minutes and remains robust to distractors and non-stationary changes, outperforming several baselines. The dataset used for offline training and evaluation is released to support future research in real-world visual navigation. These contributions offer a practical, data-efficient framework for rapid exploration and navigation without explicit geometric maps.
Abstract
We describe a robotic learning system for autonomous exploration and navigation in diverse, open-world environments. At the core of our method is a learned latent variable model of distances and actions, along with a non-parametric topological memory of images. We use an information bottleneck to regularize the learned policy, giving us (i) a compact visual representation of goals, (ii) improved generalization capabilities, and (iii) a mechanism for sampling feasible goals for exploration. Trained on a large offline dataset of prior experience, the model acquires a representation of visual goals that is robust to task-irrelevant distractors. We demonstrate our method on a mobile ground robot in open-world exploration scenarios. Given an image of a goal that is up to 80 meters away, our method leverages its representation to explore and discover the goal in under 20 minutes, even amidst previously-unseen obstacles and weather conditions. Please check out the project website for videos of our experiments and information about the real-world dataset used at https://sites.google.com/view/recon-robot.
