Learning to Explore using Active Neural SLAM
Devendra Singh Chaplot, Dhiraj Gandhi, Saurabh Gupta, Abhinav Gupta, Ruslan Salakhutdinov
TL;DR
This paper presents Active Neural SLAM (ANS), a modular, hierarchical navigation framework that blends a learned Neural SLAM with a Global policy and a Local policy connected via an analytical planner. By training the components separately within a classical navigation pipeline, ANS achieves robust exploration in realistic 3D environments and demonstrates strong transfer to real-world robotics and the PointGoal task, including significant sample-efficiency gains. Key contributions include a realistic actuation/sensor noise model, a Mapper + Pose Estimator Neural SLAM module, and empirical evidence of superior performance and generalization over end-to-end baselines. The approach advances practical autonomous exploration by leveraging learning where it benefits most while retaining the reliability and efficiency of traditional planning, with demonstrated impact on Habitat benchmarks and real-world transfer.
Abstract
This work presents a modular and hierarchical approach to learn policies for exploring 3D environments, called `Active Neural SLAM'. Our approach leverages the strengths of both classical and learning-based methods, by using analytical path planners with learned SLAM module, and global and local policies. The use of learning provides flexibility with respect to input modalities (in the SLAM module), leverages structural regularities of the world (in global policies), and provides robustness to errors in state estimation (in local policies). Such use of learning within each module retains its benefits, while at the same time, hierarchical decomposition and modular training allow us to sidestep the high sample complexities associated with training end-to-end policies. Our experiments in visually and physically realistic simulated 3D environments demonstrate the effectiveness of our approach over past learning and geometry-based approaches. The proposed model can also be easily transferred to the PointGoal task and was the winning entry of the CVPR 2019 Habitat PointGoal Navigation Challenge.
