Online Hierarchical Policy Learning using Physics Priors for Robot Navigation in Unknown Environments
Wei Han Chen, Yuchen Liu, Alexiy Buynitsky, Ahmed H. Qureshi
TL;DR
This work addresses navigation in large, unknown indoor environments by integrating physics-informed neural time fields within a hierarchical framework. It introduces Modular-NTFields (mNTFields), which combines an online sparse high-level navigation graph with localized neural Eikonal PDE solvers (low-level subnetworks) to efficiently compute cost-to-go maps while mitigating spectral bias and forgetting. The approach employs online room segmentation, modular subnetworks, adaptive sampling, and a TD-based training objective to enable fast, collision-free planning that scales to complex spaces, demonstrated through simulated and real-world robot experiments. The results show faster mapping, higher planning success, and robust real-world deployment, highlighting the method's potential for online exploration, mapping, and navigation in unknown environments.
Abstract
Robot navigation in large, complex, and unknown indoor environments is a challenging problem. The existing approaches, such as traditional sampling-based methods, struggle with resolution control and scalability, while imitation learning-based methods require a large amount of demonstration data. Active Neural Time Fields (ANTFields) have recently emerged as a promising solution by using local observations to learn cost-to-go functions without relying on demonstrations. Despite their potential, these methods are hampered by challenges such as spectral bias and catastrophic forgetting, which diminish their effectiveness in complex scenarios. To address these issues, our approach decomposes the planning problem into a hierarchical structure. At the high level, a sparse graph captures the environment's global connectivity, while at the low level, a planner based on neural fields navigates local obstacles by solving the Eikonal PDE. This physics-informed strategy overcomes common pitfalls like spectral bias and neural field fitting difficulties, resulting in a smooth and precise representation of the cost landscape. We validate our framework in large-scale environments, demonstrating its enhanced adaptability and precision compared to previous methods, and highlighting its potential for online exploration, mapping, and real-world navigation.
