Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Guido Maria D'Amely di Melendugno, Alessandro Flaborea, Pascal Mettes, Fabio Galasso
TL;DR
Hyp$^2$Nav tackles crowd navigation by embedding MDP states in a hyperbolic space, enabling compact, hierarchical representations with only $2$-dimensional latent vectors. It introduces a hyperbolic policy network (HyperPlanner) and a hyperbolic curiosity module (HyperCuriosity) that operate entirely in hyperbolic space, yielding high success rates and rewards with markedly fewer parameters than Euclidean counterparts. The approach demonstrates strong performance in simple and complex CrowdNav scenarios, with interpretable signals such as the hyperbolic radius that correlate with uncertainty and interaction difficulty. Overall, HyperPlanner and HyperCuriosity together enable efficient, interpretable, and robust crowd navigation suitable for real-time embedded deployment and potentially broader hierarchical decision-making tasks.
Abstract
Autonomous robots are increasingly becoming a strong fixture in social environments. Effective crowd navigation requires not only safe yet fast planning, but should also enable interpretability and computational efficiency for working in real-time on embedded devices. In this work, we advocate for hyperbolic learning to enable crowd navigation and we introduce Hyp2Nav. Different from conventional reinforcement learning-based crowd navigation methods, Hyp2Nav leverages the intrinsic properties of hyperbolic geometry to better encode the hierarchical nature of decision-making processes in navigation tasks. We propose a hyperbolic policy model and a hyperbolic curiosity module that results in effective social navigation, best success rates, and returns across multiple simulation settings, using up to 6 times fewer parameters than competitor state-of-the-art models. With our approach, it becomes even possible to obtain policies that work in 2-dimensional embedding spaces, opening up new possibilities for low-resource crowd navigation and model interpretability. Insightfully, the internal hyperbolic representation of Hyp2Nav correlates with how much attention the robot pays to the surrounding crowds, e.g. due to multiple people occluding its pathway or to a few of them showing colliding plans, rather than to its own planned route. The code is available at https://github.com/GDam90/hyp2nav.
