RoadRunner M&M -- Learning Multi-range Multi-resolution Traversability Maps for Autonomous Off-road Navigation
Manthan Patel, Jonas Frey, Deegan Atha, Patrick Spieler, Marco Hutter, Shehryar Khattak
TL;DR
RoadRunner M&M advances off-road autonomous navigation by predicting elevation and traversability maps at multiple ranges ($oldsymbol{ imes}$ $ ext{±50m}$ at $0.2m$ and $ ext{±100m}$ at $0.8m$) using a multi-modal, end-to-end network that fuses image BEV and LiDAR voxel features. It learns from self-supervised pseudo ground truth generated via hindsight fusion with X-Racer and satellite DEMs, achieving up to ~50% elevation MAE improvement and ~30% traversability gains over RoadRunner, while increasing map coverage and reducing latency to ~100 ms. The approach includes a hierarchical multi-resolution decoder and a loss scheme that handles observed/unobserved regions and cross-range consistency, and it demonstrates real-time deployment and generalization to diverse out-of-distribution environments. Integration with a planning stack enables high-speed, autonomous off-road navigation in real-world field deployments, highlighting both practical impact and areas for further improvement, such as precise risk localization and uncertainty quantification.
Abstract
Autonomous robot navigation in off-road environments requires a comprehensive understanding of the terrain geometry and traversability. The degraded perceptual conditions and sparse geometric information at longer ranges make the problem challenging especially when driving at high speeds. Furthermore, the sensing-to-mapping latency and the look-ahead map range can limit the maximum speed of the vehicle. Building on top of the recent work RoadRunner, in this work, we address the challenge of long-range (100 m) traversability estimation. Our RoadRunner (M&M) is an end-to-end learning-based framework that directly predicts the traversability and elevation maps at multiple ranges (50 m, 100 m) and resolutions (0.2 m, 0.8 m) taking as input multiple images and a LiDAR voxel map. Our method is trained in a self-supervised manner by leveraging the dense supervision signal generated by fusing predictions from an existing traversability estimation stack (X-Racer) in hindsight and satellite Digital Elevation Maps. RoadRunner M&M achieves a significant improvement of up to 50% for elevation mapping and 30% for traversability estimation over RoadRunner, and is able to predict in 30% more regions compared to X-Racer while achieving real-time performance. Experiments on various out-of-distribution datasets also demonstrate that our data-driven approach starts to generalize to novel unstructured environments. We integrate our proposed framework in closed-loop with the path planner to demonstrate autonomous high-speed off-road robotic navigation in challenging real-world environments. Project Page: https://leggedrobotics.github.io/roadrunner_mm/
