NeRO: Neural Road Surface Reconstruction
Ruibo Wang, Song Zhang, Ping Huang, Donghai Zhang, Haoyu Chen
TL;DR
This work tackles road surface reconstruction for applications like autonomous driving by introducing NeRO, a neural framework that maps world coordinates $ (x,y) $ to road height $ z $, color $ c $, and semantic $ s $ using position-encoding MLPs. It demonstrates compatibility with multiple height sources (vehicle pose, LiDAR, SfM) and robustness to semantic noise, while achieving fast training suitable for road visualization and 4D labeling. The paper systematically compares standard positional encoding and Multi-Resolution Hash Positional Encoding, showing how hash-based encoding improves detail and speed, especially under sparse or noisy semantic supervision. By integrating height, color, and semantics in a unified, mesh-free representation, NeRO enables efficient rendering of semantically rich road surfaces and supports future 4D labeling and semantic grouping tasks in real-world scenes.
Abstract
Accurately reconstructing road surfaces is pivotal for various applications especially in autonomous driving. This paper introduces a position encoding Multi-Layer Perceptrons (MLPs) framework to reconstruct road surfaces, with input as world coordinates x and y, and output as height, color, and semantic information. The effectiveness of this method is demonstrated through its compatibility with a variety of road height sources like vehicle camera poses, LiDAR point clouds, and SFM point clouds, robust to the semantic noise of images like sparse labels and noise semantic prediction, and fast training speed, which indicates a promising application for rendering road surfaces with semantics, particularly in applications demanding visualization of road surface, 4D labeling, and semantic groupings.
