NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields
Junge Zhang, Feihu Zhang, Shaochen Kuang, Li Zhang
TL;DR
NeRF-LiDAR addresses the high cost of labeling LiDAR data for autonomous driving by learning a NeRF-based implicit representation of real-world driving scenes from multi-view images and sparse LiDAR, and then rendering realistic LiDAR point clouds with per-point semantic labels. It integrates a NeRF reconstruction module with a generation pipeline that includes raydrop, equirectangular projection, and feature- and point-level alignments, guided by weak 2D labels and limited 3D annotations. Experiments on nuScenes show that models trained on NeRF-LiDAR data achieve competitive or superior performance to those trained on real data, with substantial gains from pre-training on simulated data and effective fine-tuning with limited real data. The approach enables realistic multi-sensor simulation, supports new sensor configurations, and yields efficient rendering, offering a practical path to data-efficient 3D perception for autonomous systems.
Abstract
Labeling LiDAR point clouds for training autonomous driving is extremely expensive and difficult. LiDAR simulation aims at generating realistic LiDAR data with labels for training and verifying self-driving algorithms more efficiently. Recently, Neural Radiance Fields (NeRF) have been proposed for novel view synthesis using implicit reconstruction of 3D scenes. Inspired by this, we present NeRF-LIDAR, a novel LiDAR simulation method that leverages real-world information to generate realistic LIDAR point clouds. Different from existing LiDAR simulators, we use real images and point cloud data collected by self-driving cars to learn the 3D scene representation, point cloud generation and label rendering. We verify the effectiveness of our NeRF-LiDAR by training different 3D segmentation models on the generated LiDAR point clouds. It reveals that the trained models are able to achieve similar accuracy when compared with the same model trained on the real LiDAR data. Besides, the generated data is capable of boosting the accuracy through pre-training which helps reduce the requirements of the real labeled data.
