A Simple Framework for 3D Occupancy Estimation in Autonomous Driving
Wanshui Gan, Ningkai Mo, Hongbin Xu, Naoto Yokoya
TL;DR
This work introduces SimpleOccupancy, a CNN-based framework to estimate 3D occupancy from surrounding-view images for autonomous driving. It uses a parameter-free 2D-to-3D unprojection followed by 3D CNNs to produce voxel-wise occupancy probabilities, with an optional SDF-based reconstruction pathway; training encompasses both supervised depth-based losses and self-supervised photometric cues. A novel discrete depth metric for occupancy evaluation is proposed, and the approach is benchmarked on DDAD and Nuscenes, showing competitive depth estimation with effective 3D occupancy reconstruction. The authors also explore a point-level pretraining strategy and discuss connections to monocular depth estimation and semantic occupancy, culminating in a practical framework for 3D perception that can leverage unlabeled data for training.
Abstract
The task of estimating 3D occupancy from surrounding-view images is an exciting development in the field of autonomous driving, following the success of Bird's Eye View (BEV) perception. This task provides crucial 3D attributes of the driving environment, enhancing the overall understanding and perception of the surrounding space. In this work, we present a simple framework for 3D occupancy estimation, which is a CNN-based framework designed to reveal several key factors for 3D occupancy estimation, such as network design, optimization, and evaluation. In addition, we explore the relationship between 3D occupancy estimation and other related tasks, such as monocular depth estimation and 3D reconstruction, which could advance the study of 3D perception in autonomous driving. For evaluation, we propose a simple sampling strategy to define the metric for occupancy evaluation, which is flexible for current public datasets. Moreover, we establish the benchmark in terms of the depth estimation metric, where we compare our proposed method with monocular depth estimation methods on the DDAD and Nuscenes datasets and achieve competitive performance. The relevant code will be updated in https://github.com/GANWANSHUI/SimpleOccupancy.
