WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

Heng Zhai; Jilin Mei; Chen Min; Liang Chen; Fangzhou Zhao; Yu Hu

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

Heng Zhai, Jilin Mei, Chen Min, Liang Chen, Fangzhou Zhao, Yu Hu

TL;DR

WildOcc is introduced, to the authors' knowledge, the first benchmark to provide dense occupancy annotations for off-road 3D semantic occupancy prediction tasks, and a multi-modal 3D semantic occupancy prediction framework, which fuses spatio-temporal information from multi-frame images and point clouds at voxel level.

Abstract

3D semantic occupancy prediction is an essential part of autonomous driving, focusing on capturing the geometric details of scenes. Off-road environments are rich in geometric information, therefore it is suitable for 3D semantic occupancy prediction tasks to reconstruct such scenes. However, most of researches concentrate on on-road environments, and few methods are designed for off-road 3D semantic occupancy prediction due to the lack of relevant datasets and benchmarks. In response to this gap, we introduce WildOcc, to our knowledge, the first benchmark to provide dense occupancy annotations for off-road 3D semantic occupancy prediction tasks. A ground truth generation pipeline is proposed in this paper, which employs a coarse-to-fine reconstruction to achieve a more realistic result. Moreover, we introduce a multi-modal 3D semantic occupancy prediction framework, which fuses spatio-temporal information from multi-frame images and point clouds at voxel level. In addition, a cross-modality distillation function is introduced, which transfers geometric knowledge from point clouds to image features.

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

TL;DR

Abstract

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)