Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

Yurong You; Yan Wang; Wei-Lun Chao; Divyansh Garg; Geoff Pleiss; Bharath Hariharan; Mark Campbell; Kilian Q. Weinberger

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

Yurong You, Yan Wang, Wei-Lun Chao, Divyansh Garg, Geoff Pleiss, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

TL;DR

The paper tackles the cost-depth trade-off in autonomous-driving 3D object detection by addressing depth estimation biases in stereo-based pseudo-LiDAR. It introduces a Stereo Depth Network (SDN) that learns direct depth using a depth-cost volume and a depth loss, plus a Graph-based Depth Correction (GDC) that fuses sparse LiDAR measurements with dense stereo depth through landmark-guided diffusion. On KITTI, SDN improves depth accuracy and BEV/3D detection, and GDC further enhances performance, with pseudo-LiDAR++ (PL++: SDN + GDC) approaching 64-beam LiDAR performance while using only 4 beams and stereo cameras. The approach promises substantial cost reductions and real-time feasibility, demonstrating that combining dense stereo depth with sparse high-precision measurements can rival expensive LiDAR-only systems.

Abstract

Detecting objects such as cars and pedestrians in 3D plays an indispensable role in autonomous driving. Existing approaches largely rely on expensive LiDAR sensors for accurate depth information. While recently pseudo-LiDAR has been introduced as a promising alternative, at a much lower cost based solely on stereo images, there is still a notable performance gap. In this paper we provide substantial advances to the pseudo-LiDAR framework through improvements in stereo depth estimation. Concretely, we adapt the stereo network architecture and loss function to be more aligned with accurate depth estimation of faraway objects --- currently the primary weakness of pseudo-LiDAR. Further, we explore the idea to leverage cheaper but extremely sparse LiDAR sensors, which alone provide insufficient information for 3D detection, to de-bias our depth estimation. We propose a depth-propagation algorithm, guided by the initial depth estimates, to diffuse these few exact measurements across the entire depth map. We show on the KITTI object detection benchmark that our combined approach yields substantial improvements in depth estimation and stereo-based 3D object detection --- outperforming the previous state-of-the-art detection accuracy for faraway objects by 40%. Our code is available at https://github.com/mileyan/Pseudo_Lidar_V2.

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

TL;DR

Abstract

Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)