Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds

Qinghao Meng; Junbo Yin; Jianbing Shen; Yunde Jia

Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds

Qinghao Meng, Junbo Yin, Jianbing Shen, Yunde Jia

TL;DR

This work tackles the instability of center-aligned regression in LiDAR-based 3D detection by introducing corner-aligned regression, leveraging dense BEV corner observations to improve geometric consistency. It systematically analyzes five corner-encoding schemes, identifying full-corner encoding as the most robust, and presents a two-stage corner-aware detector that can operate under full or partial supervision using BEV corner annotations and height priors from 2D detections. A practical corner-click annotation protocol and a weak-to-full learning strategy enable recovery of complete 3D boxes from partial signals, including height, with geometric constraints guiding recovery. On KITTI, the method achieves a 3D AP improvement of about 3.4 points over a center-based baseline and reaches approximately 83% of fully supervised accuracy using only BEV corner annotations, underscoring the practicality and scalability of corner-aware regression for 3D detection.

Abstract

Center-aligned regression remains dominant in LiDAR-based 3D object detection, yet it suffers from fundamental instability: object centers often fall in sparse or empty regions of the bird's-eye-view (BEV) due to the front-surface-biased nature of LiDAR point clouds, leading to noisy and inaccurate bounding box predictions. To circumvent this limitation, we revisit bounding box representation and propose corner-aligned regression, which shifts the prediction target from unstable centers to geometrically informative corners that reside in dense, observable regions. Leveraging the inherent geometric constraints among corners and image 2D boxes, partial parameters of 3D bounding boxes can be recovered from corner annotations, enabling a weakly supervised paradigm without requiring complete 3D labels. We design a simple yet effective corner-aware detection head that can be plugged into existing detectors. Experiments on KITTI show our method improves performance by 3.5% AP over center-based baseline, and achieves 83% of fully supervised accuracy using only BEV corner clicks, demonstrating the effectiveness of our corner-aware regression strategy.

Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds

TL;DR

Abstract

Rethinking the Encoding and Annotating of 3D Bounding Box: Corner-Aware 3D Object Detection from Point Clouds

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)