Center-based 3D Object Detection and Tracking
Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl
TL;DR
CenterPoint introduces a center-based approach for simultaneous 3D object detection and tracking from LiDAR, treating objects as centers rather than axis-aligned boxes. A two-stage architecture first detects centers and regresses full 3D properties, then refines with point features from object faces; tracking uses velocity estimates and greedy closest-point association, avoiding heavy motion models. The method achieves state-of-the-art results on Waymo and nuScenes with single-model backbones and near real-time speed, highlighting strong gains from center-based representation and lightweight refinement. This approach simplifies 3D perception pipelines while delivering high accuracy and robust tracking suitable for autonomous driving.
Abstract
Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges. Objects in a 3D world do not follow any particular orientation, and box-based detectors have difficulties enumerating all orientations or fitting an axis-aligned bounding box to rotated objects. In this paper, we instead propose to represent, detect, and track 3D objects as points. Our framework, CenterPoint, first detects centers of objects using a keypoint detector and regresses to other attributes, including 3D size, 3D orientation, and velocity. In a second stage, it refines these estimates using additional point features on the object. In CenterPoint, 3D object tracking simplifies to greedy closest-point matching. The resulting detection and tracking algorithm is simple, efficient, and effective. CenterPoint achieved state-of-the-art performance on the nuScenes benchmark for both 3D detection and tracking, with 65.5 NDS and 63.8 AMOTA for a single model. On the Waymo Open Dataset, CenterPoint outperforms all previous single model method by a large margin and ranks first among all Lidar-only submissions. The code and pretrained models are available at https://github.com/tianweiy/CenterPoint.
