Is clustering enough for LiDAR instance segmentation? A state-of-the-art training-free baseline
Corentin Sautier, Gilles Puy, Alexandre Boulch, Renaud Marlet, Vincent Lepetit
TL;DR
This work tackles LiDAR panoptic segmentation by showing that a training-free baseline can rival supervised methods. It introduces ALPINE, a per-class BEV clustering pipeline that uses a fast kNN graph and connected components to extract instances from semantic predictions, with class-wise thresholds and a box-splitting refinement. Empirical results across SemanticKITTI, nuScenes, and SemanticPOSS demonstrate strong PQ performance, often matching or exceeding state-of-the-art methods without any instance labels or training, and with real-time CPU runtime. The study highlights that current instance heads may be saturated and positions ALPINE as a robust, explainable baseline that can be paired with any semantic backbone. It also provides extensive ablations, upper-bound analyses with oracles, and practical parameter settings, suggesting broad applicability and a clear benchmark for future end-to-end panoptic methods.
Abstract
Panoptic segmentation of LiDAR point clouds is fundamental to outdoor scene understanding, with autonomous driving being a primary application. While state-of-the-art approaches typically rely on end-to-end deep learning architectures and extensive manual annotations of instances, the significant cost and time investment required for labeling large-scale point cloud datasets remains a major bottleneck in this field. In this work, we demonstrate that competitive panoptic segmentation can be achieved using only semantic labels, with instances predicted without any training or annotations. Our method outperforms {most} state-of-the-art supervised methods on standard benchmarks including SemanticKITTI and nuScenes, and outperforms every publicly available method on SemanticKITTI as a drop-in instance head replacement, while running in real-time on a single-threaded CPU and requiring no instance labels. It is fully explainable, and requires no learning or parameter tuning. Alpine combined with state-of-the-art semantic segmentation ranks first on the official panoptic segmentation leaderboard of SemanticKITTI. Code is available at https://github.com/valeoai/Alpine/
