Label-Efficient Point Cloud Segmentation with Active Learning
Johannes Meyer, Jasper Hoffmann, Felix Schulz, Dominik Merkle, Daniel Buescher, Alexander Reiterer, Joschka Boedecker, Wolfram Burgard
TL;DR
This work tackles the high labeling cost in 3D point-cloud semantic segmentation by introducing a lightweight active learning pipeline that partitions scenes into easily annotatable 2D grid columns and selects regions using ensemble uncertainty (Entropy and VaR). It formally measures annotation effort with an area-based metric, and demonstrates competitive or superior performance to state-of-the-art region-based AL methods across S3DIS, Toronto-3D, and Freiburg datasets. The approach reduces preprocessing complexity, maintains strong performance, and reveals that annotated area can be a more meaningful efficiency proxy than the number of annotated points. These findings suggest practical gains for deploying AL in large-scale urban point clouds and highlight avenues for integrating richer augmentations and scalable region resolutions.
Abstract
Semantic segmentation of 3D point cloud data often comes with high annotation costs. Active learning automates the process of selecting which data to annotate, reducing the total amount of annotation needed to achieve satisfactory performance. Recent approaches to active learning for 3D point clouds are often based on sophisticated heuristics for both, splitting point clouds into annotatable regions and selecting the most beneficial for further neural network training. In this work, we propose a novel and easy-to-implement strategy to separate the point cloud into annotatable regions. In our approach, we utilize a 2D grid to subdivide the point cloud into columns. To identify the next data to be annotated, we employ a network ensemble to estimate the uncertainty in the network output. We evaluate our method on the S3DIS dataset, the Toronto-3D dataset, and a large-scale urban 3D point cloud of the city of Freiburg, which we labeled in parts manually. The extensive evaluation shows that our method yields performance on par with, or even better than, complex state-of-the-art methods on all datasets. Furthermore, we provide results suggesting that in the context of point clouds the annotated area can be a more meaningful measure for active learning algorithms than the number of annotated points.
