Viewpoint Recommendation for Point Cloud Labeling through Interaction Cost Modeling
Yu Zhang, Xinyi Zhao, Chongke Bi, Siming Chen
TL;DR
This work addresses the time burden of annotating 3D point clouds by introducing a viewpoint recommendation framework that minimizes lasso selection time. It anchors the approach in a Fitts' law–derived time-cost model for lassoing in 2D projections and optimizes viewpoints via grid search to minimize the estimated cost. An integrated labeling system demonstrates reduced labeling time and improved user satisfaction in ablation studies, with qualitative comparisons showing advantages over traditional viewpoint strategies. The results suggest that model-based evaluation can guide the design of interactive data-labeling tools and that viewpoint optimization can meaningfully accelerate routine labeling tasks in 3D vision applications.
Abstract
Semantic segmentation of 3D point clouds is important for many applications, such as autonomous driving. To train semantic segmentation models, labeled point cloud segmentation datasets are essential. Meanwhile, point cloud labeling is time-consuming for annotators, which typically involves tuning the camera viewpoint and selecting points by lasso. To reduce the time cost of point cloud labeling, we propose a viewpoint recommendation approach to reduce annotators' labeling time costs. We adapt Fitts' law to model the time cost of lasso selection in point clouds. Using the modeled time cost, the viewpoint that minimizes the lasso selection time cost is recommended to the annotator. We build a data labeling system for semantic segmentation of 3D point clouds that integrates our viewpoint recommendation approach. The system enables users to navigate to recommended viewpoints for efficient annotation. Through an ablation study, we observed that our approach effectively reduced the data labeling time cost. We also qualitatively compare our approach with previous viewpoint selection approaches on different datasets.
