Instance Segmentation for Point Sets
Abhimanyu Talwar, Julien Laasri
TL;DR
This work tackles the memory bottleneck of SGPN, which requires an $N \times N$ similarity matrix for instance segmentation on 3D point sets. It introduces two sub-sampling strategies—Random Sub-sampling and Grid-based Sub-sampling—to operate on a fixed landmark subset of size $K$, then propagate labels to the full point set via nearest-neighbor rules, effectively reducing memory from $O(N^2)$ to $O(K^2)$ and speeding up computation. Experiments on the Stanford Indoor 3D dataset show that sub-sampling can achieve near full-set performance with substantially lower memory and time costs, with random sub-sampling performing best at smaller $K$ while grid-based sampling can better capture certain patterns at comparable $K$. These techniques enable scalable 3D point-set instance segmentation, offering practical routes to deploy PointNet++-based methods on larger scenes. The approaches also open avenues for future work, such as propagation in feature space and alternative landmark selection strategies.
Abstract
Recently proposed neural network architectures like PointNet [QSMG16] and PointNet++ [QYSG17] have made it possible to apply Deep Learning to 3D point sets. The feature representations of shapes learned by these two networks enabled training classifiers for Semantic Segmentation, and more recently for Instance Segmentation via the Similarity Group Proposal Network (SGPN) [WYHN17]. One area of improvement which has been highlighted by SGPN's authors, pertains to use of memory intensive similarity matrices which occupy memory quadratic in the number of points. In this report, we attempt to tackle this issue through use of two sampling based methods, which compute Instance Segmentation on a sub-sampled Point Set, and then extrapolate labels to the complete set using the nearest neigbhour approach. While both approaches perform equally well on large sub-samples, the random-based strategy gives the most improvements in terms of speed and memory usage.
