Unsupervised deep learning for semantic segmentation of multispectral LiDAR forest point clouds
Lassi Ruoppa, Oona Oinonen, Josef Taher, Matti Lehtomäki, Narges Takhtkeshha, Antero Kukko, Harri Kaartinen, Juha Hyyppä
TL;DR
This work tackles leaf--wood segmentation in high-density multispectral ALS forest point clouds without relying on labeled data. It introduces GrowSP-ForMS, an unsupervised deep learning model adapted from GrowSP that leverages covariance-based geometric descriptors, a graph-based superpoint constructor, adaptive clustering weights, and oversegmentation to improve semantic leaf–wood separation on multispectral data. Across extensive boreal forest data, GrowSP-ForMS achieves a mean IoU of $69.6\%$ and mean accuracy of $84.3\%$, outperforming unsupervised baselines and approaching early supervised methods, with multispectral information providing an additional $\sim5.6$ percentage points in mIoU. The results establish a new state-of-the-art for unsupervised leaf--wood segmentation on MS ALS data and outline concrete ablations and future directions (backbone improvements, unsupervised contrastive pretraining, and extension to other LiDAR modalities).
Abstract
Point clouds captured with laser scanning systems from forest environments can be utilized in a wide variety of applications within forestry and plant ecology, such as the estimation of tree stem attributes, leaf angle distribution, and above-ground biomass. However, effectively utilizing the data in such tasks requires the semantic segmentation of the data into wood and foliage points, also known as leaf-wood separation. The traditional approach to leaf-wood separation has been geometry- and radiometry-based unsupervised algorithms, which tend to perform poorly on data captured with airborne laser scanning (ALS) systems, even with a high point density. While recent machine and deep learning approaches achieve great results even on sparse point clouds, they require manually labeled training data, which is often extremely laborious to produce. Multispectral (MS) information has been demonstrated to have potential for improving the accuracy of leaf-wood separation, but quantitative assessment of its effects has been lacking. This study proposes a fully unsupervised deep learning method, GrowSP-ForMS, which is specifically designed for leaf-wood separation of high-density MS ALS point clouds and based on the GrowSP architecture. GrowSP-ForMS achieved a mean accuracy of 84.3% and a mean intersection over union (mIoU) of 69.6% on our MS test set, outperforming the unsupervised reference methods by a significant margin. When compared to supervised deep learning methods, our model performed similarly to the slightly older PointNet architecture but was outclassed by more recent approaches. Finally, two ablation studies were conducted, which demonstrated that our proposed changes increased the test set mIoU of GrowSP-ForMS by 29.4 percentage points (pp) in comparison to the original GrowSP model and that utilizing MS data improved the mIoU by 5.6 pp from the monospectral case.
