Semantic segmentation of sparse irregular point clouds for leaf/wood discrimination
Yuchen Bai, Jean-Baptiste Durand, Grégoire Vincent, Florence Forbes
TL;DR
This work tackles leaf/wood discrimination in sparse, irregular ULS forest point clouds, a prerequisite for accurate leaf-area estimation. It introduces SOUL, a geometry-centric semantic segmentation pipeline that uses coordinates-only input, multi-scale geometric features, geodesic voxelization decomposition (GVD) for data partitioning, and a rebalanced loss to address extreme class imbalance. The approach achieves state-of-the-art performance on tropical ULS data and shows promising generalization to other forest datasets and LiDAR devices, aided by the GVD pre-partitioning and ablation-driven feature design. By removing reliance on intensity data and focusing on intrinsic geometric structure, SOUL offers a robust, transfer-friendly tool for forest monitoring and leaf-area research, with potential extensions to denser canopies and broader LiDAR configurations.
Abstract
LiDAR (Light Detection and Ranging) has become an essential part of the remote sensing toolbox used for biosphere monitoring. In particular, LiDAR provides the opportunity to map forest leaf area with unprecedented accuracy, while leaf area has remained an important source of uncertainty affecting models of gas exchanges between the vegetation and the atmosphere. Unmanned Aerial Vehicles (UAV) are easy to mobilize and therefore allow frequent revisits to track the response of vegetation to climate change. However, miniature sensors embarked on UAVs usually provide point clouds of limited density, which are further affected by a strong decrease in density from top to bottom of the canopy due to progressively stronger occlusion. In such a context, discriminating leaf points from wood points presents a significant challenge due in particular to strong class imbalance and spatially irregular sampling intensity. Here we introduce a neural network model based on the Pointnet ++ architecture which makes use of point geometry only (excluding any spectral information). To cope with local data sparsity, we propose an innovative sampling scheme which strives to preserve local important geometric information. We also propose a loss function adapted to the severe class imbalance. We show that our model outperforms state-of-the-art alternatives on UAV point clouds. We discuss future possible improvements, particularly regarding much denser point clouds acquired from below the canopy.
