Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans
Romain Loiseau, Elliot Vincent, Mathieu Aubry, Loic Landrieu
TL;DR
This work introduces Learnable Earth Parser, an unsupervised framework that decomposes large aerial LiDAR scans into a small set of learnable 3D prototypes. By employing S slots and K learnable prototypes with a probabilistic selection mechanism, the model reconstructs scenes through deformations of prototypes and unions across activated slots, enabling interpretable decompositions and downstream unsupervised instance and semantic segmentation. The approach is trained with a reconstruction-plus-regularization objective that leverages asymmetric Chamfer distances and multiple priors to avoid degenerate solutions, and it is supported by a new Earth Parser Dataset comprising seven diverse aerial LiDAR scenes. Empirical results show competitive reconstruction quality and superior semantic segmentation across scenes, with qualitative demonstrations of interpretable prototypes and instance segmentation. The work demonstrates that scene-specific prototypes can robustly parse complex real-world 3D data, offering practical tools for environmental monitoring and mapping without annotations.
Abstract
We propose an unsupervised method for parsing large 3D scans of real-world scenes with easily-interpretable shapes. This work aims to provide a practical tool for analyzing 3D scenes in the context of aerial surveying and mapping, without the need for user annotations. Our approach is based on a probabilistic reconstruction model that decomposes an input 3D point cloud into a small set of learned prototypical 3D shapes. The resulting reconstruction is visually interpretable and can be used to perform unsupervised instance and low-shot semantic segmentation of complex scenes. We demonstrate the usefulness of our model on a novel dataset of seven large aerial LiDAR scans from diverse real-world scenarios. Our approach outperforms state-of-the-art unsupervised methods in terms of decomposition accuracy while remaining visually interpretable. Our code and dataset are available at https://romainloiseau.fr/learnable-earth-parser/
