Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework
Zhuohong Li, Fangxiao Lu, Jiaqi Zou, Lei Hu, Hongyan Zhang
TL;DR
The paper tackles the challenge of updating high-resolution land-cover maps to incorporate newly appearing classes under limited labeled data. It presents SegLand, a GFSS-based framework with three components: data preprocessing to balance and augment data, a hybrid segmentation module combining multiple base learners and a Projection onto Orthogonal Prototypes (POP) network, and an ultimate fusion stage to produce robust maps that reflect both base and novel classes. Key contributions include novel data augmentation (NovelCutMix), a four-model base-learner ensemble with average fusion, and a two-stage POP training regime with orthogonality constraints and backbone/decoder selections (notably Swin Transformer + UperNetPlus) that enable effective novel-class updating without degrading base-class performance. The approach achieves top performance in the OpenEarthMap Land Cover Mapping Few-Shot Challenge, demonstrating practical potential for fast, scalable updates of HR land-cover maps with limited annotated data, and offering a pathway to more responsive Earth observation products and downstream applications.
Abstract
Land-cover mapping is one of the vital applications in Earth observation, aiming at classifying each pixel's land-cover type of remote-sensing images. As natural and human activities change the landscape, the land-cover map needs to be rapidly updated. However, discovering newly appeared land-cover types in existing classification systems is still a non-trivial task hindered by various scales of complex land objects and insufficient labeled data over a wide-span geographic area. In this paper, we propose a generalized few-shot segmentation-based framework, named SegLand, to update novel classes in high-resolution land-cover mapping. Specifically, the proposed framework is designed in three parts: (a) Data pre-processing: the base training set and the few-shot support sets of novel classes are analyzed and augmented; (b) Hybrid segmentation structure; Multiple base learners and a modified Projection onto Orthogonal Prototypes (POP) network are combined to enhance the base-class recognition and to dig novel classes from insufficient labels data; (c) Ultimate fusion: the semantic segmentation results of the base learners and POP network are reasonably fused. The proposed framework has won first place in the leaderboard of the OpenEarthMap Land Cover Mapping Few-Shot Challenge. Experiments demonstrate the superiority of the framework for automatically updating novel land-cover classes with limited labeled data.
