Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels
Zhuohong Li, Wei He, Jiepan Li, Fangxiao Lu, Hongyan Zhang
TL;DR
This work tackles updating large-scale high-resolution land-cover maps when only low-resolution historical labels are available. It introduces Paraformer, a weakly supervised framework that fuses a resolution-preserving CNN branch with a Transformer-based global modeling branch, augmented by the PLAT module to refine LR labels into reliable supervision. The approach optimizes with L_total = L_ce + L_mce, where L_mce is computed via Mask-Cross-Entropy on iteratively refined mask labels, enabling end-to-end training without HR labels. Experiments on Chesapeake Bay and Poland datasets show Paraformer outperforms state-of-the-art methods in mIoU across diverse LR label scenarios, demonstrating robust HR map updating across wide-spread landforms. The results suggest Paraformer’s practical potential for scalable, accurate HR land-cover updates using readily available LR historical data.
Abstract
Large-scale high-resolution (HR) land-cover mapping is a vital task to survey the Earth's surface and resolve many challenges facing humanity. However, it is still a non-trivial task hindered by complex ground details, various landforms, and the scarcity of accurate training labels over a wide-span geographic area. In this paper, we propose an efficient, weakly supervised framework (Paraformer) to guide large-scale HR land-cover mapping with easy-access historical land-cover data of low resolution (LR). Specifically, existing land-cover mapping approaches reveal the dominance of CNNs in preserving local ground details but still suffer from insufficient global modeling in various landforms. Therefore, we design a parallel CNN-Transformer feature extractor in Paraformer, consisting of a downsampling-free CNN branch and a Transformer branch, to jointly capture local and global contextual information. Besides, facing the spatial mismatch of training data, a pseudo-label-assisted training (PLAT) module is adopted to reasonably refine LR labels for weakly supervised semantic segmentation of HR images. Experiments on two large-scale datasets demonstrate the superiority of Paraformer over other state-of-the-art methods for automatically updating HR land-cover maps from LR historical labels.
