Multitask GLocal OBIA-Mamba for Sentinel-2 Landcover Mapping
Zack Dewis, Yimin Zhu, Zhengsen Xu, Mabel Heffring, Saeid Taleghanidoozdoozan, Kaylee Xiao, Motasem Alkayid, Lincoln Linlin Xu
TL;DR
This work tackles Sentinel-2 land use/land cover mapping, hampered by spatial heterogeneity and signature ambiguity. It introduces Multitask Glocal OBIA-Mamba (MSOM), combining an OBIA-Mamba module with a dual-branch GLocal CNN-Mamba architecture and a multitask loss to balance local precision and global coherence. Key innovations include using superpixel tokens within a state space Mamba framework for efficient global modeling, a CNN-Mamba dual pathway for local-global fusion, and a weighted loss ($L_{\text{total}} = \alpha L_{\text{local}} + \beta L_{\text{global}}$) with $\alpha=0.7$, $\beta=0.3$. On Alberta Sentinel-2 data, MSOM outperforms state-of-the-art baselines in OA, AA, and Kappa, while preserving edges and finer details, demonstrating both accuracy and computational efficiency. This approach offers a scalable and robust pipeline for LULC mapping in settings with uncertain ground-truth boundaries.
Abstract
Although Sentinel-2 based land use and land cover (LULC) classification is critical for various environmental monitoring applications, it is a very difficult task due to some key data challenges (e.g., spatial heterogeneity, context information, signature ambiguity). This paper presents a novel Multitask Glocal OBIA-Mamba (MSOM) for enhanced Sentinel-2 classification with the following contributions. First, an object-based image analysis (OBIA) Mamba model (OBIA-Mamba) is designed to reduce redundant computation without compromising fine-grained details by using superpixels as Mamba tokens. Second, a global-local (GLocal) dual-branch convolutional neural network (CNN)-mamba architecture is designed to jointly model local spatial detail and global contextual information. Third, a multitask optimization framework is designed to employ dual loss functions to balance local precision with global consistency. The proposed approach is tested on Sentinel-2 imagery in Alberta, Canada, in comparison with several advanced classification approaches, and the results demonstrate that the proposed approach achieves higher classification accuracy and finer details that the other state-of-the-art methods.
