SAIP-Net: Enhancing Remote Sensing Image Segmentation via Spectral Adaptive Information Propagation
Zhongtao Wang, Xizhe Cao, Yisong Chen, Guoping Wang
TL;DR
This paper tackles semantic segmentation of remote sensing imagery, where large intra-class variance and boundary precision are major challenges. It introduces SAIP-Net, a frequency-aware framework that leverages Spectral Adaptive Information Propagation, combining a Transformer-based encoder with a Spectral Adaptive Feature Fusion decoder, Composite Dilated Convolutions, and a Learnable High-Pass Filter Stem. The approach jointly suppresses disruptive high-frequency noise within regions and sharpens boundary details while expanding receptive fields for better multi-scale context. Experimental results on Potsdam and LoveDA demonstrate improved intra-class consistency and boundary accuracy with competitive model complexity, and ablations validate the contribution of each spectral and architectural component, albeit with some limitations on low-texture classes. The work highlights the practical potential of integrating spectral priors with efficient context modeling for robust remote sensing segmentation.
Abstract
Semantic segmentation of remote sensing imagery demands precise spatial boundaries and robust intra-class consistency, challenging conventional hierarchical models. To address limitations arising from spatial domain feature fusion and insufficient receptive fields, this paper introduces SAIP-Net, a novel frequency-aware segmentation framework that leverages Spectral Adaptive Information Propagation. SAIP-Net employs adaptive frequency filtering and multi-scale receptive field enhancement to effectively suppress intra-class feature inconsistencies and sharpen boundary lines. Comprehensive experiments demonstrate significant performance improvements over state-of-the-art methods, highlighting the effectiveness of spectral-adaptive strategies combined with expanded receptive fields for remote sensing image segmentation.
