CFNet: Optimizing Remote Sensing Change Detection through Content-Aware Enhancement
Fan Wu, Sijun Dong, Xiaoliang Meng
TL;DR
CFNet tackles unpredictable style differences in bi-temporal remote sensing change detection by introducing a Content-Aware constraint and a plug-and-play Focuser module that separately learns Changed Content (CC) and Unchanged Content (UCC). The architecture stacks a partial EfficientNet-B5 encoder with Content Focuser and Change decoders, governed by a multi-term loss $L = \alpha L_{main} + \beta L_{cc} + \gamma L_{ucc}$ where $\alpha=1$, $\beta=\gamma=0.1$, and per-scale RM$_i$ reweights guide feature fusion. Content-Aware losses leverage internal structural similarities via random sampling to promote stable unchanged content representations while emphasizing content changes. Empirically, CFNet achieves state-of-the-art results on CLCD, LEVIR-CD, and SYSU-CD, with ablations confirming the complementary benefits of the Content-Aware strategy and Focuser module, and analyses highlighting effective alignment of RM$_4$ with ground truth.
Abstract
Change detection is a crucial and widely applied task in remote sensing, aimed at identifying and analyzing changes occurring in the same geographical area over time. Due to variability in acquisition conditions, bi-temporal remote sensing images often exhibit significant differences in image style. Even with the powerful generalization capabilities of DNNs, these unpredictable style variations between bi-temporal images inevitably affect model's ability to accurately detect changed areas. To address issue above, we propose the Content Focuser Network (CFNet), which takes content-aware strategy as a key insight. CFNet employs EfficientNet-B5 as the backbone for feature extraction. To enhance the model's focus on the content features of images while mitigating the misleading effects of style features, we develop a constraint strategy that prioritizes the content features of bi-temporal images, termed Content-Aware. Furthermore, to enable the model to flexibly focus on changed and unchanged areas according to the requirements of different stages, we design a reweighting module based on the cosine distance between bi-temporal image features, termed Focuser. CFNet achieve outstanding performance across three well-known change detection datasets: CLCD (F1: 81.41%, IoU: 68.65%), LEVIR-CD (F1: 92.18%, IoU: 85.49%), and SYSU-CD (F1: 82.89%, IoU: 70.78%). The code and pretrained models of CFNet are publicly released at https://github.com/wifiBlack/CFNet.
