Table of Contents
Fetching ...

A Dual-Branch Framework for Semantic Change Detection with Boundary and Temporal Awareness

Yun-Cheng Li, Sen Lei, Heng-Chao Li, Ke Li

TL;DR

Semantic Change Detection (SCD) in bi-temporal remote sensing suffers from blurred boundaries and weak temporal modeling. The authors propose DBTANet, a dual-branch Siamese architecture that fuses a frozen Segment Anything Model (SAM) for global semantics and boundary priors with a lightweight ResNet34 for local detail. Two key innovations are introduced: the Gaussian-smoothed Projection Module (GSPM) to purify shallow SAM features and the Bidirectional Temporal Awareness Module (BTAM) to capture multi-scale temporal dependencies, with feature fusion governed by learnable gates $F_{shallow} = (1-α) F^{res}_{shallow} + α GSPM(F^{SAM}_{shallow})$ and $F_{deep} = (1-β) F^{res}_{deep} + β F^{SAM}_{deep}$. Empirical results on Landsat-SCD and SECOND show state-of-the-art performance, validating the effectiveness of combining global priors, boundary cues, temporal reasoning, and boundary-aware constraints for robust SCD.

Abstract

Semantic Change Detection (SCD) aims to detect and categorize land-cover changes from bi-temporal remote sensing images. Existing methods often suffer from blurred boundaries and inadequate temporal modeling, limiting segmentation accuracy. To address these issues, we propose a Dual-Branch Framework for Semantic Change Detection with Boundary and Temporal Awareness, termed DBTANet. Specifically, we utilize a dual-branch Siamese encoder where a frozen SAM branch captures global semantic context and boundary priors, while a ResNet34 branch provides local spatial details, ensuring complementary feature representations. On this basis, we design a Bidirectional Temporal Awareness Module (BTAM) to aggregate multi-scale features and capture temporal dependencies in a symmetric manner. Furthermore, a Gaussian-smoothed Projection Module (GSPM) refines shallow SAM features, suppressing noise while enhancing edge information for boundary-aware constraints. Extensive experiments on two public benchmarks demonstrate that DBTANet effectively integrates global semantics, local details, temporal reasoning, and boundary awareness, achieving state-of-the-art performance.

A Dual-Branch Framework for Semantic Change Detection with Boundary and Temporal Awareness

TL;DR

Semantic Change Detection (SCD) in bi-temporal remote sensing suffers from blurred boundaries and weak temporal modeling. The authors propose DBTANet, a dual-branch Siamese architecture that fuses a frozen Segment Anything Model (SAM) for global semantics and boundary priors with a lightweight ResNet34 for local detail. Two key innovations are introduced: the Gaussian-smoothed Projection Module (GSPM) to purify shallow SAM features and the Bidirectional Temporal Awareness Module (BTAM) to capture multi-scale temporal dependencies, with feature fusion governed by learnable gates and . Empirical results on Landsat-SCD and SECOND show state-of-the-art performance, validating the effectiveness of combining global priors, boundary cues, temporal reasoning, and boundary-aware constraints for robust SCD.

Abstract

Semantic Change Detection (SCD) aims to detect and categorize land-cover changes from bi-temporal remote sensing images. Existing methods often suffer from blurred boundaries and inadequate temporal modeling, limiting segmentation accuracy. To address these issues, we propose a Dual-Branch Framework for Semantic Change Detection with Boundary and Temporal Awareness, termed DBTANet. Specifically, we utilize a dual-branch Siamese encoder where a frozen SAM branch captures global semantic context and boundary priors, while a ResNet34 branch provides local spatial details, ensuring complementary feature representations. On this basis, we design a Bidirectional Temporal Awareness Module (BTAM) to aggregate multi-scale features and capture temporal dependencies in a symmetric manner. Furthermore, a Gaussian-smoothed Projection Module (GSPM) refines shallow SAM features, suppressing noise while enhancing edge information for boundary-aware constraints. Extensive experiments on two public benchmarks demonstrate that DBTANet effectively integrates global semantics, local details, temporal reasoning, and boundary awareness, achieving state-of-the-art performance.
Paper Structure (9 sections, 4 equations, 2 figures, 3 tables)

This paper contains 9 sections, 4 equations, 2 figures, 3 tables.

Figures (2)

  • Figure 1: The overall framework of the proposed DBTANet.
  • Figure 2: Visual comparison on the Landsat-SCD dataset.