Table of Contents
Fetching ...

TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model

Changhong Hou, Junchuan Yu, Daqing Ge, Liu Yang, Laidian Xi, Yunxuan Pang, Yi Wen

TL;DR

TransLandSeg is proposed, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM) and outperforms traditional semantic segmentation models on both the Landslide4Sense dataset and the Bijie landslide dataset.

Abstract

Landslides are one of the most destructive natural disasters in the world, posing a serious threat to human life and safety. The development of foundation models has provided a new research paradigm for large-scale landslide detection. The Segment Anything Model (SAM) has garnered widespread attention in the field of image segmentation. However, our experiment found that SAM performed poorly in the task of landslide segmentation. We propose TransLandSeg, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM). TransLandSeg outperforms traditional semantic segmentation models on both the Landslide4Sense dataset and the Bijie landslide dataset. Our proposed adaptive transfer learning (ATL) architecture enables the powerful segmentation capability of SAM to be transferred to landslide detection by training only 1.3% of the number of the parameters of SAM, which greatly improves the training efficiency of the model. Finally we also conducted ablation experiments on models with different ATL structures, concluded that the deployment location and residual connection of ATL play an important role in TransLandSeg accuracy improvement.

TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model

TL;DR

TransLandSeg is proposed, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM) and outperforms traditional semantic segmentation models on both the Landslide4Sense dataset and the Bijie landslide dataset.

Abstract

Landslides are one of the most destructive natural disasters in the world, posing a serious threat to human life and safety. The development of foundation models has provided a new research paradigm for large-scale landslide detection. The Segment Anything Model (SAM) has garnered widespread attention in the field of image segmentation. However, our experiment found that SAM performed poorly in the task of landslide segmentation. We propose TransLandSeg, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM). TransLandSeg outperforms traditional semantic segmentation models on both the Landslide4Sense dataset and the Bijie landslide dataset. Our proposed adaptive transfer learning (ATL) architecture enables the powerful segmentation capability of SAM to be transferred to landslide detection by training only 1.3% of the number of the parameters of SAM, which greatly improves the training efficiency of the model. Finally we also conducted ablation experiments on models with different ATL structures, concluded that the deployment location and residual connection of ATL play an important role in TransLandSeg accuracy improvement.
Paper Structure (25 sections, 15 equations, 10 figures, 4 tables, 1 algorithm)

This paper contains 25 sections, 15 equations, 10 figures, 4 tables, 1 algorithm.

Figures (10)

  • Figure 1: Structure of the proposed TransLandSeg and Segment Anything Model (SAM).
  • Figure 2: Adaptive transfer learning structure. (a) ATL structure. (b) MidLay_m structure. (c) MidLay_c structure.
  • Figure 3: Four different adaptive transfer learning structures. (a) 2-MidLay_m. (b) 2-MidLay_c. (c) 3-MidLay_c. (d) 2-MidLay_m + 3-MidLay_c.
  • Figure 4: ATL placed inside the Transformer block. (a) The original Transformer block structure in SAM's image encoder. (b) Structure of ATL placed inside the Transformer block. (c) ATL placed inside the Transformer block to train the model.
  • Figure 5: ATL without residual connection. (a) ATL without residual connection placed outside the Transformer block. (b) Schematic of ATL without residual connection placed inside the Transformer block. (c) ATL without residual connection placed inside the Transformer block to train the model.
  • ...and 5 more figures