EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images

Huili Huang; Chengeng Liu; Danrong Zhang; Shail Patel; Anastasiya Masalava; Sagar Sadak; Parisa Babolhavaeji; WeiHong Low; Max Mahdi Roozbahani; J. David Frost

EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images

Huili Huang, Chengeng Liu, Danrong Zhang, Shail Patel, Anastasiya Masalava, Sagar Sadak, Parisa Babolhavaeji, WeiHong Low, Max Mahdi Roozbahani, J. David Frost

TL;DR

EIDSeg addresses the lack of pixel-level, ground-view labeled data for post-earthquake damage assessment by introducing a large-scale segmentation dataset drawn from social media across nine earthquakes. It implements a three-phase, cross-disciplinary annotation protocol enabling non-experts to produce high-quality pixel-level masks for five infrastructure classes, achieving inter-annotator agreement above 70%. Benchmark experiments across eight state-of-the-art models identify Encoder-only Mask Transformer (EoMT) as the top performer with mIoU around 80.8% and PA around 90.3%, demonstrating strong potential for fine-grained, rapid damage assessment from public imagery. The dataset, along with its rigorous labeling guidelines, lays a foundation for faster, more granular post-disaster analysis and cross-event generalization in real-world rescue and recovery efforts.

Abstract

Rapid post-earthquake damage assessment is crucial for rescue and resource planning. Still, existing remote sensing methods depend on costly aerial images, expert labeling, and produce only binary damage maps for early-stage evaluation. Although ground-level images from social networks provide a valuable source to fill this gap, a large pixel-level annotated dataset for this task is still unavailable. We introduce EIDSeg, the first large-scale semantic segmentation dataset specifically for post-earthquake social media imagery. The dataset comprises 3,266 images from nine major earthquakes (2008-2023), annotated across five classes of infrastructure damage: Undamaged Building, Damaged Building, Destroyed Building, Undamaged Road, and Damaged Road. We propose a practical three-phase cross-disciplinary annotation protocol with labeling guidelines that enables consistent segmentation by non-expert annotators, achieving over 70% inter-annotator agreement. We benchmark several state-of-the-art segmentation models, identifying Encoder-only Mask Transformer (EoMT) as the top-performing method with a Mean Intersection over Union (mIoU) of 80.8%. By unlocking social networks' rich ground-level perspective, our work paves the way for a faster, finer-grained damage assessment in the post-earthquake scenario.

EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images

TL;DR

Abstract

EIDSeg: A Pixel-Level Semantic Segmentation Dataset for Post-Earthquake Damage Assessment from Social Media Images

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)