Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts

Sumin Lee; Sungwon Park; Jeasurk Yang; Jihee Kim; Meeyoung Cha

Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts

Sumin Lee, Sungwon Park, Jeasurk Yang, Jihee Kim, Meeyoung Cha

TL;DR

GRAM tackles cross-region generalization in satellite-based slum segmentation by coupling a Mixture-of-Experts backbone with test-time adaptation. It learns region-specific adapters via adaptive routing, while enforcing universal representations through mutual-information regularization, and filters pseudo-labels with cross-region prediction consistency during target adaptation. Evaluated on a large, multi-continental dataset with three unseen African cities, GRAM outperforms state-of-the-art baselines, especially in low-resource settings, demonstrating label-efficient scalability for global slum monitoring. By enabling temporal tracking of informal settlements, GRAM provides actionable, data-driven insights to support urban policy and planning in data-scarce contexts.

Abstract

Satellite-based slum segmentation holds significant promise in generating global estimates of urban poverty. However, the morphological heterogeneity of informal settlements presents a major challenge, hindering the ability of models trained on specific regions to generalize effectively to unseen locations. To address this, we introduce a large-scale high-resolution dataset and propose GRAM (Generalized Region-Aware Mixture-of-Experts), a two-phase test-time adaptation framework that enables robust slum segmentation without requiring labeled data from target regions. We compile a million-scale satellite imagery dataset from 12 cities across four continents for source training. Using this dataset, the model employs a Mixture-of-Experts architecture to capture region-specific slum characteristics while learning universal features through a shared backbone. During adaptation, prediction consistency across experts filters out unreliable pseudo-labels, allowing the model to generalize effectively to previously unseen regions. GRAM outperforms state-of-the-art baselines in low-resource settings such as African cities, offering a scalable and label-efficient solution for global slum mapping and data-driven urban planning.

Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts

TL;DR

Abstract

Generalizable Slum Detection from Satellite Imagery with Mixture-of-Experts

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)