Latent Domain Modeling Improves Robustness to Geographic Shifts

Ruth Crasto; Esther Rolf

Latent Domain Modeling Improves Robustness to Geographic Shifts

Ruth Crasto, Esther Rolf

TL;DR

The paper addresses geographic distribution shift by reframing it as a subpopulation shift and proposes latent domain modeling via location encoders that learn continuous domain latents conditioned on the input. The core method fuses image features with geospatial encodings through a configurable fusion module and is trained with a task loss plus an auxiliary domain-prediction loss that guides the location embeddings. Empirical results across four geo-tagged datasets show consistent improvements in worst-group performance, with new state-of-the-art on WILDS FMoW and PovertyMap, and favorable Pareto trade-offs between worst-group and average accuracy. The approach is efficient, adaptable to various location encoders and fusion strategies, and has practical implications for robust global-scale deployment in geospatial prediction tasks.

Abstract

Geographic distribution shift arises when the distribution of locations on Earth in a training dataset is different from what is seen at inference time. Using standard empirical risk minimization (ERM) in this setting can lead to uneven generalization across different spatially-determined groups of interest such as continents or biomes. The most common approaches to tackling geographic distribution shift apply domain adaptation methods using discrete group labels, ignoring geographic coordinates that are often available as metadata. On the other hand, modeling methods that integrate geographic coordinates have been shown to improve overall performance, but their impact on geographic domain generalization has not been studied. In this work, we propose a general modeling framework for improving robustness to geographic distribution shift. The key idea is to model continuous, latent domain assignment using location encoders and to condition the main task predictor on the jointly-trained latents. On four diverse geo-tagged image datasets with different group splits, we show that instances of our framework achieve significant improvements in worst-group performance compared to existing domain adaptation and location-aware modeling methods. In particular, we achieve new state-of-the-art results on two datasets from the WILDS benchmark.

Latent Domain Modeling Improves Robustness to Geographic Shifts

TL;DR

Abstract

Latent Domain Modeling Improves Robustness to Geographic Shifts

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)