Generalizing to Unseen Domains: A Survey on Domain Generalization
Jindong Wang, Cuiling Lan, Chang Liu, Yidong Ouyang, Tao Qin, Wang Lu, Yiqiang Chen, Wenjun Zeng, Philip S. Yu
TL;DR
This survey comprehensively maps the domain generalization landscape, formalizing the problem, summarizing key theory, and organizing methods into data manipulation, representation learning, and learning strategies. It connects DG with related fields through formal bounds and discusses practical considerations, benchmark datasets, and a public codebase (DeepDG) to standardize evaluation. The work highlights open challenges including continuous and open-domain DG, interpretability, and test-time generalization, offering a roadmap for future research and application across vision, NLP, and healthcare. By synthesizing theory, algorithms, and benchmarks, it provides researchers with concrete directions to build models that robustly generalize to unseen domains.
Abstract
Machine learning systems generally assume that the training and testing distributions are the same. To this end, a key requirement is to develop models that can generalize to unseen distributions. Domain generalization (DG), i.e., out-of-distribution generalization, has attracted increasing interests in recent years. Domain generalization deals with a challenging setting where one or several different but related domain(s) are given, and the goal is to learn a model that can generalize to an unseen test domain. Great progress has been made in the area of domain generalization for years. This paper presents the first review of recent advances in this area. First, we provide a formal definition of domain generalization and discuss several related fields. We then thoroughly review the theories related to domain generalization and carefully analyze the theory behind generalization. We categorize recent algorithms into three classes: data manipulation, representation learning, and learning strategy, and present several popular algorithms in detail for each category. Third, we introduce the commonly used datasets, applications, and our open-sourced codebase for fair evaluation. Finally, we summarize existing literature and present some potential research topics for the future.
