Generative Classifier for Domain Generalization

Shaocong Long; Qianyu Zhou; Xiangtai Li; Chenhao Ying; Yunhai Tong; Lizhuang Ma; Yuan Luo; Dacheng Tao

TL;DR

This work tackles domain generalization (DG) by challenging the default practice of enforcing strict domain invariance, proposing instead a generative classifier that can model multi-modal, domain-specific feature distributions. The proposed GCDG framework replaces the discriminative linear classifier with a Gaussian Mixture Model (GMM)-based Heterogeneity Learning Classifier (HLC), augmented by Spurious Correlation Blocking (SCB) and Diverse Component Balancing (DCB), to capture beneficial domain-specific information while mitigating spurious patterns. The authors provide theoretical results showing that enforcing invariance can raise the target risk bound, and that relaxing invariance via a generative approach can reduce this bound and promote flat minima. Empirically, GCDG achieves competitive or state-of-the-art performance across five DG benchmarks and a face anti-spoofing dataset, and can be integrated as a plug-in with existing DG methods. The work offers a new direction for DG: exploiting domain-specific information through a principled generative classifier, with practical implications for robust cross-domain understanding.

Abstract

Domain generalization (DG) aims to improve the generalizability of computer vision models under distribution shifts. Mainstream DG methods focus on learning domain invariance; however, such methods overlook the potential inherent in domain-specific information. While the prevailing discriminative linear classifier has been tailored to domain-invariant features, it struggles when confronted with diverse domain-specific information that exhibits multi-modality, e.g., intra-class shifts. To address these issues, we explore the theoretical implications of relying on domain invariance, revealing the crucial role of domain-specific information in reducing the target risk for DG. Drawing on these insights, we propose Generative Classifier-driven Domain Generalization (GCDG), which introduces a generative paradigm for the DG classifier based on Gaussian Mixture Models (GMMs) for each class across domains. GCDG consists of three key modules: Heterogeneity Learning Classifier (HLC), Spurious Correlation Blocking (SCB), and Diverse Component Balancing (DCB). Concretely, HLC models the feature distributions via GMMs and thereby captures valuable domain-specific information. SCB identifies the neural units containing spurious correlations and perturbs them, mitigating the risk of HLC learning spurious patterns. Meanwhile, DCB ensures a balanced contribution of components in HLC, preventing critical components from being underestimated or neglected. In this way, GCDG captures the nuances of domain-specific information characterized by diverse distributions. GCDG demonstrates the potential to reduce the target risk and encourage flat minima, improving generalizability. Extensive experiments show that GCDG achieves competitive performance on five DG benchmarks and one face anti-spoofing dataset, and that it integrates seamlessly into existing DG methods with consistent improvements.
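
To make the idea of a GMM-based generative classifier concrete, the following is a minimal sketch of classification by Bayes' rule over per-class Gaussian mixtures. It is an illustration only, not the authors' HLC: the mixture parameters are fixed by hand rather than learned, covariances are assumed diagonal, and SCB and DCB are not modeled. All function names, class labels, and numeric values are hypothetical.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def class_log_likelihood(x, components):
    """log p(x | class) for a mixture given as (weight, mean, var) tuples."""
    return logsumexp(
        [math.log(w) + log_gaussian_diag(x, mean, var) for w, mean, var in components]
    )


def class_posterior(x, mixtures, priors):
    """Bayes' rule: p(class | x) is proportional to p(class) * p(x | class)."""
    scores = {
        label: math.log(priors[label]) + class_log_likelihood(x, comps)
        for label, comps in mixtures.items()
    }
    norm = logsumexp(list(scores.values()))
    return {label: math.exp(score - norm) for label, score in scores.items()}


# Hypothetical toy features: each class has two modes (e.g., two domains),
# arranged in an XOR pattern so that no single linear boundary separates them.
mixtures = {
    "dog": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [4.0, 4.0], [1.0, 1.0])],
    "cat": [(0.5, [0.0, 4.0], [1.0, 1.0]), (0.5, [4.0, 0.0], [1.0, 1.0])],
}
priors = {"dog": 0.5, "cat": 0.5}

posterior = class_posterior([3.8, 4.1], mixtures, priors)
prediction = max(posterior, key=posterior.get)
```

In this toy layout each class occupies two separate modes, which is the kind of multi-modal, intra-class structure the abstract argues a single linear decision boundary struggles with; a mixture per class handles it by scoring each mode separately.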

