Domain Generalization via Discrete Codebook Learning
Shaocong Long, Qianyu Zhou, Xikun Jiang, Chenhao Ying, Lizhuang Ma, Yuan Luo
TL;DR
This work tackles domain generalization by shifting from pixel-level continuous representations to semantic-level discrete representations. It establishes a theoretical basis showing discretization can reduce domain gaps and presents Discrete Domain Generalization (DDG), which quantizes encoder features into a learnable codebook and trains with a teacher–student framework and EMA updates. The method combines classification, consistency, and codebook-related losses, and experiments across PACS, TerraIncognita, and VLCS demonstrate consistent improvements over SOTA with strong generalization and stability. Overall, DDG offers a principled, efficient route to robust DG by prioritizing semantic information through discrete representations.
Abstract
Domain generalization (DG) strives to address distribution shifts across diverse environments to enhance model's generalizability. Current DG approaches are confined to acquiring robust representations with continuous features, specifically training at the pixel level. However, this DG paradigm may struggle to mitigate distribution gaps in dealing with a large space of continuous features, rendering it susceptible to pixel details that exhibit spurious correlations or noise. In this paper, we first theoretically demonstrate that the domain gaps in continuous representation learning can be reduced by the discretization process. Based on this inspiring finding, we introduce a novel learning paradigm for DG, termed Discrete Domain Generalization (DDG). DDG proposes to use a codebook to quantize the feature map into discrete codewords, aligning semantic-equivalent information in a shared discrete representation space that prioritizes semantic-level information over pixel-level intricacies. By learning at the semantic level, DDG diminishes the number of latent features, optimizing the utilization of the representation space and alleviating the risks associated with the wide-ranging space of continuous features. Extensive experiments across widely employed benchmarks in DG demonstrate DDG's superior performance compared to state-of-the-art approaches, underscoring its potential to reduce the distribution gaps and enhance the model's generalizability.
