Domain Generalization with MixStyle

Kaiyang Zhou; Yongxin Yang; Yu Qiao; Tao Xiang

TL;DR

MixStyle tackles domain generalization by perturbing the style component of features rather than generating images. It randomly mixes per-instance feature statistics across domains in early CNN layers, generating diverse, pseudo-novel styles during training. The method is simple to implement as a plug-in and shows strong improvements on classification (PACS), cross-dataset person re-ID, and reinforcement learning tasks like CoinRun, often outperforming or matching state-of-the-art DG methods with lower overhead. This work demonstrates the value of feature-level augmentation targeting style statistics for robust generalization across domain shifts.

Abstract

Though convolutional neural networks (CNNs) have demonstrated remarkable ability in learning discriminative features, they often generalize poorly to unseen domains. Domain generalization aims to address this problem by learning from a set of source domains a model that is generalizable to any unseen domain. In this paper, a novel approach is proposed based on probabilistically mixing instance-level feature statistics of training samples across source domains. Our method, termed MixStyle, is motivated by the observation that visual domain is closely related to image style (e.g., photo vs. sketch images). Such style information is captured by the bottom layers of a CNN where our proposed style-mixing takes place. Mixing styles of training instances results in novel domains being synthesized implicitly, which increases the domain diversity of the source domains, and hence the generalizability of the trained model. MixStyle fits into mini-batch training perfectly and is extremely easy to implement. The effectiveness of MixStyle is demonstrated on a wide range of tasks including category classification, instance retrieval and reinforcement learning.
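
The style-mixing step described above can be illustrated with a minimal NumPy sketch. It is not the authors' reference implementation, and several details are assumptions because the abstract does not state them: the per-instance, per-channel mean and standard deviation of a feature map are taken as the "feature statistics", the mixing weight is drawn from a Beta distribution, and mixing partners are picked by randomly permuting the mini-batch. The function name `mixstyle` and the parameters `alpha`, `p`, and `eps` are illustrative choices.

```python
import numpy as np


def mixstyle(x, rng, alpha=0.1, p=0.5, eps=1e-6):
    """Mix per-instance feature statistics within a mini-batch (sketch).

    x   : array of shape (B, C, H, W), feature maps from an early CNN layer.
    rng : numpy.random.Generator used for all random draws.
    alpha, p, eps : assumed hyperparameters (Beta concentration, probability
                    of applying the mixing, numerical stabiliser).
    """
    # Apply the mixing only with probability p ("probabilistically mixing").
    if rng.random() >= p:
        return x

    batch_size = x.shape[0]

    # Instance-level statistics: one mean/std per sample and per channel.
    mu = x.mean(axis=(2, 3), keepdims=True)
    sig = np.sqrt(x.var(axis=(2, 3), keepdims=True) + eps)

    # Remove each instance's own style, keeping the normalised content.
    x_norm = (x - mu) / sig

    # Assumption: one Beta-distributed mixing weight per instance.
    lam = rng.beta(alpha, alpha, size=(batch_size, 1, 1, 1))

    # Assumption: mixing partners come from a random permutation of the batch.
    perm = rng.permutation(batch_size)
    mu_mix = lam * mu + (1.0 - lam) * mu[perm]
    sig_mix = lam * sig + (1.0 - lam) * sig[perm]

    # Re-style the normalised content with the mixed statistics.
    return x_norm * sig_mix + mu_mix


rng = np.random.default_rng(0)
features = rng.normal(size=(4, 3, 8, 8))       # toy mini-batch of feature maps
mixed = mixstyle(features, rng, p=1.0)         # p=1.0 forces mixing for the demo
print(mixed.shape)
```

In a training framework the statistics would typically be treated as constants during backpropagation and the operation disabled at test time; both points are left out of this sketch for brevity.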
