Domain Generalization by Rejecting Extreme Augmentations
Masih Aminbeidokhti, Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli
TL;DR
Domain generalization is improved by a simple augmentation strategy that expands the transformation space and per-sample selects between weak and wider transforms using a reward that blends diversity and semantic consistency. The reward $R(\tilde{x}, z) = (1-\lambda) R_{div}(\tilde{x}, z) - \lambda R_{con}(\tilde{x}, z)$ leverages an EMA teacher to enforce consistency while encouraging varied inputs, and training alternates between reward computation and gradient updates. Evaluations on five DG benchmarks (PACS, VLCS, OfficeHome, TerraIncognita, DomainNet) show competitive to state-of-the-art performance, with TeachDCAug$_{label}$ often achieving the best averages and good robustness when using ViT backbones. Overall, the work demonstrates that carefully controlled aggressive augmentation, filtered by a per-sample reward, is a practical and effective tool for domain generalization across diverse visual domains.
Abstract
Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the training and test data follow the same distribution. For the out-of-domain case, where the test data follow a different and unknown distribution, the best recipe for data augmentation is unclear. In this paper, we show that for out-of-domain and domain generalization settings, data augmentation can provide a conspicuous and robust improvement in performance. To do that, we propose a simple training procedure: (i) use uniform sampling on standard data augmentation transformations; (ii) increase the strength transformations to account for the higher data variance expected when working out-of-domain, and (iii) devise a new reward function to reject extreme transformations that can harm the training. With this procedure, our data augmentation scheme achieves a level of accuracy that is comparable to or better than state-of-the-art methods on benchmark domain generalization datasets. Code: https://github.com/Masseeh/DCAug
