Improving out-of-distribution generalization via multi-task self-supervised pretraining

Isabela Albuquerque; Nikhil Naik; Junnan Li; Nitish Keskar; Richard Socher

Improving out-of-distribution generalization via multi-task self-supervised pretraining

Isabela Albuquerque, Nikhil Naik, Junnan Li, Nitish Keskar, Richard Socher

TL;DR

The paper investigates domain generalization in computer vision and demonstrates that multi-task self-supervised pretraining can match or surpass supervised pretraining for unseen domains. It introduces a novel Gabor filter response reconstruction task alongside Rotation and DeepCluster within a shared encoder framework, followed by supervised fine-tuning on source domains. Through PACS and VLCS benchmarks, multi-task SSL shows strong transfer to unseen domains, especially under large domain shifts, and yields better object localization than supervised baselines. The work also shows that SSL features can synergize with methods like IRM, highlighting SSL as a robust foundation for domain generalization and cross-domain transfer with limited labeled data.

Abstract

Self-supervised feature representations have been shown to be useful for supervised classification, few-shot learning, and adversarial robustness. We show that features obtained using self-supervised learning are comparable to, or better than, supervised learning for domain generalization in computer vision. We introduce a new self-supervised pretext task of predicting responses to Gabor filter banks and demonstrate that multi-task learning of compatible pretext tasks improves domain generalization performance as compared to training individual tasks alone. Features learnt through self-supervision obtain better generalization to unseen domains when compared to their supervised counterpart when there is a larger domain shift between training and test distributions and even show better localization ability for objects of interest. Self-supervised feature representations can also be combined with other domain generalization methods to further boost performance.

Improving out-of-distribution generalization via multi-task self-supervised pretraining

TL;DR

Abstract

Improving out-of-distribution generalization via multi-task self-supervised pretraining

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)