Meta-TTT: A Meta-learning Minimax Framework For Test-Time Training

Chen Tao; Li Shen; Soumik Mondal

Meta-TTT: A Meta-learning Minimax Framework For Test-Time Training

Chen Tao, Li Shen, Soumik Mondal

TL;DR

This paper introduces a meta-learning minimax framework for test-time training on batch normalization (BN) layers, ensuring that the SSL task aligns with the primary task while addressing minibatch overfitting.

Abstract

Test-time domain adaptation is a challenging task that aims to adapt a pre-trained model to limited, unlabeled target data during inference. Current methods that rely on self-supervision and entropy minimization underperform when the self-supervised learning (SSL) task does not align well with the primary objective. Additionally, minimizing entropy can lead to suboptimal solutions when there is limited diversity within minibatches. This paper introduces a meta-learning minimax framework for test-time training on batch normalization (BN) layers, ensuring that the SSL task aligns with the primary task while addressing minibatch overfitting. We adopt a mixed-BN approach that interpolates current test batch statistics with the statistics from source domains and propose a stochastic domain synthesizing method to improve model generalization and robustness to domain shifts. Extensive experiments demonstrate that our method surpasses state-of-the-art techniques across various domain adaptation and generalization benchmarks, significantly enhancing the pre-trained model's robustness on unseen domains.

Meta-TTT: A Meta-learning Minimax Framework For Test-Time Training

TL;DR

Abstract

Paper Structure (37 sections, 9 equations, 1 figure, 8 tables, 1 algorithm)

This paper contains 37 sections, 9 equations, 1 figure, 8 tables, 1 algorithm.

Introduction
Related Work
Unsupervised Domain Adaptation
Adversarial optimization of domain divergence
Domain Generalization
Meta-learning For Domain Generalization
Test-Time Adaptation
Test-Time Training
Challenges In Test-Time Training
Our Method
Mixed-BN Adaptation
Meta-Learning for Test-time Training
Minimax Entropy Objective
Theoretical Insights
Meta-learning framework
...and 22 more sections

Figures (1)

Figure 1: Training curves comparing minimax entropy and traditional entropy (ERM) on Gaussian noise corruption at the highest severity on CIFAR10-C

Meta-TTT: A Meta-learning Minimax Framework For Test-Time Training

TL;DR

Abstract

Meta-TTT: A Meta-learning Minimax Framework For Test-Time Training

Authors

TL;DR

Abstract

Table of Contents

Figures (1)