Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

Sameer Ambekar; Zehao Xiao; Jiayi Shen; Xiantong Zhen; Cees G. M. Snoek

Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

Sameer Ambekar, Zehao Xiao, Jiayi Shen, Xiantong Zhen, Cees G. M. Snoek

TL;DR

This work tackles domain generalization under test-time shifts by leveraging unlabeled target data at inference. It introduces a probabilistic framework that treats pseudo labels as latent distributions and augments them with variational neighbor labels, coupled with a meta-generalization training loop to simulate target-domain adaptation. The key contributions are probabilistic pseudo-labeling, variational neighbor labels, and the meta-generalization stage, collectively enabling more robust and calibrated generalization to unseen targets. Across seven domain-generalization benchmarks, the method achieves competitive or state-of-the-art results, with notable gains in calibration and in challenging or small-target-batch scenarios, demonstrating practical effectiveness for test-time domain generalization.

Abstract

This paper strives for domain generalization, where models are trained exclusively on source domains before being deployed on unseen target domains. We follow the strict separation of source training and target testing, but exploit the value of the unlabeled target data itself during inference. We make three contributions. First, we propose probabilistic pseudo-labeling of target samples to generalize the source-trained model to the target domain at test time. We formulate the generalization at test time as a variational inference problem, by modeling pseudo labels as distributions, to consider the uncertainty during generalization and alleviate the misleading signal of inaccurate pseudo labels. Second, we learn variational neighbor labels that incorporate the information of neighboring target samples to generate more robust pseudo labels. Third, to learn the ability to incorporate more representative target information and generate more precise and robust variational neighbor labels, we introduce a meta-generalization stage during training to simulate the generalization procedure. Experiments on seven widely-used datasets demonstrate the benefits, abilities, and effectiveness of our proposal.

Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

TL;DR

Abstract

Paper Structure (28 sections, 17 equations, 6 figures, 12 tables, 2 algorithms)

This paper contains 28 sections, 17 equations, 6 figures, 12 tables, 2 algorithms.

Introduction
Related Work
Methodology
Probabilistic pseudo-labeling
Variational neighbor labels
Meta-generalization with variational neighbor labels.
Test-time generalization.
Experiments
Datasets and implementation details
Seven datasets.
Results
State-of-the-art comparisons.
Comparison on rotated MNIST and Fashion-MNIST.
Ablations
Benefit of method components.
...and 13 more sections

Figures (6)

Figure 1: Probabilistic modeling graph for test-time domain generalization. (a) Common test-time domain generalization algorithm obtains the target model $\boldsymbol{\theta}_{t}$ by self-learning of the unlabeled target data $\mathbf{x}_t$ on source-trained model $\boldsymbol{\theta}_{s}$iwasawa2021testjang2022test. (b) We introduce pseudo labels $p(\hat{\mathbf{y}}_t)$ as a latent variable to generate $p(\boldsymbol{\theta}_{t})$ for more robust generalization. (c) We further propose variational neighbor labels to incorporate neighboring information into the generation of pseudo labels, where latent variable $\mathbf{w}_t$ and $\hat{\mathbf{y}}_t$ follow Gaussian and categorical distributions. We introduce a meta-generalization stage during training to optimize our model.
Figure 2: Comparison on hard examples from xiao2022learningon PACS. Our proposal is more robust on samples with multiple objectives or complex scenes.
Figure 3: Calibration ability on PACS. Variational neighbor labels consistently have a lower Expected Calibration Error.
Figure 4: Generalization with varying batch sizes. Our method outperforms soft pseudo label on PACS, independent of batch size. Largest improvement for small batches.
Figure 5: Generalization along with inference. Our method achieves faster generalization with less prone to saturation.
...and 1 more figures

Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

TL;DR

Abstract

Probabilistic Test-Time Generalization by Variational Neighbor-Labeling

Authors

TL;DR

Abstract

Table of Contents

Figures (6)