Table of Contents
Fetching ...

Domain Generalization In Robust Invariant Representation

Gauri Gupta, Ritvik Kapila, Keshav Gupta, Ramesh Raskar

TL;DR

The paper addresses domain generalization of invariances by modeling identity-preserving transformations as a group action of $G$ on $X$ with orbits $O_x$ and invariant latent $z \in Z = X/G$. It introduces RotInvVAE, which yields an invariant code $z$ and an estimated group action, enabling reconstruction via the decoder; pair verification relies on cosine similarity between $z$ codes. Across unseen domains such as MNIST, FashionMNIST, and LFW, RotInvVAE maintains rotation-invariant latent representations and demonstrates strong generalization, including cross-domain transfers and reduced-label scenarios. The work argues for resource-efficient, domain-general invariance in recognition tasks, offering a first demonstration that invariance can robustly transfer to new domains without retraining.

Abstract

Unsupervised approaches for learning representations invariant to common transformations are used quite often for object recognition. Learning invariances makes models more robust and practical to use in real-world scenarios. Since data transformations that do not change the intrinsic properties of the object cause the majority of the complexity in recognition tasks, models that are invariant to these transformations help reduce the amount of training data required. This further increases the model's efficiency and simplifies training. In this paper, we investigate the generalization of invariant representations on out-of-distribution data and try to answer the question: Do model representations invariant to some transformations in a particular seen domain also remain invariant in previously unseen domains? Through extensive experiments, we demonstrate that the invariant model learns unstructured latent representations that are robust to distribution shifts, thus making invariance a desirable property for training in resource-constrained settings.

Domain Generalization In Robust Invariant Representation

TL;DR

The paper addresses domain generalization of invariances by modeling identity-preserving transformations as a group action of on with orbits and invariant latent . It introduces RotInvVAE, which yields an invariant code and an estimated group action, enabling reconstruction via the decoder; pair verification relies on cosine similarity between codes. Across unseen domains such as MNIST, FashionMNIST, and LFW, RotInvVAE maintains rotation-invariant latent representations and demonstrates strong generalization, including cross-domain transfers and reduced-label scenarios. The work argues for resource-efficient, domain-general invariance in recognition tasks, offering a first demonstration that invariance can robustly transfer to new domains without retraining.

Abstract

Unsupervised approaches for learning representations invariant to common transformations are used quite often for object recognition. Learning invariances makes models more robust and practical to use in real-world scenarios. Since data transformations that do not change the intrinsic properties of the object cause the majority of the complexity in recognition tasks, models that are invariant to these transformations help reduce the amount of training data required. This further increases the model's efficiency and simplifies training. In this paper, we investigate the generalization of invariant representations on out-of-distribution data and try to answer the question: Do model representations invariant to some transformations in a particular seen domain also remain invariant in previously unseen domains? Through extensive experiments, we demonstrate that the invariant model learns unstructured latent representations that are robust to distribution shifts, thus making invariance a desirable property for training in resource-constrained settings.
Paper Structure (8 sections, 5 figures)

This paper contains 8 sections, 5 figures.

Figures (5)

  • Figure 1: Evaluation framework for pairwise matching
  • Figure 2: Analysis for MNIST and FashionMNIST is shown on the left and right respectively. $X_1$: Training domain, $X_2$: Testing domain (Top) ROC curves for out-of-distribution domain generalization with ${X_1}$ = rotated MNIST/FashionMNIST 0-4, ${X_2}$ = rotated MNIST/FashionMNIST 5-9 (Bottom) Area under the ROC curve (AUC) for verification task as we vary the number of the classes in ${X_1}$ where remaining classes form $X_2$
  • Figure 3: ROC curves for (Left) Face verification on unseen LFW dataset with $X_1, X_2$ described in Section \ref{['impl_details']} (Right) Generalization on completely different OOD data (1) blue, orange - $X_1$ = MNIST, $X_2$ = FashionMNIST (2) green, red - $X_1$ = FashionMNIST, $X_2$ = MNIST
  • Figure 4: Visualization of latent space of (Top Left) Vanilla VAE on $X_1$ = rotated MNIST 0-4 (Top Right) RotInvVAE on $X_1$ = rotated MNIST 0-4 (Bottom Left) Vanilla VAE on the unseen $X_2$ = rotated MNIST 5-9 (Bottom Right) RotInvVAE on the unseen $X_2$ = rotated MNIST 5-9
  • Figure 5: Visualization of latent space of (Top Left) Vanilla VAE on $X_1$ = rotated FashionMNIST 0-4 (Top Right) RotInvVAE on $X_1$ = rotated FashionMNIST 0-4 (Bottom Left) Vanilla VAE on the unseen $X_2$ = rotated FashionMNIST 5-9 (Bottom Right) RotInvVAE on the unseen $X_2$ = rotated FashionMNIST 5-9