Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

Andrei Jelea; Ahmed Nabil Belbachir; Marius Leordeanu

Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

Andrei Jelea, Ahmed Nabil Belbachir, Marius Leordeanu

TL;DR

The paper presents Generalized Test-Time Augmentation (GTTA), a PCA-subspace perturbation technique that generates diverse, data-distribution-consistent test samples via Gaussian noise and averages their model outputs to improve performance across vision and non-vision tasks. It provides theoretical guarantees showing that GTTA can reduce the initial error and increase transformation diversity, while also removing structured noise through subspace decorrelation. A key innovation is a self-supervised distillation stage where the GTTA ensemble teaches a single model on unlabeled data, achieving ensemble-like accuracy with a single forward pass. The approach is validated on varied tasks (classification, segmentation, regression, speech) and challenging domains (underwater fish segmentation with the DeepSalmon dataset), highlighting GTTA’s generality, uncertainty-informed weighting of pseudo-labels, and practical test-time efficiency.

Abstract

We introduce Generalized Test-Time Augmentation (GTTA), a highly effective method for improving the performance of a trained model, which unlike other existing Test-Time Augmentation approaches from the literature is general enough to be used off-the-shelf for many vision and non-vision tasks, such as classification, regression, image segmentation and object detection. By applying a new general data transformation, that randomly perturbs multiple times the PCA subspace projection of a test input, GTTA creates valid augmented samples from the data distribution with high diversity, properties we theoretically show that are essential for a Test-Time Augmentation method to be effective. Different from other existing methods, we also propose a final self-supervised learning stage in which the ensemble output, acting as an unsupervised teacher, is used to train the initial single student model, thus reducing significantly the test time computational cost. Our comparisons to strong TTA approaches and SoTA models on various vision and non-vision well-known datasets and tasks, such as image classification and segmentation, pneumonia detection, speech recognition and house price prediction, validate the generality of the proposed GTTA. Furthermore, we also prove its effectiveness on the more specific real-world task of salmon segmentation and detection in low-visibility underwater videos, for which we introduce DeepSalmon, the largest dataset of its kind in the literature.

Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

TL;DR

Abstract

Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (3)