Beyond Pairwise Correlations: Higher-Order Redundancies in Self-Supervised Representation Learning

David Zollikofer; Béni Egressy; Frederik Benzing; Matthias Otth; Roger Wattenhofer

TL;DR

The paper argues that current self-supervised learning (SSL) methods mainly address pairwise redundancy and may overlook higher-order dependencies in the embedding space. It introduces a formal redundancy framework with three measures (AAC, LR, and NLR), establishes theoretical relations among them, and presents Self-Supervised Learning with Predictability Minimization (SSLPM), a predictor-based SSL method that minimizes redundancy. Experiments on CIFAR-10/100 and ImageNet-100 show that lower linear redundancy (LR) correlates with better downstream performance, whereas reducing higher-order redundancies has mixed or negative effects; SSLPM-RR performs competitively with state-of-the-art baselines. The findings highlight the projector's role in pruning redundancy and provide a framework for analyzing and guiding SSL design, with potential extensions to other modalities and redundancy metrics.

Abstract

Several self-supervised learning (SSL) approaches have shown that redundancy reduction in the feature embedding space is an effective tool for representation learning. However, these methods consider a narrow notion of redundancy, focusing on pairwise correlations between features. To address this limitation, we formalize the notion of embedding space redundancy and introduce redundancy measures that capture more complex, higher-order dependencies. We mathematically analyze the relationships between these metrics and empirically measure these redundancies in the embedding spaces of common SSL methods. Based on our findings, we propose Self-Supervised Learning with Predictability Minimization (SSLPM) as a method for reducing redundancy in the embedding space. SSLPM pairs an encoder network with a predictor in a competitive game: the encoder reduces dependencies between features, while the predictor exploits them. We demonstrate that SSLPM is competitive with state-of-the-art methods and find that the best-performing SSL methods exhibit low embedding space redundancy, suggesting that even methods without explicit redundancy reduction mechanisms perform redundancy reduction implicitly.
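
As a toy illustration of why pairwise correlations can miss higher-order dependencies, consider the minimal sketch below. It is not taken from the paper: the three "features" `z1`, `z2`, `z3` and the helper `pearson` are hypothetical names chosen for this example, and the construction (a product of two sign variables) is a standard textbook case rather than one of the paper's redundancy measures. Every pair of features is exactly uncorrelated, yet `z3` is fully determined by `z1` and `z2` together.

```python
from itertools import product
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical toy "embedding": z1 and z2 take every sign combination
# equally often, and z3 is their product (assumption for illustration only).
samples = [(a, b, a * b) for a, b in product((-1, 1), repeat=2)]
z1, z2, z3 = (list(col) for col in zip(*samples))

# Pairwise view: every pair of features is uncorrelated.
pairwise = {
    "z1,z2": pearson(z1, z2),
    "z1,z3": pearson(z1, z3),
    "z2,z3": pearson(z2, z3),
}

# Higher-order view: z3 is perfectly predictable from (z1, z2) jointly.
fully_predictable = all(c == a * b for a, b, c in samples)

print(pairwise)
print("z3 determined by (z1, z2):", fully_predictable)
```

A purely pairwise decorrelation objective would regard this embedding as redundancy-free, whereas a predictor that sees several features at once could still exploit the dependency.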
