Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning

Nidhin Harilal; Amit Kiran Rege; Reza Akbarian Bafghi; Maziar Raissi; Claire Monteleoni

Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning

Nidhin Harilal, Amit Kiran Rege, Reza Akbarian Bafghi, Maziar Raissi, Claire Monteleoni

TL;DR

Influence-SSL introduces a label-free data attribution framework for self-supervised learning by measuring how training points affect their own representations through augmentation-induced stability, grounded in an invariance-distinctiveness trade-off. The authors derive a theoretical foundation in a linear setting, showing that influence corresponds to augmentation sensitivity and representation changes, and validate the approach across SimCLR, BYOL, and Barlow Twins on CIFAR-10/100 and FairFace, demonstrating stability and practical utility. Key findings include that high-influence points can hinder SSL learning via background biases, and that removing such points can improve downstream performance, while influence analysis also enables detection of semantic duplicates and fairness-related biases. The work provides a data-centric tool for SSL analysis with implications for data curation, robustness, and fairness in large-scale representation learning.

Abstract

Self-supervised learning (SSL) has revolutionized learning from large-scale unlabeled datasets, yet the intrinsic relationship between pretraining data and the learned representations remains poorly understood. Traditional supervised learning benefits from gradient-based data attribution tools like influence functions that measure the contribution of an individual data point to model predictions. However, existing definitions of influence rely on labels, making them unsuitable for SSL settings. We address this gap by introducing Influence-SSL, a novel and label-free approach for defining influence functions tailored to SSL. Our method harnesses the stability of learned representations against data augmentations to identify training examples that help explain model predictions. We provide both theoretical foundations and empirical evidence to show the utility of Influence-SSL in analyzing pre-trained SSL models. Our analysis reveals notable differences in how SSL models respond to influential data compared to supervised models. Finally, we validate the effectiveness of Influence-SSL through applications in duplicate detection, outlier identification and fairness analysis. Code is available at: \url{https://github.com/cryptonymous9/Influence-SSL}.

Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning

TL;DR

Abstract

Where Did Your Model Learn That? Label-free Influence for Self-supervised Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (37)