Whitening Consistently Improves Self-Supervised Learning
András Kalapos, Bálint Gyires-Tóth
TL;DR
This work investigates how ZCA whitening, applied as the final layer of the encoder, can universally improve self-supervised learning (SSL) representations across diverse methods and architectures. By implementing whitening via differentiable Iterative Normalization and evaluating with both linear and k-NN probes, the authors demonstrate consistent performance gains and faster convergence, with BYOL showing the strongest improvements and a notable case where whitening prevents collapse. In addition to empirical results, they introduce feature-space metrics—mean absolute feature correlation, mean feature standard deviation, and anisotropy—to analyze representation quality and collapse patterns. The findings suggest whitening is a practical, broadly applicable post-encoder enhancement for SSL that yields meaningful gains with modest compute overhead, supported by public code and a framework for interpretable feature analysis.
Abstract
Self-supervised learning (SSL) has been shown to be a powerful approach for learning visual representations. In this study, we propose incorporating ZCA whitening as the final layer of the encoder in self-supervised learning to enhance the quality of learned features by normalizing and decorrelating them. Although whitening has been utilized in SSL in previous works, its potential to universally improve any SSL model has not been explored. We demonstrate that adding whitening as the last layer of SSL pretrained encoders is independent of the self-supervised learning method and encoder architecture, thus it improves performance for a wide range of SSL methods across multiple encoder architectures and datasets. Our experiments show that whitening is capable of improving linear and k-NN probing accuracy by 1-5%. Additionally, we propose metrics that allow for a comprehensive analysis of the learned features, provide insights into the quality of the representations and help identify collapse patterns.
