Radial-VCReg: More Informative Representation Learning Through Radial Gaussianization
Yilun Kuang, Yash Dagade, Deep Chakraborty, Erik Learned-Miller, Randall Balestriero, Tim G. J. Rudner, Yann LeCun
TL;DR
Radial-VCReg tackles the challenge of maximizing information in self-supervised learning by moving beyond VCReg’s linear dependency regularization. It introduces a radial Gaussianization loss that aligns the feature radius with a Chi distribution, and integrates it into Radial-VICReg to widen the class of distributions that can be Gaussianized. The authors provide theoretical guarantees showing Radial-VCReg strictly expands Gaussianizable distributions compared to VCReg, and they demonstrate consistent empirical gains on synthetic data and real-world image datasets like CIFAR-100, ImageNet-10, and CelebA. This approach offers a principled way to reduce higher-order dependencies and produce more diverse, informative representations with practical implications for downstream tasks.
Abstract
Self-supervised learning aims to learn maximally informative representations, but explicit information maximization is hindered by the curse of dimensionality. Existing methods like VCReg address this by regularizing first and second-order feature statistics, which cannot fully achieve maximum entropy. We propose Radial-VCReg, which augments VCReg with a radial Gaussianization loss that aligns feature norms with the Chi distribution-a defining property of high-dimensional Gaussians. We prove that Radial-VCReg transforms a broader class of distributions towards normality compared to VCReg and show on synthetic and real-world datasets that it consistently improves performance by reducing higher-order dependencies and promoting more diverse and informative representations.
