Adversarial Dependence Minimization
Pierre-François De Plaen, Tinne Tuytelaars, Marc Proesmans, Luc Van Gool
TL;DR
The paper tackles nonlinear dependencies among learned embedding dimensions by introducing Adversarial Dependence Minimization (ADMin), a differentiable, scalable training objective where small dependency predictors try to reconstruct a dimension from the others while an encoder counteracts to minimize dependencies. Through a standardized formulation and a margin variant, ADMin converges to minimally dependent representations and supports practical extensions such as nonlinear PCA (PICA), improved classifier generalization, and prevention of dimensional collapse in self-supervised learning. Empirical results show convergence on TinyImageNet and ImageNet, reduced distance correlations, and improved generalization on synthetic and real-world tasks, though SSL on ImageNet indicates room for improvement relative to state-of-the-art methods. The approach provides a flexible regularizer that can be integrated into various learning paradigms to encourage richer, less redundant representations with potential benefits for generalization and robustness.
Abstract
Many machine learning techniques rely on minimizing the covariance between output feature dimensions to extract minimally redundant representations from data. However, these methods do not eliminate all dependencies/redundancies, as linearly uncorrelated variables can still exhibit nonlinear relationships. This work provides a differentiable and scalable algorithm for dependence minimization that goes beyond linear pairwise decorrelation. Our method employs an adversarial game where small networks identify dependencies among feature dimensions, while the encoder exploits this information to reduce dependencies. We provide empirical evidence of the algorithm's convergence and demonstrate its utility in three applications: extending PCA to nonlinear decorrelation, improving the generalization of image classification methods, and preventing dimensional collapse in self-supervised representation learning.
