Contrastive Factor Analysis
Zhibin Duan, Tiansheng Wen, Yifei Wang, Chen Zhu, Bo Chen, Mingyuan Zhou
TL;DR
This work identifies a gap between traditional factor analysis and modern contrastive learning (CL) and proposes Contrastive Factor Analysis (CFA) to merge their strengths by factorizing a normalized co-occurrence matrix with latent Gaussian factors. CFA is generalized to a non-negative version (CNFA) that uses Gamma latents to obtain disentangled representations, and both models use variational inference, with Gaussian and Weibull posteriors respectively, to learn posterior distributions efficiently. A CL-inspired reformulation yields a tractable objective suitable for gradient-based optimization, while explicit uncertainty measures are derived from posterior entropies. Across in-distribution and out-of-distribution tasks, CFA and CNFA demonstrate improved expressiveness, robustness, interpretability, and calibrated uncertainty estimation, highlighting their potential for robust unsupervised representation learning in the deep-learning era.
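To make the posterior machinery mentioned above more concrete, here is a minimal NumPy sketch of reparameterized sampling and closed-form entropy for Gaussian and Weibull distributions. It is an illustration under stated assumptions, not the authors' implementation: the function names, the parameter names (`mu`, `sigma`, `k`, `lam`), and the use of per-dimension entropy as an uncertainty score are all hypothetical choices made for this example.

```python
import numpy as np

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def sample_gaussian(mu, sigma, rng):
    """Reparameterized draw z = mu + sigma * eps, with eps ~ N(0, 1)."""
    eps = rng.standard_normal(np.shape(mu))
    return mu + sigma * eps


def gaussian_entropy(sigma):
    """Differential entropy of N(mu, sigma^2), per dimension."""
    return 0.5 * np.log(2.0 * np.pi * np.e * sigma ** 2)


def sample_weibull(k, lam, rng):
    """Reparameterized draw from Weibull(shape=k, scale=lam).

    Uses inverse-CDF sampling: z = lam * (-log(1 - u))^(1/k), u ~ U[0, 1).
    The draw is non-negative, matching a non-negative latent factor.
    """
    u = rng.uniform(size=np.shape(k))
    return lam * (-np.log1p(-u)) ** (1.0 / k)


def weibull_entropy(k, lam):
    """Differential entropy of Weibull(shape=k, scale=lam), per dimension."""
    return EULER_GAMMA * (1.0 - 1.0 / k) + np.log(lam / k) + 1.0


# Hypothetical usage: a 4-dimensional non-negative latent for one sample.
rng = np.random.default_rng(0)
k = np.full(4, 2.0)      # assumed posterior shape parameters
lam = np.full(4, 1.5)    # assumed posterior scale parameters
z = sample_weibull(k, lam, rng)
uncertainty = weibull_entropy(k, lam).sum()  # one possible entropy-based score
```

Because both samplers are deterministic transforms of parameter-free noise, gradients can flow through the draws to the posterior parameters in an autodiff framework; the closed-form entropies then give an uncertainty score without extra sampling.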
Abstract
Factor analysis, often regarded as a Bayesian variant of matrix factorization, offers superior capabilities in capturing uncertainty, modeling complex dependencies, and ensuring robustness. In the deep learning era, however, factor analysis has received less and less attention because of its limited expressive power. In contrast, contrastive learning has emerged as a potent technique with demonstrated efficacy in unsupervised representation learning. Although the two methods belong to different paradigms, recent theoretical analysis has revealed a mathematical equivalence between contrastive learning and matrix factorization, which suggests that factor analysis can be combined with contrastive learning. Motivated by the connections among contrastive learning, matrix factorization, and factor analysis, this paper introduces a novel Contrastive Factor Analysis framework that aims to bring the advantageous properties of factor analysis to contrastive learning. To further exploit the interpretability of non-negative factor analysis, which can learn disentangled representations, contrastive factor analysis is extended to a non-negative version. Finally, extensive experiments demonstrate the efficacy of the proposed contrastive (non-negative) factor analysis across multiple key properties, including expressiveness, robustness, interpretability, and accurate uncertainty estimation.
