Rank Reduction Autoencoders
Jad Mounayer, Sebastian Rodriguez, Chady Ghnatios, Charbel Farhat, Francisco Chinesta
TL;DR
Rank Reduction Autoencoders (RRAEs) regularize latent spaces by integrating a truncated SVD into the forward pass, making the bottleneck a function of the latent-space rank rather than a fixed dimension. The adaptive variant aRRAEs automatically discovers the optimal bottleneck during training by pruning the least important singular value and redistributing its influence, without adding extra learning hyperparameters. Empirical results on synthetic data, MNIST, Fashion-MNIST, and CelebA show that RRAEs and aRRAEs yield stable training, scalable bottlenecks, and latent spaces conducive to interpolation and realistic sample generation, outperforming vanilla and other regularizing autoencoders. The work demonstrates that a rank-based bottleneck can provide more meaningful representations and simpler deployment, with practical impact on unsupervised representation learning and generative tasks, especially where data manifolds are complex or high-dimensional.
Abstract
The choice of an appropriate bottleneck dimension and the application of effective regularization are both essential for Autoencoders to learn meaningful representations from unlabeled data. In this paper, we introduce a new class of deterministic autoencoders, Rank Reduction Autoencoders (RRAEs), which regularize their latent spaces by employing a truncated singular value decomposition (SVD) during training. In RRAEs, the bottleneck is defined by the rank of the latent matrix, thereby alleviating the dependence of the encoder/decoder architecture on the bottleneck size. This approach enabled us to propose an adaptive algorithm (aRRAEs) that efficiently determines the optimal bottleneck size during training. We empirically demonstrate that both RRAEs and aRRAEs are stable, scalable, and reliable, as they do not introduce any additional training hyperparameters. We evaluate our proposed architecture on a synthetic data set, as well as on MNIST, Fashion MNIST, and CelebA. Our results show that RRAEs offer several advantages over Vanilla AEs with both large and small latent spaces, and outperform other regularizing AE architectures.
