Generative modeling through internal high-dimensional chaotic activity
Samantha J. Fournier, Pierfrancesco Urbani
TL;DR
Generative modeling aims to sample from a Boltzmann distribution with energy $E$, typically requiring gradient estimation on the Log-Likelihood ($LL$) that is hampered by slow MCMC mixing. The authors demonstrate that chaotic dynamics in high-dimensional recurrent networks, trained with contrastive Hebbian updates to learn a symmetric perturbation $A$ and biases $b$, can serve as autonomous generative noise. On MNIST and Fashion-MNIST, the 2- and 3-layer restricted models produce plausible samples and achieve competitive statistics via four measures $\mathcal{E}^{(2)}$, $\mathcal{E}^{(s)}$, $\mathcal{E}^{(R)}$, and $\mathcal{E}^{(AAI)}$, with training conducted completely without external noise. The work suggests a path toward biologically inspired generative models and motivates theoretical analyses with dynamical mean-field theory.
Abstract
Generative modeling aims at producing new datapoints whose statistical properties resemble the ones in a training dataset. In recent years, there has been a burst of machine learning techniques and settings that can achieve this goal with remarkable performances. In most of these settings, one uses the training dataset in conjunction with noise, which is added as a source of statistical variability and is essential for the generative task. Here, we explore the idea of using internal chaotic dynamics in high-dimensional chaotic systems as a way to generate new datapoints from a training dataset. We show that simple learning rules can achieve this goal within a set of vanilla architectures and characterize the quality of the generated datapoints through standard accuracy measures.
