Variational Recurrent Auto-Encoders
Otto Fabius, Joost R. van Amersfoort
TL;DR
This work introduces the Variational Recurrent Auto-Encoder (VRAE), which fuses RNNs with Stochastic Gradient Variational Bayes to learn latent representations of time-series data in an unsupervised, scalable manner and to generate sequence data. The encoder maps sequences to a Gaussian latent distribution, while the decoder reconstructs data from samples drawn via a reparameterized latent variable, enabling efficient gradient-based training. Experiments on MIDI-like data demonstrate both low-dimensional and high-dimensional latent spaces, revealing structured latent organization and the ability to interpolate and generate longer sequences. The authors also argue that VRAE provides useful initializations for standard RNNs, potentially improving training stability and performance for supervised tasks on sequential data.
Abstract
In this paper we propose a model that combines the strengths of RNNs and SGVB: the Variational Recurrent Auto-Encoder (VRAE). Such a model can be used for efficient, large scale unsupervised learning on time series data, mapping the time series data to a latent vector representation. The model is generative, such that data can be generated from samples of the latent space. An important contribution of this work is that the model can make use of unlabeled data in order to facilitate supervised training of RNNs by initialising the weights and network state.
