Bidirectional Variational Autoencoders
Bart Kosko, Olaoluwa Adigun
TL;DR
Bidirectional Variational Autoencoders (BVAE) replace the conventional encoder–decoder pair with a single network that encodes in the forward direction and decodes in the backward direction via shared weights, trained with Bidirectional Backpropagation (B-BP). This yields the BELBO objective, where $\mathcal{L}_{BELBO}(x,\theta)=\mathbb{E}_{z|x,\theta}[\ln p(x|z,\theta)] - D_{KL}(q_f(z|x,\theta)\| p(z|\theta))$, providing a bound on $\ln p(x|\theta)$ and enabling joint optimization of directional likelihoods without a separate encoder network. Empirical results on MNIST, Fashion-MNIST, CIFAR-10, and CelebA-64 show that BVAE matches or modestly exceeds the performance of unidirectional VAEs while reducing parameter counts by about 50%, demonstrating that bidirectional inference with a single network can maintain generative quality and improve efficiency for image synthesis and compression tasks.
Abstract
We present the new bidirectional variational autoencoder (BVAE) network architecture. The BVAE uses a single neural network both to encode and decode instead of an encoder-decoder network pair. The network encodes in the forward direction and decodes in the backward direction through the same synaptic web. Simulations compared BVAEs and ordinary VAEs on the four image tasks of image reconstruction, classification, interpolation, and generation. The image datasets included MNIST handwritten digits, Fashion-MNIST, CIFAR-10, and CelebA-64 face images. The bidirectional structure of BVAEs cut the parameter count by almost 50% and still slightly outperformed the unidirectional VAEs.
