Discovering Influential Factors in Variational Autoencoders

Shiqi Liu; Jingxin Liu; Qian Zhao; Xiangyong Cao; Huibin Li; Deyu Meng; Hongying Meng; Sheng Liu

TL;DR

The paper argues that the mutual information (MI) between the input and each latent factor of a VAE is a necessary indicator of that factor's influence, and shows that VAE objectives induce MI sparsity, leaving some non-influential factors whose contribution to reconstruction can be ignored. It introduces a consistent MI estimator to quantify I(X; Z_enc_h) and demonstrates, on MNIST, CelebA, and DEAP, that the factors with the highest MI are often interpretable and useful for generation and downstream classification, including factors related to emotion. The work connects MI theoretically to the lower bound of the reconstruction error and to classification error, and presents practical algorithms for estimating MI and identifying influential factors, enabling more efficient, interpretable latent representations across VAE variants.

Abstract

In machine learning, identifying and supervising a learned representation without manual intervention or the aid of intuition, so as to extract useful knowledge or serve downstream tasks, remains a critical issue. In this work, we focus on supervising the influential factors extracted by the variational autoencoder (VAE). The VAE is proposed to learn an independent low-dimensional representation, but it faces the problem that pre-set factors are sometimes ignored. We argue that the mutual information between the input and each learned factor of the representation serves as a necessary indicator for discovering the influential factors. We find that the VAE objective tends to induce mutual information sparsity in the factor dimension over the intrinsic dimension of the data, and therefore results in some non-influential factors whose role in data reconstruction can be ignored. We show that mutual information also influences the lower bound of the VAE's reconstruction error and the downstream classification task. To make this indicator applicable, we design an algorithm for calculating the mutual information for the VAE and prove its consistency. Experimental results on the MNIST, CelebA and DEAP datasets show that mutual information can help determine influential factors, some of which are interpretable and can be used for further generation and classification tasks, and can help discover the variant that connects with emotion on the DEAP dataset.
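To make the per-factor indicator concrete, the following is a minimal sketch of one common way to estimate the mutual information between the input and a single latent factor. It is not necessarily the algorithm proposed in the paper. It assumes a diagonal-Gaussian encoder q(z|x) and approximates the aggregated posterior of each factor by the mixture of the encoder distributions over a batch. The function names (`gaussian_log_pdf`, `estimate_factor_mi`), the array shapes, and the toy inputs are all illustrative assumptions.

```python
import numpy as np


def gaussian_log_pdf(z, mu, sigma):
    """Elementwise log-density of N(mu, sigma^2) evaluated at z."""
    return -0.5 * np.log(2.0 * np.pi) - np.log(sigma) - 0.5 * ((z - mu) / sigma) ** 2


def estimate_factor_mi(mu, sigma, rng, n_draws=1):
    """Monte Carlo estimate of I(X; Z_h) in nats for every factor h.

    Assumptions (not taken from the paper):
      * mu, sigma have shape (N, H): encoder means and standard deviations
        for N inputs and H latent factors;
      * the marginal q(z_h) is approximated by (1/N) * sum_j q(z_h | x_j).
    """
    n, h = mu.shape
    total = np.zeros(h)
    for _ in range(n_draws):
        # One sample z_i ~ q(z | x_i) per input.
        z = mu + sigma * rng.standard_normal((n, h))
        # log q(z_{i,h} | x_i), shape (N, H).
        log_cond = gaussian_log_pdf(z, mu, sigma)
        # log q(z_{i,h} | x_j) for all pairs (i, j), shape (N, N, H).
        log_pair = gaussian_log_pdf(z[:, None, :], mu[None, :, :], sigma[None, :, :])
        # log of the batch-mixture marginal, computed stably in log space.
        log_marg = np.logaddexp.reduce(log_pair, axis=1) - np.log(n)
        total += (log_cond - log_marg).mean(axis=0)
    return total / n_draws


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 200
    # Factor 0 depends strongly on the input; factor 1 ignores it (prior-like).
    mu = np.stack([np.linspace(-10.0, 10.0, n), np.zeros(n)], axis=1)
    sigma = np.stack([np.full(n, 0.1), np.ones(n)], axis=1)
    mi = estimate_factor_mi(mu, sigma, rng, n_draws=4)
    ranking = np.argsort(mi)[::-1]  # factors sorted by estimated MI, highest first
    print("estimated MI per factor (nats):", mi)
    print("factors ranked by MI:", ranking)
```

In this toy setting the factor whose encoder distribution is identical for every input receives an estimate of essentially zero, which is the behaviour expected of a non-influential factor. Note that this batch-mixture estimate cannot exceed log N, so larger batches are needed to resolve highly informative factors.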
