Exponential Convergence of CAVI for Bayesian PCA

Arghya Datta; Philippe Gagnon; Florian Maire

Exponential Convergence of CAVI for Bayesian PCA

Arghya Datta, Philippe Gagnon, Florian Maire

TL;DR

A precise exponential convergence result is proved in the case where the model uses a single principal component (PC) and it is indicated that traditional PCA is retrieved as points estimates of the BPCA parameters.

Abstract

Probabilistic principal component analysis (PCA) and its Bayesian variant (BPCA) are widely used for dimension reduction in machine learning and statistics. The main advantage of probabilistic PCA over the traditional formulation is allowing uncertainty quantification. The parameters of BPCA are typically learned using mean-field variational inference, and in particular, the coordinate ascent variational inference (CAVI) algorithm. So far, the convergence speed of CAVI for BPCA has not been characterized. In our paper, we fill this gap in the literature. Firstly, we prove a precise exponential convergence result in the case where the model uses a single principal component (PC). Interestingly, this result is established through a connection with the classical $\textit{power iteration algorithm}$ and it indicates that traditional PCA is retrieved as points estimates of the BPCA parameters. Secondly, we leverage recent tools to prove exponential convergence of CAVI for the model with any number of PCs, thus leading to a more general result, but one that is of a slightly different flavor. To prove the latter result, we additionally needed to introduce a novel lower bound for the symmetric Kullback--Leibler divergence between two multivariate normal distributions, which, we believe, is of independent interest in information theory.

Exponential Convergence of CAVI for Bayesian PCA

TL;DR

Abstract

Exponential Convergence of CAVI for Bayesian PCA

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (43)