Bayesian Matrix Decomposition and Applications
Jun Lu
TL;DR
The work surveys Bayesian matrix decomposition (BMD) as a probabilistic framework for factorizing matrices while quantifying uncertainty and incorporating prior knowledge. It builds from foundational linear algebra (four fundamental subspaces and SVD) to Bayesian inference, covering Bayes’ theorem, model evidence, and approximate methods (Laplace, BIC, Occam’s razor) before detailing MC and variational approaches (MCMC, Gibbs, ARS, ELBO, VI, and amortized VI). It culminates with a repertoire of conjugate priors and standard Bayesian linear models to illustrate analytical tractability and model updating. The practical impact lies in providing a self-contained, rigorous primer that enables Bayesian matrix factorizations (e.g., real-valued, NMF, Bayesian interpolative decomposition) and informed model selection for tasks like matrix completion, denoising, and structured inference, while clearly indicating scope limits and directions for deeper Bayesian study.
Abstract
The sole aim of this book is to give a self-contained introduction to concepts and mathematical tools in Bayesian matrix decomposition in order to seamlessly introduce matrix decomposition techniques and their applications in subsequent sections. However, we clearly realize our inability to cover all the useful and interesting results concerning Bayesian matrix decomposition and given the paucity of scope to present this discussion, e.g., the separated analysis of variational inference for conducting the optimization. We refer the reader to literature in the field of Bayesian analysis for a more detailed introduction to the related fields. This book is primarily a summary of purpose, significance of important Bayesian matrix decomposition methods, e.g., real-valued decomposition, nonnegative matrix factorization, Bayesian interpolative decomposition, and the origin and complexity of the methods which shed light on their applications. The mathematical prerequisite is a first course in statistics and linear algebra. Other than this modest background, the development is self-contained, with rigorous proof provided throughout.
