Prior distributions for structured semi-orthogonal matrices
Michael Jauch, Marie-Christine Düker, Peter Hoff
TL;DR
This work develops a Bayesian framework for priors on structured semi-orthogonal matrices $Q \in \mathcal{V}(k,p)$ by projecting an unconstrained matrix $X$ onto the Stiefel manifold and employing the MACG prior as a core component. It establishes invariance and Wasserstein-approximation results showing that structure embedded in $X$ transfers to $Q$, enabling explicit sparsity or smoothness through choices of $Z$ and a correlation matrix $\Omega$. The authors illustrate two applications—a sparse network eigenmodel for protein interactions and a smooth PCA for ocean oxygen data—demonstrating improved interpretability with competitive predictive performance. Computationally, posterior inference is achieved via parameter-expanded MCMC using polar expansion, aided by a scale-mixture representation of the shrinkage prior and MACG framework. This approach provides a principled, tractable way to incorporate domain-specific structure into semi-orthogonal matrix parameters across multivariate models.
Abstract
Statistical models for multivariate data often include a semi-orthogonal matrix parameter. In many applications, there is reason to expect that the semi-orthogonal matrix parameter satisfies a structural assumption such as sparsity or smoothness. From a Bayesian perspective, these structural assumptions should be incorporated into an analysis through the prior distribution. In this work, we introduce a general approach to constructing prior distributions for structured semi-orthogonal matrices that leads to tractable posterior inference via parameter-expanded Markov chain Monte Carlo. We draw on recent results from random matrix theory to establish a theoretical basis for the proposed approach. We then introduce specific prior distributions for incorporating sparsity or smoothness and illustrate their use through applications to biological and oceanographic data.
