Table of Contents
Fetching ...

Shrinkage priors for circulant correlation structure models

Michiko Okudo, Tomonari Sei

TL;DR

This paper introduces the circulant correlation structure model for multivariate Gaussian data, where $\Sigma = D_\alpha R D_\alpha$ with $R = Q D_\lambda Q^*$ and a scale constraint $\prod_i \lambda_i = 1$. It develops shrinkage priors that damp the non-eigenvalue part of $\Sigma$ via a log-linear $\lambda(\theta)$ parametrization, and proves that Bayesian predictive densities based on these priors asymptotically outperform those based on Jeffreys priors under KL risk. The authors establish optimality results in both a full circulant model and an exchangeable submodel, showing that a uniform shrinkage prior $\pi_S$ is optimal within natural prior families in the high-$\theta$ limit, and that, in the exchangeable case, a family of shrinkage priors yields asymptotic dominance with an explicit optimal setting $\gamma = -1/4$. Numerical illustrations corroborate the theoretical gains, highlighting improved predictive performance due to shrinkage of the non-eigenvalue covariance structure. Overall, the work provides a principled, asymptotically optimal approach to shrinkage in circulant-correlation models with practical implications for multivariate prediction under structured covariance.

Abstract

We consider a new statistical model called the circulant correlation structure model, which is a multivariate Gaussian model with unknown covariance matrix and has a scale-invariance property. We construct shrinkage priors for the circulant correlation structure models and show that Bayesian predictive densities based on those priors asymptotically dominate Bayesian predictive densities based on Jeffreys priors under the Kullback-Leibler (KL) risk function. While shrinkage of eigenvalues of covariance matrices of Gaussian models has been successful, the proposed priors shrink a non-eigenvalue part of covariance matrices.

Shrinkage priors for circulant correlation structure models

TL;DR

This paper introduces the circulant correlation structure model for multivariate Gaussian data, where with and a scale constraint . It develops shrinkage priors that damp the non-eigenvalue part of via a log-linear parametrization, and proves that Bayesian predictive densities based on these priors asymptotically outperform those based on Jeffreys priors under KL risk. The authors establish optimality results in both a full circulant model and an exchangeable submodel, showing that a uniform shrinkage prior is optimal within natural prior families in the high- limit, and that, in the exchangeable case, a family of shrinkage priors yields asymptotic dominance with an explicit optimal setting . Numerical illustrations corroborate the theoretical gains, highlighting improved predictive performance due to shrinkage of the non-eigenvalue covariance structure. Overall, the work provides a principled, asymptotically optimal approach to shrinkage in circulant-correlation models with practical implications for multivariate prediction under structured covariance.

Abstract

We consider a new statistical model called the circulant correlation structure model, which is a multivariate Gaussian model with unknown covariance matrix and has a scale-invariance property. We construct shrinkage priors for the circulant correlation structure models and show that Bayesian predictive densities based on those priors asymptotically dominate Bayesian predictive densities based on Jeffreys priors under the Kullback-Leibler (KL) risk function. While shrinkage of eigenvalues of covariance matrices of Gaussian models has been successful, the proposed priors shrink a non-eigenvalue part of covariance matrices.

Paper Structure

This paper contains 6 sections, 7 theorems, 63 equations, 1 figure.

Key Result

Lemma 1.1

Under the restriction (eq:det-one), the parameters $\alpha$ and $\lambda$ are identifiable from $\Sigma$.

Figures (1)

  • Figure 1: KL risk difference ${E}[D(p(y;\theta,\beta);\hat{p}_J )]- {E}[D(p(y;\theta,\beta);\hat{p}_\gamma )]$ for $p=2,3,10$ and $\gamma=-1/2,-1/4,-1/100$

Theorems & Definitions (14)

  • Lemma 1.1
  • proof
  • Lemma 2.1
  • proof
  • Theorem 2.1
  • proof
  • Lemma 2.2
  • proof
  • Theorem 3.1
  • proof
  • ...and 4 more