Kronecker sum covariance models for spatio-temporal data

Shuheng Zhou; Seyoung Park; Kerby Shedden

Kronecker sum covariance models for spatio-temporal data

Shuheng Zhou, Seyoung Park, Kerby Shedden

TL;DR

This work develops graphical-model-style inference for matrix-variate data with measurement errors by introducing a non-separable Kronecker sum covariance model $A \oplus B = A \otimes I_n + I_m \otimes B$ and observing $X = X_0 + W$, with ${\rm vec}(X) \sim N(0, A \oplus B)$. Identifiability is achieved by fixing ${\rm tr}(A)$ and estimating inverse covariances $\Theta = A^{-1}$ and $\Phi = B^{-1}$ via a corrected Lasso in a network of nodewise regressions, without assuming i.i.d. or Gaussian data. The paper establishes convergence rates under subgaussian assumptions and Restricted Eigenvalue-type conditions, and provides a PSD-correction option, situating the KS approach relative to Kronecker-product models. Practically, the framework enables reliable inference of two-way dependencies in high-dimensional matrix data with measurement errors across domains such as neuroscience and genetics.

Abstract

In this paper, we study the subgaussian matrix variate model, where we observe the matrix variate data $X$ which consists of a signal matrix $X_0$ and a noise matrix $W$. More specifically, we study a subgaussian model using the Kronecker sum covariance as in Rudelson and Zhou (2017). Let $Z_1, Z_2$ be independent copies of a subgaussian random matrix $Z =(Z_{ij})$, where $Z_{ij}, \forall i, j$ are independent mean 0, unit variance, subgaussian random variables with bounded $ψ_2$ norm. We use $X \sim \mathcal{M}_{n,m}(0, A \oplus B)$ to denote the subgaussian random matrix $X_{n \times m}$ which is generated using: $$ X = Z_1 A^{1/2} + B^{1/2} Z_2. $$ In this covariance model, the first component $A \otimes I_n$ describes the covariance of the signal $X_0 = Z_1 A^{1/2}$, which is an ${n \times m}$ random design matrix with independent subgaussian row vectors, and the other component $I_m \otimes B$ describes the covariance for the noise matrix $W =B^{1/2} Z_2$, which contains independent subgaussian column vectors $w^1, \ldots, w^m$, independent of $X_0$. This leads to a non-separable class of models for the observation $X$, which we denote by $X \sim \mathcal{M}_{n,m}(0, A \oplus B)$ throughout this paper. Our method on inverse covariance estimation corresponds to the proposal in Yuan (2010) and Loh and Wainwright (2012), only now dropping the i.i.d. or Gaussian assumptions. We present the statistical rates of convergence.

Kronecker sum covariance models for spatio-temporal data

TL;DR

This work develops graphical-model-style inference for matrix-variate data with measurement errors by introducing a non-separable Kronecker sum covariance model

and observing

, with

. Identifiability is achieved by fixing

and estimating inverse covariances

and

via a corrected Lasso in a network of nodewise regressions, without assuming i.i.d. or Gaussian data. The paper establishes convergence rates under subgaussian assumptions and Restricted Eigenvalue-type conditions, and provides a PSD-correction option, situating the KS approach relative to Kronecker-product models. Practically, the framework enables reliable inference of two-way dependencies in high-dimensional matrix data with measurement errors across domains such as neuroscience and genetics.

Abstract

In this paper, we study the subgaussian matrix variate model, where we observe the matrix variate data

which consists of a signal matrix

and a noise matrix

. More specifically, we study a subgaussian model using the Kronecker sum covariance as in Rudelson and Zhou (2017). Let

be independent copies of a subgaussian random matrix

, where

are independent mean 0, unit variance, subgaussian random variables with bounded

norm. We use

to denote the subgaussian random matrix

which is generated using:

In this covariance model, the first component

describes the covariance of the signal

, which is an

random design matrix with independent subgaussian row vectors, and the other component

describes the covariance for the noise matrix

, which contains independent subgaussian column vectors

, independent of

. This leads to a non-separable class of models for the observation

, which we denote by

throughout this paper. Our method on inverse covariance estimation corresponds to the proposal in Yuan (2010) and Loh and Wainwright (2012), only now dropping the i.i.d. or Gaussian assumptions. We present the statistical rates of convergence.

Kronecker sum covariance models for spatio-temporal data

TL;DR

Abstract

Kronecker sum covariance models for spatio-temporal data

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (16)