Table of Contents
Fetching ...

Hutchinson's Estimator is Bad at Kronecker-Trace-Estimation

Raphael A. Meyer, Haim Avron

TL;DR

The paper addresses trace estimation for PSD matrices accessible only through Kronecker-matrix-vector products, revealing that the natural Kronecker extension of Hutchinson’s estimator suffers exponential dependence on the Kronecker order $k$ in the worst case. By deriving an exact variance formula involving partial trace and partial transpose operators, the authors show that real-valued Kronecker queries incur a $3^k$-type growth, while complex-valued queries can reduce this to $2^k$, with further improvements for small base dimension $d$ and specialized distributions. They establish tight lower bounds and show that rank-one random matrices can be estimated much faster on average, and they demonstrate that exact trace recovery is possible for Kronecker-structured matrices with only $kd+1$ queries. The results illuminate the intrinsic limitations of the Kronecker-Hutchinson approach and suggest avenues for faster or hybrid algorithms that blend Hutchinson-type estimators with alternative techniques, potentially mitigating the exponential dependence on $k$ in practice. Overall, the work provides a rigorous, nuanced understanding of trace estimation in tensor-structured linear algebra settings and lays groundwork for improved methods when exploiting Kronecker structure.

Abstract

We study the problem of estimating the trace of a matrix $\mathbf{A}$ that can only be accessed through Kronecker-matrix-vector products. That is, for any Kronecker-structured vector $\mathrm{x} = \otimes_{i=1}^k \mathrm{x}_i$, we can compute $\mathbf{A}\mathrm{x}$. We focus on the natural generalization of Hutchinson's Estimator to this setting, proving tight rates for the number of matrix-vector products this estimator needs to find a $(1\pm\varepsilon)$ approximation to the trace of $\mathbf{A}$. We find an exact equation for the variance of the estimator when using a Kronecker of Gaussian vectors, revealing an intimate relationship between Hutchinson's Estimator, the partial trace operator, and the partial transpose operator. Using this equation, we show that when using real vectors, in the worst case, this estimator needs $O(\frac{3^k}{\varepsilon^2})$ products to recover a $(1\pm\varepsilon)$ approximation of the trace of any PSD $\mathbf{A}$, and a matching lower bound for certain PSD $\mathbf{A}$. However, when using complex vectors, this can be exponentially improved to $Θ(\frac{2^k}{\varepsilon^2})$. Further, if the $\mathrm{x}_i$ vectors are low-dimensional and if we instead build $\mathrm{x}$ as the Kronecker product of (scaled) random unit vectors on the complex sphere, then as few as $\frac{1.33^k}{\varepsilon^2}$ samples suffice. We show that Hutchinson's Estimator converges slowest when $\mathbf{A}$ itself also has Kronecker structure. We conclude with some theoretical evidence suggesting that, by combining Hutchinson's Estimator with other techniques, it may be possible to avoid the exponential dependence on $k$.

Hutchinson's Estimator is Bad at Kronecker-Trace-Estimation

TL;DR

The paper addresses trace estimation for PSD matrices accessible only through Kronecker-matrix-vector products, revealing that the natural Kronecker extension of Hutchinson’s estimator suffers exponential dependence on the Kronecker order in the worst case. By deriving an exact variance formula involving partial trace and partial transpose operators, the authors show that real-valued Kronecker queries incur a -type growth, while complex-valued queries can reduce this to , with further improvements for small base dimension and specialized distributions. They establish tight lower bounds and show that rank-one random matrices can be estimated much faster on average, and they demonstrate that exact trace recovery is possible for Kronecker-structured matrices with only queries. The results illuminate the intrinsic limitations of the Kronecker-Hutchinson approach and suggest avenues for faster or hybrid algorithms that blend Hutchinson-type estimators with alternative techniques, potentially mitigating the exponential dependence on in practice. Overall, the work provides a rigorous, nuanced understanding of trace estimation in tensor-structured linear algebra settings and lays groundwork for improved methods when exploiting Kronecker structure.

Abstract

We study the problem of estimating the trace of a matrix that can only be accessed through Kronecker-matrix-vector products. That is, for any Kronecker-structured vector , we can compute . We focus on the natural generalization of Hutchinson's Estimator to this setting, proving tight rates for the number of matrix-vector products this estimator needs to find a approximation to the trace of . We find an exact equation for the variance of the estimator when using a Kronecker of Gaussian vectors, revealing an intimate relationship between Hutchinson's Estimator, the partial trace operator, and the partial transpose operator. Using this equation, we show that when using real vectors, in the worst case, this estimator needs products to recover a approximation of the trace of any PSD , and a matching lower bound for certain PSD . However, when using complex vectors, this can be exponentially improved to . Further, if the vectors are low-dimensional and if we instead build as the Kronecker product of (scaled) random unit vectors on the complex sphere, then as few as samples suffice. We show that Hutchinson's Estimator converges slowest when itself also has Kronecker structure. We conclude with some theoretical evidence suggesting that, by combining Hutchinson's Estimator with other techniques, it may be possible to avoid the exponential dependence on .
Paper Structure (21 sections, 40 theorems, 120 equations, 1 table, 1 algorithm)

This paper contains 21 sections, 40 theorems, 120 equations, 1 table, 1 algorithm.

Key Result

Theorem 1

Fix $d,k\in{\mathbb N}\xspace$, and let $\mathbf{A}\xspace\in{\mathbb R}\xspace^{d^k \times d^k}$. Let $\boldsymbol{\mathrm{x}}\xspace = \boldsymbol{\mathrm{x}}\xspace_1 \otimes \cdots \otimes \boldsymbol{\mathrm{x}}\xspace_k$, where each $\boldsymbol{\mathrm{x}}\xspace_i$ is independent and is eith

Theorems & Definitions (72)

  • Theorem 1
  • Theorem 2
  • Corollary 3
  • Theorem 4
  • Theorem 5
  • Corollary 6
  • Theorem 7
  • Theorem 8
  • Theorem 9
  • Theorem 10
  • ...and 62 more