Table of Contents
Fetching ...

Near-Efficient and Non-Asymptotic Multiway Inference

Oscar López, Arvind Prasadan, Carlos Llosa-Vite, Richard B. Lehoucq, Daniel M. Dunlavy

TL;DR

This work develops a non-asymptotic, CRLB-based framework to benchmark tensor CP-based inference for Poisson count data. It shows that, in the rank-one CP setting, a shifted-Poisson regression estimator achieves near-efficiency with variance close to the CRLB up to absolute constants and logarithmic factors, while higher CP ranks exhibit a CRLB gap for factor estimation though full parametric inference remains nearly minimax optimal with favorable rank dependence. The authors derive non-asymptotic MSE and Fisher information bounds, provide a minimax lower bound, and validate the theory with numerical experiments, clarifying when CP-based multiway analysis is reliable in finite samples. The results offer practical non-asymptotic benchmarks for CP-based inference in high-dimensional tensor Poisson models and motivate future work on higher-rank efficient estimators and broader noisy tensor settings.

Abstract

We establish non-asymptotic efficiency guarantees for tensor decomposition-based inference in count data models. Under a Poisson framework, we consider two related goals: (i) parametric inference, the estimation of the full distributional parameter tensor, and (ii) multiway analysis, the recovery of its canonical polyadic (CP) decomposition factors. Our main result shows that in the rank-one setting, a rank-constrained maximum-likelihood estimator achieves multiway analysis with variance matching the Cramér-Rao Lower Bound (CRLB) up to absolute constants and logarithmic factors. This provides a general framework for studying "near-efficient" multiway estimators in finite-sample settings. For higher ranks, we illustrate that our multiway estimator may not attain the CRLB; nevertheless, CP-based parametric inference remains nearly minimax optimal, with error bounds that improve on prior work by offering more favorable dependence on the CP rank. Numerical experiments corroborate near-efficiency in the rank-one case and highlight the efficiency gap in higher-rank scenarios.

Near-Efficient and Non-Asymptotic Multiway Inference

TL;DR

This work develops a non-asymptotic, CRLB-based framework to benchmark tensor CP-based inference for Poisson count data. It shows that, in the rank-one CP setting, a shifted-Poisson regression estimator achieves near-efficiency with variance close to the CRLB up to absolute constants and logarithmic factors, while higher CP ranks exhibit a CRLB gap for factor estimation though full parametric inference remains nearly minimax optimal with favorable rank dependence. The authors derive non-asymptotic MSE and Fisher information bounds, provide a minimax lower bound, and validate the theory with numerical experiments, clarifying when CP-based multiway analysis is reliable in finite samples. The results offer practical non-asymptotic benchmarks for CP-based inference in high-dimensional tensor Poisson models and motivate future work on higher-rank efficient estimators and broader noisy tensor settings.

Abstract

We establish non-asymptotic efficiency guarantees for tensor decomposition-based inference in count data models. Under a Poisson framework, we consider two related goals: (i) parametric inference, the estimation of the full distributional parameter tensor, and (ii) multiway analysis, the recovery of its canonical polyadic (CP) decomposition factors. Our main result shows that in the rank-one setting, a rank-constrained maximum-likelihood estimator achieves multiway analysis with variance matching the Cramér-Rao Lower Bound (CRLB) up to absolute constants and logarithmic factors. This provides a general framework for studying "near-efficient" multiway estimators in finite-sample settings. For higher ranks, we illustrate that our multiway estimator may not attain the CRLB; nevertheless, CP-based parametric inference remains nearly minimax optimal, with error bounds that improve on prior work by offering more favorable dependence on the CP rank. Numerical experiments corroborate near-efficiency in the rank-one case and highlight the efficiency gap in higher-rank scenarios.

Paper Structure

This paper contains 23 sections, 19 theorems, 99 equations, 5 figures.

Key Result

Theorem 1

Consider an unbiased estimator $\hat{\boldsymbol \theta}(\boldsymbol x)$ of $\boldsymbol \theta$ (i.e., $\mathbb E\hat{\boldsymbol \theta}=\boldsymbol \theta$), where the PDF satisfies the following interchanging of integral and derivative Define the Fisher information matrix (FIM) as Then where is the covariance matrix, $A\preceq B$ means that $B-A$ is positive semi-definite (PSD), and the $\

Figures (5)

  • Figure 1: Rank-one experiments: dimension sizes ($I$) vs mean value of tr$\left(\mathcal{I}_{\boldsymbol \theta}^{\dagger}\right)$ and $\|\boldsymbol \theta-\widetilde{\boldsymbol \theta}\|_2^2$, with error bars. Left plot shows the $N=3$ dimensional case and right plot shows $N=4$.
  • Figure 2: Rank-2 experiments: dimension sizes ($I$) vs mean value of tr$\left(\mathcal{I}_{\boldsymbol \theta}^{\dagger}\right)$ and $\|\boldsymbol \theta-\widetilde{\boldsymbol \theta}\|_2^2$, with error bars. Left plot shows the $N=3$ dimensional case and right plot shows $N=4$.
  • Figure 3: Rank-5 experiments: dimension sizes ($I$) vs mean value of tr$\left(\mathcal{I}_{\boldsymbol \theta}^{\dagger}\right)$ and $\|\boldsymbol \theta-\widetilde{\boldsymbol \theta}\|_2^2$, with error bars. Left plot shows the $N=3$ dimensional case and right plot shows $N=4$.
  • Figure 4: $4^{\hbox{th}}$ order quantile plots: left plot shows the rank-2 case with median and a 0–90% quantile band and the right plot shows the rank-5 case with median and a 0–65% quantile band.
  • Figure :

Theorems & Definitions (30)

  • Theorem 1
  • Definition 1
  • Theorem 2
  • proof : Proof of Theorem \ref{['CRLBdist']}
  • Definition 2
  • Theorem 3: rank-one multiway analysis
  • Corollary 1
  • Theorem 4: Rank-$R$ parametric inference
  • Theorem 4: Rank-$R$ parametric inference
  • Theorem 5: Minimax lower bound
  • ...and 20 more