The AL$\ell_0$CORE Tensor Decomposition for Sparse Count Data
John Hood, Aaron Schein
TL;DR
AL$\\ell_0$CORE introduces a Tucker-like tensor decomposition with a sparsity budget on the core, enabling a rich latent structure while avoiding Tucker's exponential parameter growth. By allocating at most $Q$ non-zero core entries and sampling their locations during inference, the method scales with $||\\boldsymbol{\\Lambda}||_0$ and is well-suited for sparse count data under a Poisson likelihood. In experiments on large dynamic multilayer networks (e.g., TERRIER and ICEWS), AL$\\ell_0$CORE achieves predictive performance on par with or better than full Tucker at a fraction of the cost and reveals interpretable latent patterns. The work provides a complete Bayesian inference scheme with complete-conditionals and open-source code, demonstrating practical viability for exploring vast latent spaces without exponential blow-up.
Abstract
This paper introduces AL$\ell_0$CORE, a new form of probabilistic non-negative tensor decomposition. AL$\ell_0$CORE is a Tucker decomposition where the number of non-zero elements (i.e., the $\ell_0$-norm) of the core tensor is constrained to a preset value $Q$ much smaller than the size of the core. While the user dictates the total budget $Q$, the locations and values of the non-zero elements are latent variables and allocated across the core tensor during inference. AL$\ell_0$CORE -- i.e., $allo$cated $\ell_0$-$co$nstrained $core$-- thus enjoys both the computational tractability of CP decomposition and the qualitatively appealing latent structure of Tucker. In a suite of real-data experiments, we demonstrate that AL$\ell_0$CORE typically requires only tiny fractions (e.g.,~1%) of the full core to achieve the same results as full Tucker decomposition at only a correspondingly tiny fraction of the cost.
