Joint Majorization-Minimization for Nonnegative CP and Tucker Decompositions under $β$-Divergences: Unfolding-Free Updates
Valentin Leplat
TL;DR
This paper addresses efficient, unfolding-free optimization for nonnegative CP and Tucker tensor decompositions under the β-divergence losses. The authors derive contraction-based majorization-minimization updates that express all numerators and denominators as tensor contractions, enabling direct einsum-style implementations without explicit unfoldings. Their main advance is a joint majorization strategy that builds a single surrogate at a reference point and reduces it via cheap inner updates while reusing cached reference quantities, yielding substantial practical speedups. They prove the majorizers are tight, establish monotonic descent of the objective, and demonstrate convergence of the objective values, with BSUM-based arguments discussed for limit points. Extensive experiments on synthetic data and a real Uber pickups tensor show significant wall-clock-time gains over unfolding-based baselines and competitive performance against an einsum-factorization framework, highlighting the approach’s scalability and potential for large-scale, nonnegative multilinear factorization.
Abstract
We study majorization-minimization methods for nonnegative tensor decompositions under the $β$-divergence family, focusing on nonnegative CP and Tucker models. Our aim is to avoid explicit mode unfoldings and large auxiliary matrices by deriving separable surrogates whose multiplicative updates can be implemented using only tensor contractions (einsum-style operations). We present both classical block-MM updates in contraction-only form and a joint majorization strategy, inspired by joint MM for matrix $β$-NMF, that reuses cached reference quantities across inexpensive inner updates. We prove tightness of the proposed majorizers, establish monotonic decrease of the objective, and show convergence of the sequence of objective values; we also discuss how BSUM theory applies to the block-MM scheme for analyzing limit points. Finally, experiments on synthetic tensors and the Uber spatiotemporal count tensor demonstrate substantial speedups over unfolding-based baselines and a recent einsum-factorization framework.
