Optimal Matrix-Mimetic Tensor Algebras via Variable Projection
Elizabeth Newman, Katherine Keegan
TL;DR
This work develops a matrix-mimetic tensor framework in which a learnable invertible transform $ extbf{M}$ governs the underlying tensor algebra via the $ extbackslash star_{ extbf{M}}$-product. By applying variable projection, they couple the transform with the desired tensor representation, enabling simultaneous learning of $ extbf{M}$ and the tensor factors, while preserving matrix-like optimality properties such as the Eckart–Young-like theorem for the $t$-SVDM. They establish invariance and uniqueness results, derive derivatives through the $t$-SVDM, and provide convergence guarantees for a Riemannian-gradient-based optimization on the orthogonal group. Through extensive experiments in financial index tracking, image compression, and reduced-order modeling, the learned transform $ extbf{M}^*$ consistently yields higher-quality, more compressible representations than heuristic transforms. The framework affords broad applicability, robust performance under noise, and a principled pathway to learn data-adaptive tensor algebras with practical computational trade-offs and publicly available code.
Abstract
Recent advances in {matrix-mimetic} tensor frameworks have made it possible to preserve linear algebraic properties for multilinear data analysis and, as a result, to obtain optimal representations of multiway data. Matrix mimeticity arises from interpreting tensors as operators that can be multiplied, factorized, and analyzed analogous to matrices. Underlying the tensor operation is an algebraic framework parameterized by an invertible linear transformation. The choice of linear mapping is crucial to representation quality and, in practice, is made heuristically based on expected correlations in the data. However, in many cases, these correlations are unknown and common heuristics lead to suboptimal performance. In this work, we simultaneously learn optimal linear mappings and corresponding tensor representations without relying on prior knowledge of the data. Our new framework explicitly captures the coupling between the transformation and representation using variable projection. We preserve the invertibility of the linear mapping by learning orthogonal transformations with Riemannian optimization. We provide original theory of uniqueness of the transformation and convergence analysis of our variable-projection-based algorithm. We demonstrate the generality of our framework through numerical experiments on a wide range of applications, including financial index tracking, image compression, and reduced order modeling. We have published all the code related to this work at https://github.com/elizabethnewman/star-M-opt.
