Tensor Methods in High Dimensional Data Analysis: Opportunities and Challenges
Arnab Auddy, Dong Xia, Ming Yuan
TL;DR
This survey outlines how tensor methods unlock analysis of high-dimensional, multiway data across fields, highlighting the interpretability, identifiability, and robust inference benefits of preserving multilinear structure. It surveys core tools—CP and Tucker decompositions, plus algorithms like alternating minimization, HOOI, tensor power iterations, and gradient methods—and discusses strategies to address nonconvexity via spectral initialization and convex relaxations. The eight statistical settings (tensor SVD, multiway PCA, ICA, mixtures, tensor completion, tensor regression, higher-order networks, and tensor time series) reveal a pervasive theme: a tradeoff between statistical optimality and computational feasibility, with clear computational-statistical gaps in several problems. The paper emphasizes a cross-disciplinary mix of statistics, optimization, and numerical linear algebra, and highlights practical implications for applications in science and engineering where tensor-structured data are prevalent.
Abstract
Large amount of multidimensional data represented by multiway arrays or tensors are prevalent in modern applications across various fields such as chemometrics, genomics, physics, psychology, and signal processing. The structural complexity of such data provides vast new opportunities for modeling and analysis, but efficiently extracting information content from them, both statistically and computationally, presents unique and fundamental challenges. Addressing these challenges requires an interdisciplinary approach that brings together tools and insights from statistics, optimization and numerical linear algebra among other fields. Despite these hurdles, significant progress has been made in the last decade. This review seeks to examine some of the key advancements and identify common threads among them, under eight different statistical settings.
