Factor Augmented Tensor-on-Tensor Neural Networks

Guanhao Zhou; Yuefeng Han; Xiufan Yu

Factor Augmented Tensor-on-Tensor Neural Networks

Guanhao Zhou, Yuefeng Han, Xiufan Yu

TL;DR

The paper addresses tensor-on-tensor time series forecasting by proposing FATTNN, a hybrid framework that first uses a low-rank Tensor Factor Model (TFM) to extract a latent factor tensor ${\cal F}_t$ from covariates ${\cal X}_t$ via ${\cal X}_t = {\cal F}_t \times_1 A_1 \times_2 \cdots \times_K A_K + {\cal E}_t$, with loading matrices estimated by TIPUP (and refined by iTIPUP). It then applies a Tensor-on-Tensor Neural Network (TTNN) based on a Temporal Convolutional Network (TCN) to predict tensor responses ${\cal Y}_t$ from the estimated factors, enabling nonlinearity capture while preserving tensor structure. The approach delivers substantial prediction improvements and computational efficiency over flattening-based, linear, and several deep-learning baselines, across simulated and real datasets (FAO, NYC Taxi, and FMRI), though Conv-TT-LSTM may occasionally outperform in image-like FMRI tasks. Overall, FATTNN offers a scalable, accurate, and versatile framework for tensor-on-tensor forecasting by uniting tensor factorization with neural temporal modeling, with broad potential extensions to non-temporal data and higher-order tensor structures.

Abstract

This paper studies the prediction task of tensor-on-tensor regression in which both covariates and responses are multi-dimensional arrays (a.k.a., tensors) across time with arbitrary tensor order and data dimension. Existing methods either focused on linear models without accounting for possibly nonlinear relationships between covariates and responses, or directly employed black-box deep learning algorithms that failed to utilize the inherent tensor structure. In this work, we propose a Factor Augmented Tensor-on-Tensor Neural Network (FATTNN) that integrates tensor factor models into deep neural networks. We begin with summarizing and extracting useful predictive information (represented by the ``factor tensor'') from the complex structured tensor covariates, and then proceed with the prediction task using the estimated factor tensor as input of a temporal convolutional neural network. The proposed methods effectively handle nonlinearity between complex data structures, and improve over traditional statistical models and conventional deep learning approaches in both prediction accuracy and computational cost. By leveraging tensor factor models, our proposed methods exploit the underlying latent factor structure to enhance the prediction, and in the meantime, drastically reduce the data dimensionality that speeds up the computation. The empirical performances of our proposed methods are demonstrated via simulation studies and real-world applications to three public datasets. Numerical results show that our proposed algorithms achieve substantial increases in prediction accuracy and significant reductions in computational time compared to benchmark methods.

Factor Augmented Tensor-on-Tensor Neural Networks

TL;DR

The paper addresses tensor-on-tensor time series forecasting by proposing FATTNN, a hybrid framework that first uses a low-rank Tensor Factor Model (TFM) to extract a latent factor tensor

from covariates

via

, with loading matrices estimated by TIPUP (and refined by iTIPUP). It then applies a Tensor-on-Tensor Neural Network (TTNN) based on a Temporal Convolutional Network (TCN) to predict tensor responses

from the estimated factors, enabling nonlinearity capture while preserving tensor structure. The approach delivers substantial prediction improvements and computational efficiency over flattening-based, linear, and several deep-learning baselines, across simulated and real datasets (FAO, NYC Taxi, and FMRI), though Conv-TT-LSTM may occasionally outperform in image-like FMRI tasks. Overall, FATTNN offers a scalable, accurate, and versatile framework for tensor-on-tensor forecasting by uniting tensor factorization with neural temporal modeling, with broad potential extensions to non-temporal data and higher-order tensor structures.

Abstract

Paper Structure (29 sections, 8 theorems, 70 equations, 8 figures, 6 tables, 1 algorithm)

This paper contains 29 sections, 8 theorems, 70 equations, 8 figures, 6 tables, 1 algorithm.

Introduction
Related Work
Preliminaries
Methodology: Factor Augmented Tensor-on-Tensor Time Series Regression
Tensor Factor Model (TFM)
Tensor-on-Tensor Neural Network based TCN (TTNN)
Numerical Results
Simulation Studies
Real Data Examples
Summary & Discussion
Acknowledgments
Appendix A: Tensor Factor Models
Notation
Preliminary of matrix and tensor algebra
Technical lemmas
...and 14 more sections

Key Result

Proposition 1

Assume each elements of the idiosyncratic noise ${\cal E}_t$ in tfm are i.i.d. $N(0,1)$. The ranks $r_1,...,r_K$ are fixed. The factor process ${\cal F}_t$ is weakly stationary and its cross-outer-product process is ergodic in the sense of $\frac{1}{n}\sum_{t=1}^n {\cal F}_{t}\otimes{\cal F}_t \righ

Figures (8)

Figure 1: A graphical illustration of the proposed FATTNN. The input is the observed time series $\{ ({\cal X}_t,{\cal Y}_t) \}_{t=1}^{n}$ and the new covariates $\{ {\cal X}_t \}_{t=n+1}^{n+m}$. The output is the forecasted future tensor responses, denoted by $\{\widehat{{\cal Y}}_t\}_{t=n+1}^{n+m}$
Figure 2: Left Panel: Comparisons between ground-truth values and predicted Production of Treenuts (top row) and Yield of Oilcrops (bottom row) in South America using the crop data of 33 countries in East Asia, North America, and Europe; Right Panel: Comparisons between ground-truth values and predicted Area-harvested (top row) and Yield (bottom row) of citrus fruit in 26 selected countries in Europe using livestock data of the same 26 countries. Values are plotted on the log-transformed scale. For readability, numerical values from the figures are tabulated in Appendix B. In each panel, from left to right: Ground truth, FATTNN, and TCN.
Figure 3: Comparisons between ground-truth values and predicted pick-up and drop-off volumes using various methods. "District A to B" denotes the traffic volume that passengers were picked up in District A and dropped off in District B. The district numbering is assigned according to the Manhattan district map shown in Figure \ref{['fig:ManhattanMap']}. An enlarged figure and more detailed descriptions are in Appendix B. Left: Midtown Manhattan; Right: Downtown Manhattan.
Figure S.1: Comparisons between ground-truth values and predicted Production of Treenuts (top row) and Yield of Oilcrops (bottom row) in South America using the crop data of 33 countries in East Asia, North America, and Europe. Values are plotted on the log-transformed scale. From left to right: Ground truth, FATTNN, TCN, LSTM, TRL, and Conv-TT-LSTM.
Figure S.2: Comparisons between ground-truth values and predicted Area-harvested (top row) and Yield (bottom row) of citrus fruit in 26 selected countries in Europe using the livestock data of the same 26 countries. Values are plotted on the log-transformed scale. From left to right: Ground truth, FATTNN, TCN, LSTM, TRL, and Conv-TT-LSTM.
...and 3 more figures

Theorems & Definitions (10)

Proposition 1
Lemma 1: Weyl's inequality
Lemma 2: Davis–Kahan $\sin\Theta$ theorem
Lemma 3: $\epsilon$-covering of matrix norms, han2020tensor
Lemma 4: Slepian’s inequality, vershynin2018high
Lemma 5: Sudakov-Fernique inequality, vershynin2018high
Lemma 6: Gaussian Concentration of Lipschitz functions, vershynin2018high
Lemma 7: Sigular values of Gaussian random matrices
proof
proof

Factor Augmented Tensor-on-Tensor Neural Networks

TL;DR

Abstract

Factor Augmented Tensor-on-Tensor Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (10)