Streaming data recovery via Bayesian tensor train decomposition
Yunyu Huang, Yani Feng, Qifeng Liao
TL;DR
We address streaming data recovery for high-order tensors by learning a Bayesian Tensor Train (TT) decomposition, where a tensor element $x_{\boldsymbol{j}}$ is approximated as $x_{\boldsymbol{j}} \approx \prod_{d=1}^D \mathscr{G}^{(d)}_{j_d}$ with TT-cores and TT-ranks. The method SPTT uses streaming variational Bayes to update the posterior over TT-cores and the noise precision online, given batches $B_t$ without revisiting past data. A Gaussian prior on TT-cores, a Gamma prior on the noise precision, and closed-form variational updates yield efficient per-batch updates with time $\mathcal{O}(S D L^4)$ and space $\mathcal{O}(N D L^2)$, enabling uncertainty quantification in streaming settings. Empirical results on synthetic and real-world datasets show SPTT outperforms existing Bayesian streaming methods and static TT/Tucker/CP baselines in reconstruction accuracy and predictive performance, demonstrating effective online, uncertainty-aware recovery for high-order streaming data.
Abstract
In this paper, we study a Bayesian tensor train (TT) decomposition method to recover streaming data by approximating the latent structure in high-order streaming data. Drawing on the streaming variational Bayes method, we introduce the TT format into Bayesian tensor decomposition methods for streaming data, and formulate posteriors of TT cores. Thanks to the Bayesian framework of the TT format, the proposed algorithm (SPTT) excels in recovering streaming data with high-order, incomplete, and noisy properties. The experiments in synthetic and real-world datasets show the accuracy of our method compared to state-of-the-art Bayesian tensor decomposition methods for streaming data.
