Tensor Deli: Tensor Completion for Low CP-Rank Tensors via Random Sampling

Cullen Haselby; Mark Iwen; Santhosh Karnik; Rongrong Wang

Tensor Deli: Tensor Completion for Low CP-Rank Tensors via Random Sampling

Cullen Haselby, Mark Iwen, Santhosh Karnik, Rongrong Wang

TL;DR

Two provably accurate methods for low CP-rank tensor completion are proposed - one using adaptive sampling and one using nonadaptive sampling that work well on noisy synthetic data as well as on real world data.

Abstract

We propose two provably accurate methods for low CP-rank tensor completion - one using adaptive sampling and one using nonadaptive sampling. Both of our algorithms combine matrix completion techniques for a small number of slices along with Jennrich's algorithm to learn the factors corresponding to the first two modes, and then solve systems of linear equations to learn the factors corresponding to the remaining modes. For order-$3$ tensors, our algorithms follow a "sandwich" sampling strategy that more densely samples a few outer slices (the bread), and then more sparsely samples additional inner slices (the bbq-braised tofu) for the final completion. For an order-$d$, CP-rank $r$ tensor of size $n \times \cdots \times n$ that satisfies mild assumptions, our adaptive sampling algorithm recovers the CP-decomposition with high probability while using at most $O(nr\log r + dnr)$ samples and $O(n^2r^2+dnr^2)$ operations. Our nonadaptive sampling algorithm recovers the CP-decomposition with high probability while using at most $O(dnr^2\log n + nr\log^2 n)$ samples and runs in polynomial time. Numerical experiments demonstrate that both of our methods work well on noisy synthetic data as well as on real world data.

Tensor Deli: Tensor Completion for Low CP-Rank Tensors via Random Sampling

TL;DR

Abstract

tensors, our algorithms follow a "sandwich" sampling strategy that more densely samples a few outer slices (the bread), and then more sparsely samples additional inner slices (the bbq-braised tofu) for the final completion. For an order-

, CP-rank

tensor of size

that satisfies mild assumptions, our adaptive sampling algorithm recovers the CP-decomposition with high probability while using at most

samples and

operations. Our nonadaptive sampling algorithm recovers the CP-decomposition with high probability while using at most

samples and runs in polynomial time. Numerical experiments demonstrate that both of our methods work well on noisy synthetic data as well as on real world data.

Paper Structure (35 sections, 8 theorems, 40 equations, 8 figures, 7 algorithms)

This paper contains 35 sections, 8 theorems, 40 equations, 8 figures, 7 algorithms.

Introduction
Notation and Preliminaries
Prior Results
Low-Rank Matrix Completion
Low CP-Rank Tensor Completion
Jennrich's Algorithm
Main Contributions
Order-3 Tensors
Main Results
General Results
Adaptive Tensor Sandwich
Nonadaptive Independent Sampling Tensor Sandwich
Numerical Experiments
Simulated Data
Real Data from Applications
...and 20 more sections

Key Result

Theorem 1

Suppose that the factor matrices satisfy the following assumptions Then, with probability at least $1-s\delta$, Algorithm alg:3mode_adapt completes $\mathcal{T}$ and uses at most samples.

Figures (8)

Figure 1: Illustration of the sampling patterns of our adaptive tensor sandwich (left) and nonadaptive tensor sandwich (right) algorithms
Figure 2: Median relative error (log-scaled) of completed four mode tensors of varying rank as sample complexity increases without noise. Each value is the median of ten trials, $n=100$, $d=4$, $s=2, \gamma \in [0.1,0.8], \delta=8$. We compare the relative errors of adaptive Tensor Deli before and after ten iterations of masked-ALS, as well as just masked ALS alone.
Figure 3: Median relative error (log-scaled) of completed four mode tensors of varying rank as sample complexity increases without noise. Each value is the median of ten trials, $n=100$, $d=4$, $s=2,\gamma \in [0.1,0.8], \delta=8$. We show errors for both adaptive and non-adaptive TD sampling schemes.
Figure 4: Comparison of runtime, accuracy and sample complexity for different sized three mode tensors, $n\in[50,120]$, $d=3$, $r=0.1n$. Tensor Deli utilizes KS to complete two densely sampled slices and also performs ten iterations of ALS. This is compared with KS alone used to complete the entire tensor. For this simulated data, Tensor Deli is able to achieve faster runtimes, better accuracy, and at a lower sample complexity than KS. Here we average errors, runtime, and utilized samples over ten independent trials for each choice of $n$.
Figure 5: Median relative error (log-scaled) of completed three mode tensors of varying rank with additive noise for adaptive Tensor Deli with adaptive sampling method, and KS. Each value is the median of ten trials, for Tensor Deli $n=50$, $d=3$, $s=2, \delta=8$, for KS the proportion of entries that can be sampled for a fiber is 0.7.
...and 3 more figures

Theorems & Definitions (10)

Theorem 1
Theorem 2
Theorem 3
Theorem 4
Theorem 5
Theorem 6
Lemma 1
proof
Lemma 2
proof

Tensor Deli: Tensor Completion for Low CP-Rank Tensors via Random Sampling

TL;DR

Abstract

Tensor Deli: Tensor Completion for Low CP-Rank Tensors via Random Sampling

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (10)