Multiple Testing of Linear Forms for Noisy Matrix Completion

Wanteng Ma; Lilun Du; Dong Xia; Ming Yuan

Multiple Testing of Linear Forms for Noisy Matrix Completion

Wanteng Ma, Lilun Du, Dong Xia, Ming Yuan

TL;DR

This work tackles the problem of performing multiple hypothesis tests for linear forms in noisy, low-rank matrix completion. It introduces a novel test statistic built on three steps—gradient-descent initialization, bias correction, and a low-rank incoherence-aware projection—yielding sharp marginal and joint normality for $\langle M, T\rangle$ under $Y = \langle M, X\rangle + \xi$. To control FDR across many tests, the authors develop a data-splitting and symmetric aggregation strategy that leverages weak dependence among statistics, and further enhance performance via whitening and LASSO-based screening to handle stronger dependencies. They provide non-asymptotic FDR/power guarantees that scale with sample size and model parameters, and validate the framework with extensive simulations and real data (MovieLens and Rossmann datasets), demonstrating practical gains in reliable discovery for recommender-system settings. The approach offers a principled path toward uncertainty-aware, scalable inference in high-dimensional, missing-data matrix problems with broad applicability beyond recommender systems.

Abstract

Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff of and an intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these difficulties by introducing new statistics for individual tests with sharp asymptotics both marginally and jointly, and utilizing them to control the false discovery rate (FDR) via a data splitting and symmetric aggregation scheme. We show that valid FDR control can be achieved with guaranteed power under nearly optimal sample size requirements using the proposed methodology. Extensive numerical simulations and real data examples are also presented to further illustrate its practical merits.

Multiple Testing of Linear Forms for Noisy Matrix Completion

TL;DR

under

. To control FDR across many tests, the authors develop a data-splitting and symmetric aggregation strategy that leverages weak dependence among statistics, and further enhance performance via whitening and LASSO-based screening to handle stronger dependencies. They provide non-asymptotic FDR/power guarantees that scale with sample size and model parameters, and validate the framework with extensive simulations and real data (MovieLens and Rossmann datasets), demonstrating practical gains in reliable discovery for recommender-system settings. The approach offers a principled path toward uncertainty-aware, scalable inference in high-dimensional, missing-data matrix problems with broad applicability beyond recommender systems.

Abstract

Paper Structure (46 sections, 23 theorems, 261 equations, 10 figures, 2 tables, 3 algorithms)

This paper contains 46 sections, 23 theorems, 261 equations, 10 figures, 2 tables, 3 algorithms.

Introduction
An Approach for Individual Test
Multiple Tests
Symmetric Data Aggregation
Dependence among Test Statistics
Inference of a submatrix.
Inference of entrywise comparisons.
Inference of several user/feature groups.
Theoretical Guarantees
Whitening and Screening
Numerical Experiments
Simulation Studies
Variance of linear forms
Data aggregation under weak dependency
Whitening and screening
...and 31 more sections

Key Result

Theorem 1

Given $\widehat{M}$ from eq:lr-retraction with dimension ratio $\alpha_d=d_1/d_2$, suppose that the sample size and and SNR condition satisfies: for some constants $C_1,C_2>0$. Then there exists a constant $C_3>0$ such that for any $T \in \mathbb{R}^{d_1\times d_2}$ satisfying eq:alignment,

Figures (10)

Figure 1: The difference between empirical distribution functions and $\Phi(z)$. Here, we compare our $W_T$ with the former method xia2021statistical. We set the matrix with $d_1=d_2=\lambda_{\min}=400$, and $r=3$, and vary the number of random samples $n$ in noisy matrix completion.
Figure 2: FDR control & Power of different data aggregation schemes in blockwise matrix tests with $\alpha=0.1$. Here the signal is defined by $\mu$ in eq. (\ref{['eq:construct_H0']}).
Figure 3: FDR control & Power of different data aggregation schemes in row tests with $\alpha=0.1$. Here the signal is defined by $\mu$ in eq. (\ref{['eq:construct_H0']}).
Figure 4: ROC curve for different test statistics. Here the signal is defined by $\mu$ in eq. (\ref{['eq:construct_H0']}).
Figure 5: FDR control & Power of different data aggregation schemes for entry comparisons between rows with $\alpha=0.1$ when the noises are heavy-tailed distributed
...and 5 more figures

Theorems & Definitions (36)

Theorem 1
Theorem 2: Minimax optimal length of confidence interval
Theorem 3
Theorem 4
Theorem 5
Theorem 6
Theorem 7
Theorem 8
Proposition 1: Finite-sample guarantee of weak correlation after screening
Proposition 2: LASSO screening
...and 26 more

Multiple Testing of Linear Forms for Noisy Matrix Completion

TL;DR

Abstract

Multiple Testing of Linear Forms for Noisy Matrix Completion

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (36)