Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation

Zhao Song; Mingquan Ye; Junze Yin; Lichen Zhang

Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation

Zhao Song, Mingquan Ye, Junze Yin, Lichen Zhang

TL;DR

This paper tackles weighted low rank approximation, a problem known to be NP-hard, by developing an efficient alternating minimization framework that allows approximate updates. Central to the approach is a high-accuracy regression solver based on sketching (SRHT) that preconditions the regression problems and enables nearly linear per-iteration time, with the analysis demonstrating robustness to forward-approximate updates. The authors prove a main result: under standard assumptions on incoherence and spectral gaps, the algorithm converges in $O(\log(1/\epsilon))$ iterations and achieves a spectral-norm error of $O(\alpha^{-1} k\tau)\|W\circ N\| + \epsilon$, with time $\widetilde{O}((\|W\|_0\,k + nk^3)\log(1/\epsilon))$ per iteration. This bridges theory and practice by accommodating inexact solvers while preserving convergence guarantees, and it yields substantial speedups over previous exact-regression-based methods. The work has practical impact on problems like weighted matrix completion and related low-rank recovery tasks where sparsity in the weight and scalability are crucial.

Abstract

Weighted low rank approximation is a fundamental problem in numerical linear algebra, and it has many applications in machine learning. Given a matrix $M \in \mathbb{R}^{n \times n}$, a non-negative weight matrix $W \in \mathbb{R}_{\geq 0}^{n \times n}$, a parameter $k$, the goal is to output two matrices $X,Y\in \mathbb{R}^{n \times k}$ such that $\| W \circ (M - X Y^\top) \|_F$ is minimized, where $\circ$ denotes the Hadamard product. It naturally generalizes the well-studied low rank matrix completion problem. Such a problem is known to be NP-hard and even hard to approximate assuming the Exponential Time Hypothesis [GG11, RSW16]. Meanwhile, alternating minimization is a good heuristic solution for weighted low rank approximation. In particular, [LLR16] shows that, under mild assumptions, alternating minimization does provide provable guarantees. In this work, we develop an efficient and robust framework for alternating minimization that allows the alternating updates to be computed approximately. For weighted low rank approximation, this improves the runtime of [LLR16] from $\|W\|_0k^2$ to $\|W\|_0 k$ where $\|W\|_0$ denotes the number of nonzero entries of the weight matrix. At the heart of our framework is a high-accuracy multiple response regression solver together with a robust analysis of alternating minimization.

Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation

TL;DR

iterations and achieves a spectral-norm error of

, with time

per iteration. This bridges theory and practice by accommodating inexact solvers while preserving convergence guarantees, and it yields substantial speedups over previous exact-regression-based methods. The work has practical impact on problems like weighted matrix completion and related low-rank recovery tasks where sparsity in the weight and scalability are crucial.

Abstract

Weighted low rank approximation is a fundamental problem in numerical linear algebra, and it has many applications in machine learning. Given a matrix

, a non-negative weight matrix

, a parameter

, the goal is to output two matrices

such that

is minimized, where

denotes the Hadamard product. It naturally generalizes the well-studied low rank matrix completion problem. Such a problem is known to be NP-hard and even hard to approximate assuming the Exponential Time Hypothesis [GG11, RSW16]. Meanwhile, alternating minimization is a good heuristic solution for weighted low rank approximation. In particular, [LLR16] shows that, under mild assumptions, alternating minimization does provide provable guarantees. In this work, we develop an efficient and robust framework for alternating minimization that allows the alternating updates to be computed approximately. For weighted low rank approximation, this improves the runtime of [LLR16] from

where

denotes the number of nonzero entries of the weight matrix. At the heart of our framework is a high-accuracy multiple response regression solver together with a robust analysis of alternating minimization.

Paper Structure (54 sections, 32 theorems, 242 equations, 6 tables, 4 algorithms)

This paper contains 54 sections, 32 theorems, 242 equations, 6 tables, 4 algorithms.

Introduction
Roadmap.
Preliminary
Notation
Sketching
Background on Weighted Low Rank Approximation
Technique Overview
Our Results
Weighted Multiple Response Regression
Robustness Analysis for Approximate Update
Main Result
Comparisons with Recent Works
Conclusion
Roadmap.
Basic Definitions and Algebra Tools
...and 39 more sections

Key Result

Theorem 1.1

There is an algorithm (see Algorithm alg:main) that runs in $\widetilde{O}((\|W\|_0\cdot k+nk^3) \log(1/\epsilon))$ time and outputs a rank-$k$ matrix $\widetilde{M}$ such that where $\tau$ is the condition number of $M^*$ and $\widetilde{O}(\cdot)$ suppresses polylogarithmic factors in $n$ and $k$.

Theorems & Definitions (78)

Theorem 1.1: Informal version of Theorem \ref{['thm:main']}
Remark 1.2
Definition 2.1: SRHT ldfu13
Definition 2.2: Oblivious subspace embedding s06
Lemma 4.1
Lemma 4.2
Lemma 4.3
Definition 4.4
Lemma 4.5
Theorem 4.6: Formal version of Theorem \ref{['thm:main_informal']}
...and 68 more

Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation

TL;DR

Abstract

Efficient Alternating Minimization with Applications to Weighted Low Rank Approximation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (78)