Table of Contents
Fetching ...

Superfast iterative refinement of low rank approximation of a matrix based on ALS method and random sampling

Qi Luan, Victor Y. Pan

TL;DR

This work addresses the problem of computing low-rank approximations for massive matrices at sublinear cost by combining ALS-based iterative refinement with random sampling. It develops both deterministic and randomized (superfast) iterative refinement frameworks that progressively improve an initial CUR LRA toward a near-optimal solution, with convergence guarantees expressed in terms of principal-angle distances and under specific spectral-gap assumptions. A key contribution is the integration of leverage-score-based sampling and LLSP compression to produce near-optimal CUR LRA whp, while preserving sparsity and structure. Numerical experiments show rapid improvement within a few iterations, often outperforming initial LRAs and demonstrating practical sublinear performance on real-world data.

Abstract

A matrix algorithm runs superfast (aka at sublinear cost) if it involves much fewer flops and memory cells than an input matrix has entries. Big Data are frequently represented by matrices of immense sizes that cannot be handled directly but can be approximated with low rank matrices, with which one can operate superfast. Superfast computation of Low Rank Approximations (LRA) of a matrix, however, can be a challenge. Any superfast LRA algorithm fails miserably on worst case matrices. Fortunately, they rarely appear in computational practice. For an important example, superfast Adaptive Cross--Approximation iterations have consistently output accurate LRA of a large and important class of matrices during the decades of their worldwide application. Furthermore, they output CUR LRA, which is a special attractive form of LRA, widely applied in data analysis. %VP Adequate formal support for that empirical behavior, however, is still a challenge. Encouraged by success of these iterations we present a superfast randomized iterative refinement of a crude CUR LRA by means of combining random sampling with the Alternating Least Squares Method, which reduces this refinement to recursive solution of generalized Linear Least Squares Problems (LLSPs). We prove monotone convergence of our iterations with a high probability to a near-optimal LRA of an input matrix under some specified assumptions on it and on a crude approximate solution to an initial generalized LLSP. In our numerical tests two or three iterations of our algorithm have consistently and significantly improved crude initial LRAs of real world inputs.

Superfast iterative refinement of low rank approximation of a matrix based on ALS method and random sampling

TL;DR

This work addresses the problem of computing low-rank approximations for massive matrices at sublinear cost by combining ALS-based iterative refinement with random sampling. It develops both deterministic and randomized (superfast) iterative refinement frameworks that progressively improve an initial CUR LRA toward a near-optimal solution, with convergence guarantees expressed in terms of principal-angle distances and under specific spectral-gap assumptions. A key contribution is the integration of leverage-score-based sampling and LLSP compression to produce near-optimal CUR LRA whp, while preserving sparsity and structure. Numerical experiments show rapid improvement within a few iterations, often outperforming initial LRAs and demonstrating practical sublinear performance on real-world data.

Abstract

A matrix algorithm runs superfast (aka at sublinear cost) if it involves much fewer flops and memory cells than an input matrix has entries. Big Data are frequently represented by matrices of immense sizes that cannot be handled directly but can be approximated with low rank matrices, with which one can operate superfast. Superfast computation of Low Rank Approximations (LRA) of a matrix, however, can be a challenge. Any superfast LRA algorithm fails miserably on worst case matrices. Fortunately, they rarely appear in computational practice. For an important example, superfast Adaptive Cross--Approximation iterations have consistently output accurate LRA of a large and important class of matrices during the decades of their worldwide application. Furthermore, they output CUR LRA, which is a special attractive form of LRA, widely applied in data analysis. %VP Adequate formal support for that empirical behavior, however, is still a challenge. Encouraged by success of these iterations we present a superfast randomized iterative refinement of a crude CUR LRA by means of combining random sampling with the Alternating Least Squares Method, which reduces this refinement to recursive solution of generalized Linear Least Squares Problems (LLSPs). We prove monotone convergence of our iterations with a high probability to a near-optimal LRA of an input matrix under some specified assumptions on it and on a crude approximate solution to an initial generalized LLSP. In our numerical tests two or three iterations of our algorithm have consistently and significantly improved crude initial LRAs of real world inputs.

Paper Structure

This paper contains 34 sections, 9 theorems, 71 equations, 3 figures, 1 table, 3 algorithms.

Key Result

Theorem 2.1

The Eckart-Young-Mirsky theorem: LRA by means of truncation of SVD (see S907EY36M60, GL13). The matrix $M_r$, the $r$-truncation of SVD of $M$, defines an optimal rank-$r$ approximation of $M$ under both spectral and Frobenius norms: $||M_r-M||= \sigma_{r+1}(M)$ and $\sigma_{F,r+1}(M):=||M_r-M||_F= \sqrt{\sum_{j> r}\sigma_j^2(M)}$.

Figures (3)

  • Figure 1: The first updates of the matrices $A_t$ and $B_t$ in Eqn. (\ref{['eqllsps']}).
  • Figure 2: Part (a) displays the original problem of solving LLSP (defined by two matrices $A$ and $M$). Parts (b) and (c) show compression of these matrices into two matrices $M'$ and $A'$ of smaller sizes. Part (d) displays compressed problem defined by the pair of matrices $M'$ and $A'$.
  • Figure 3: Principal Angle Distance with Least Squares Regression in Refinement Alg. \ref{['algalter']}

Theorems & Definitions (33)

  • Definition 2.1
  • Definition 2.2
  • Definition 2.3
  • Remark 2.1
  • Theorem 2.1
  • Example 2.1
  • Remark 2.2
  • Theorem 3.1
  • Remark 3.1
  • proof
  • ...and 23 more