Invariant subspaces and PCA in nearly matrix multiplication time

Aleksandros Sobczyk; Marko Mladenović; Mathieu Luisier

Invariant subspaces and PCA in nearly matrix multiplication time

Aleksandros Sobczyk, Marko Mladenović, Mathieu Luisier

TL;DR

New matrix multiplication-type bit complexity upper bounds for PCA problems, including classical PCA and (randomized) low-rank approximation are obtained, and a new $O(n^{\omega+\eta})$ stability analysis for the Cholesky factorization, and a smoothed analysis for computing spectral gaps are obtained.

Abstract

Approximating invariant subspaces of generalized eigenvalue problems (GEPs) is a fundamental computational problem at the core of machine learning and scientific computing. It is, for example, the root of Principal Component Analysis (PCA) for dimensionality reduction, data visualization, and noise filtering, and of Density Functional Theory (DFT), arguably the most popular method to calculate the electronic structure of materials. Given Hermitian $H,S\in\mathbb{C}^{n\times n}$, where $S$ is positive-definite, let $Π_k$ be the true spectral projector on the invariant subspace that is associated with the $k$ smallest (or largest) eigenvalues of the GEP $HC=SCΛ$, for some $k\in[n]$. We show that we can compute a matrix $\widetildeΠ_k$ such that $\lVertΠ_k-\widetildeΠ_k\rVert_2\leq ε$, in $O\left( n^{ω+η}\mathrm{polylog}(n,ε^{-1},κ(S),\mathrm{gap}_k^{-1}) \right)$ bit operations in the floating point model, for some $ε\in(0,1)$, with probability $1-1/n$. Here, $η>0$ is arbitrarily small, $ω\lesssim 2.372$ is the matrix multiplication exponent, $κ(S)=\lVert S\rVert_2\lVert S^{-1}\rVert_2$, and $\mathrm{gap}_k$ is the gap between eigenvalues $k$ and $k+1$. To achieve such provable "forward-error" guarantees, our methods rely on a new $O(n^{ω+η})$ stability analysis for the Cholesky factorization, and a smoothed analysis for computing spectral gaps, which can be of independent interest. Ultimately, we obtain new matrix multiplication-type bit complexity upper bounds for PCA problems, including classical PCA and (randomized) low-rank approximation.

Invariant subspaces and PCA in nearly matrix multiplication time

TL;DR

New matrix multiplication-type bit complexity upper bounds for PCA problems, including classical PCA and (randomized) low-rank approximation are obtained, and a new

stability analysis for the Cholesky factorization, and a smoothed analysis for computing spectral gaps are obtained.

Abstract

, where

is positive-definite, let

be the true spectral projector on the invariant subspace that is associated with the

smallest (or largest) eigenvalues of the GEP

, for some

. We show that we can compute a matrix

such that

, in

bit operations in the floating point model, for some

, with probability

. Here,

is arbitrarily small,

is the matrix multiplication exponent,

, and

is the gap between eigenvalues

and

. To achieve such provable "forward-error" guarantees, our methods rely on a new

stability analysis for the Cholesky factorization, and a smoothed analysis for computing spectral gaps, which can be of independent interest. Ultimately, we obtain new matrix multiplication-type bit complexity upper bounds for PCA problems, including classical PCA and (randomized) low-rank approximation.

Paper Structure (60 sections, 46 theorems, 274 equations, 8 algorithms)

This paper contains 60 sections, 46 theorems, 274 equations, 8 algorithms.

Introduction
Problem definition
Type of approximation:
Model of computation:
Bit complexity:
Matrix multiplication time:
Existing algorithms
Contributions and methods
Notation
Computing spectral projectors with the sign function
Fast spectral gaps with counting queries
Smoothed analysis of eigenvalue counting
Computing the gap and the midpoint
Sketch proof of Theorem \ref{['theorem:spectral_projector']}
Comparison with diagonalization
...and 45 more sections

Key Result

Theorem 1.1

Let $(\boldsymbol{\mathrm{H}},\boldsymbol{\mathrm{S}})$ be a Hermitian definite pencil of size $n$ with $\|\boldsymbol{\mathrm{H}}\|,\|\boldsymbol{\mathrm{S}}^{-1}\|\leq 1$, $\lambda_1\leq\lambda_2\leq\ldots\leq \lambda_n$ its eigenvalues, $\mathop{\mathrm{gap}}\nolimits_k=\lambda_{k+1}-\lambda_k$ a takes as inputs $\boldsymbol{\mathrm{H}}$, $\boldsymbol{\mathrm{S}}$, an integer $k\in [n-1]$, an e

Theorems & Definitions (89)

Theorem 1.1
Theorem 1.2
Proposition 2.1
proof
Proposition 3.1
Proposition 3.2
proof
Theorem 3.1: $\mathsf{GAP}$
proof
Theorem 4.1: PCA
...and 79 more

Invariant subspaces and PCA in nearly matrix multiplication time

TL;DR

Abstract

Invariant subspaces and PCA in nearly matrix multiplication time

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (89)