Fast algorithms for least square problems with Kronecker lower subsets
Osman Asif Malik, Yiming Xu, Nuojin Cheng, Stephen Becker, Alireza Doostan, Akil Narayan
TL;DR
The paper tackles the high computational cost of exact leverage score sampling for large-scale least squares problems when the design matrix is a lower column subset of a Kronecker product. It introduces an efficient algorithm that uses offline QR factorizations of the factor matrices and a two-step online sampling procedure, guaranteeing that the sampled rows follow the exact leverage-score distribution without forming the full matrix. The authors provide theoretical guarantees, including an equivalence between the proposed procedure and exact leverage-score sampling, and demonstrate practical gains through experiments on polynomial chaos expansions for a Duffing oscillator, the Ishigami function, and battery remaining useful life, showing improvements over uniform and approximate sampling. The work highlights the potential of exploiting Kronecker-structured lower-subset matrices to achieve accurate, scalable sketching for high-dimensional uncertainty quantification tasks, and it suggests future work on broader subset structures and automation of suitable permutations.
Abstract
While leverage score sampling provides powerful tools for approximating solutions to large least squares problems, the cost of computing exact scores and sampling often prohibits practical application. This paper addresses this challenge by developing a new and efficient algorithm for exact leverage score sampling applicable to matrices that are lower column subsets of Kronecker product matrices. We synthesize relevant approximation guarantees and detail the algorithm that specifically leverages this structural property for computational efficiency. Through numerical examples, we demonstrate that utilizing efficiently computed exact leverage scores via our methods significantly reduces approximation errors, as compared to established approximate leverage score sampling strategies when applied to this important class of structured matrices.
