Construction of Hierarchically Semi-Separable matrix Representation using Adaptive Johnson-Lindenstrauss Sketching
Yotam Yaniv, Pieter Ghysels, Osman Asif Malik, Henry A. Boateng, Xiaoye S. Li
TL;DR
This work extends adaptive HSS compression from Gaussian sketches to the broader class of Johnson-Lindenstrauss (JL) sketching operators, enabling faster, partially matrix-free HSS construction for large dense matrices. It establishes Frobenius-norm concentration and range-finder bounds for general JL sketches, including SJLT and SRHT, and implements these schemes in STRUMPACK with detailed SJLT and SRHT plumbing. Empirical results show up to 2.5× speedups in serial and up to 35× speedups in distributed settings, with modest compromises in accuracy. The generalized framework allows users to choose sketching operators with provable guarantees, yielding practical, scalable HSS compression for engineers and scientists.
Abstract
We present an extension of an adaptive, partially matrix-free, Hierarchically Semi-Separable (HSS) matrix construction algorithm by Gorman et al. [SIAM J. Sci. Comput. 41(5), 2019] which uses Gaussian sketching operators to a broader class of Johnson--Lindenstrauss (JL) sketching operators. We develop theoretical work which justifies this extension. In particular, we extend the earlier concentration bounds to all JL sketching operators and examine this bound for specific classes of such operators including the original Gaussian sketching operators, subsampled randomized Hadamard transform (SRHT) and the sparse Johnson--Lindenstrauss transform (SJLT). We discuss the implementation details of applying SJLT and SRHT efficiently. Then we demonstrate experimentally that using SJLT or SRHT instead of Gaussian sketching operators leads to up to 2.5x speedups of the serial HSS construction implementation in the STRUMPACK C++ library. Additionally, we discuss the implementation of a parallel distributed HSS construction that leverages Gaussian or SJLT sketching operators. We observe a performance improvement of up to 35x when using SJLT sketching operators over Gaussian sketching operators. The generalized algorithm allows users to select their own JL sketching operators with theoretical lower bounds on the size of the operators which may lead to faster run time with similar HSS construction accuracy.
