A comparison of two effective methods for reordering columns within supernodes
M. Ozan Karsavuran, Esmond G. Ng, Barry W. Peyton
TL;DR
This work addresses reordering columns within supernodes to optimize sparse Cholesky factorization by studying two approaches, the traveling salesman problem (TSP) based method and partition refinement (PR). It uses the right-looking blocked sparse Cholesky (RLB) framework as the testbed and applies practical improvements to both reorderings, ultimately comparing their impact on factorization time and memory on a 48-core platform with MKL. The experiments on 21 large matrices from the SuiteSparse collection show that TSP and PR yield virtually equal ordering quality, but PR incurs far lower overhead in both time and storage, making PR the method of choice in practice. The study provides a fair, detailed benchmark and concrete guidance for implementing sparse Cholesky solvers on multicore systems, highlighting that clever intra-supernode reordering with PR can substantially improve performance without the cost of TSP-based methods.
Abstract
In some recent papers, researchers have found two very good methods for reordering columns within supernodes in sparse Cholesky factors; these reorderings can be very useful for certain factorization methods. The first of these reordering methods is based on modeling the underlying problem as a traveling salesman problem (TSP), and the second of these methods is based on partition refinement (PR). In this paper, we devise a fair way to compare the two methods. While the two methods are virtually the same in the quality of the reorderings that they produce, PR should be the method of choice because PR reorderings can be computed using far less time and storage than TSP reorderings.
