Near-optimal Size Linear Sketches for Hypergraph Cut Sparsifiers
Sanjeev Khanna, Aaron L. Putterman, Madhu Sudan
TL;DR
This work extends near-linear-size sparsification from graphs to hypergraphs under linear sketching. By introducing random fingerprinting and k-cut strength decompositions, the authors design a linear-sketch framework that recovers (1±ε)-cut sparsifiers with space \\tilde{O}(n r log(m) / ε^2) and sparsifier size \\tilde{O}(n / ε^2), nearly matching known non-sketching bounds. A matching lower bound shows this dependence on arity $r$ and log(m) is tight up to polylog factors, confirming near-optimality. The results yield dynamic streaming and MPC algorithms with improved space and round complexity, and provide a broad set of techniques (fingerprinting, preprocessing, strength-based sampling) that advance hypergraph sparsification in resource-constrained computation models.
Abstract
A $(1 \pm ε)$-sparsifier of a hypergraph $G(V,E)$ is a (weighted) subgraph that preserves the value of every cut to within a $(1 \pm ε)$-factor. It is known that every hypergraph with $n$ vertices admits a $(1 \pm ε)$-sparsifier with $\tilde{O}(n/ε^2)$ hyperedges. In this work, we explore the task of building such a sparsifier by using only linear measurements (a \emph{linear sketch}) over the hyperedges of $G$, and provide nearly-matching upper and lower bounds for this task. Specifically, we show that there is a randomized linear sketch of size $\widetilde{O}(n r \log(m) / ε^2)$ bits which with high probability contains sufficient information to recover a $(1 \pm ε)$ cut-sparsifier with $\tilde{O}(n/ε^2)$ hyperedges for any hypergraph with at most $m$ edges each of which has arity bounded by $r$. This immediately gives a dynamic streaming algorithm for hypergraph cut sparsification with an identical space complexity, improving on the previous best known bound of $\widetilde{O}(n r^2 \log^4(m) / ε^2)$ bits of space (Guha, McGregor, and Tench, PODS 2015). We complement our algorithmic result above with a nearly-matching lower bound. We show that for every $ε\in (0,1)$, one needs $Ω(nr \log(m/n) / \log(n))$ bits to construct a $(1 \pm ε)$-sparsifier via linear sketching, thus showing that our linear sketch achieves an optimal dependence on both $r$ and $\log(m)$.
