Cut Sparsification and Succinct Representation of Submodular Hypergraphs
Yotam Kenneth, Robert Krauthgamer
TL;DR
This work extends cut sparsification to submodular hypergraphs by introducing polynomial-size sparsifiers that approximate all cuts within a factor $1+\epsilon$ and by introducing a spread-based parameter to tighten sparsifier size in finite-spread cases. It also develops a framework for succinct representation, showing that for additive splitting functions one can achieve encodings with near-linear space in $n$, via a deformation approach that replaces large hyperedges with many small ones, while proving that reweighted sparsifiers can impose substantially larger encoding sizes and in some cases are provably space-inefficient. The authors introduce the concept of deformation, provide upper and lower bounds for deformations of common splitting functions, and derive encoding-size lower bounds that separate reweighted sparsifiers from more compact encodings. Together, these results illuminate the trade-offs between sparsification, succinct encoding, and deformation in submodular hypergraphs, with implications for clustering, ML, and higher-order optimization.
Abstract
In cut sparsification, all cuts of a hypergraph $H=(V,E,w)$ are approximated within $1\pmε$ factor by a small hypergraph $H'$. This widely applied method was generalized recently to a setting where the cost of cutting each hyperedge $e$ is provided by a splitting function $g_e: 2^e\to\mathbb{R}_+$. This generalization is called a submodular hypergraph when the functions $\{g_e\}_{e\in E}$ are submodular, and it arises in machine learning, combinatorial optimization, and algorithmic game theory. Previous work studied the setting where $H'$ is a reweighted sub-hypergraph of $H$, and measured the size of $H'$ by the number of hyperedges in it. In this setting, we present two results: (i) all submodular hypergraphs admit sparsifiers of size polynomial in $n=|V|$ and $ε^{-1}$; (ii) we propose a new parameter, called spread, and use it to obtain smaller sparsifiers in some cases. We also show that for a natural family of splitting functions, relaxing the requirement that $H'$ be a reweighted sub-hypergraph of $H$ yields a substantially smaller encoding of the cuts of $H$ (almost a factor $n$ in the number of bits). This is in contrast to graphs, where the most succinct representation is attained by reweighted subgraphs. A new tool in our construction of succinct representation is the notion of deformation, where a splitting function $g_e$ is decomposed into a sum of functions of small description, and we provide upper and lower bounds for deformation of common splitting functions.
