Table of Contents
Fetching ...

Output-sensitive approximate counting via a measure-bounded hyperedge oracle, or: How asymmetry helps estimate $k$-clique counts faster

Keren Censor-Hillel, Tomer Even, Virginia Vassilevska Williams

TL;DR

This work extends the fine-grained reduction from approximate counting to decision to an output-sensitive regime by introducing a measure-bound hyperedge oracle. By defining μ(U) = ∏_{i=1}^k |U∩V_i| and restricting oracle queries to low-measure subgraphs, the authors obtain polylogarithmic-query reductions that yield (1±ε)-approximate counts for hyperedge counts, including k-clique, k-dominating set, and k-sum, with runtimes governed by matrix-multiplication exponents. The core framework combines a recursive heavy/light vertex decomposition with a measure-aware detection oracle, enabling significant improvements in the output-sensitive regime and linking approximation to detection through a generalized “Duplicatable” class of problems. The results provide near-optimal running times in several regimes (e.g., for large k-t, m = n^t cliques, etc.) and establish a versatile methodology for turning detection oracles into fast approximate counting oracles for a broad family of combinatorial counting problems. Practically, this advances exact or near-exact counts in sublinear-regime or with runtimes closely tied to the witness count, offering tighter trade-offs in fine-grained complexity contexts.

Abstract

Dell, Lapinskas and Meeks [DLM SICOMP 2022] presented a general reduction from approximate counting to decision for a class of fine-grained problems that can be viewed as hyperedge counting or detection problems in an implicit hypergraph, thus obtaining tight equivalences between approximate counting and decision for many key problems such as $k$-clique, $k$-sum and more. Their result is a reduction from approximately counting the number of hyperedges in an implicit $k$-partite hypergraph to a polylogarithmic number of calls to a hyperedge oracle that returns whether a given subhypergraph contains an edge. The main result of this paper is a generalization of the DLM result for {\em output-sensitive} approximate counting, where the running time of the desired counting algorithm is inversely proportional to the number of witnesses. Our theorem is a reduction from approximately counting the (unknown) number of hyperedges in an implicit $k$-partite hypergraph to a polylogarithmic number of calls to a hyperedge oracle called only on subhypergraphs with a small ``measure''. If a subhypergraph has $u_i$ nodes in the $i$th node partition of the $k$-partite hypergraph, then its measure is $\prod_i u_i$. Using the new general reduction and by efficiently implementing measure-bounded colorful independence oracles, we obtain new improved output-sensitive approximate counting algorithms for $k$-clique, $k$-dominating set and $k$-sum. In graphs with $n^t$ $k$-cliques, for instance, our algorithm $(1\pm ε)$-approximates the $k$-clique count in time $$\tilde{O}_ε(n^{ω(\frac{k-t-1}{3},\frac{k-t}{3},\frac{k-t+2}{3}) }+n^2),$$ where $ω(a,b,c)$ is the exponent of $n^a\times n^b$ by $n^b\times n^c$ matrix multiplication. For large $k$ and $t>2$, this is a substantial improvement over prior work, even if $ω=2$.

Output-sensitive approximate counting via a measure-bounded hyperedge oracle, or: How asymmetry helps estimate $k$-clique counts faster

TL;DR

This work extends the fine-grained reduction from approximate counting to decision to an output-sensitive regime by introducing a measure-bound hyperedge oracle. By defining μ(U) = ∏_{i=1}^k |U∩V_i| and restricting oracle queries to low-measure subgraphs, the authors obtain polylogarithmic-query reductions that yield (1±ε)-approximate counts for hyperedge counts, including k-clique, k-dominating set, and k-sum, with runtimes governed by matrix-multiplication exponents. The core framework combines a recursive heavy/light vertex decomposition with a measure-aware detection oracle, enabling significant improvements in the output-sensitive regime and linking approximation to detection through a generalized “Duplicatable” class of problems. The results provide near-optimal running times in several regimes (e.g., for large k-t, m = n^t cliques, etc.) and establish a versatile methodology for turning detection oracles into fast approximate counting oracles for a broad family of combinatorial counting problems. Practically, this advances exact or near-exact counts in sublinear-regime or with runtimes closely tied to the witness count, offering tighter trade-offs in fine-grained complexity contexts.

Abstract

Dell, Lapinskas and Meeks [DLM SICOMP 2022] presented a general reduction from approximate counting to decision for a class of fine-grained problems that can be viewed as hyperedge counting or detection problems in an implicit hypergraph, thus obtaining tight equivalences between approximate counting and decision for many key problems such as -clique, -sum and more. Their result is a reduction from approximately counting the number of hyperedges in an implicit -partite hypergraph to a polylogarithmic number of calls to a hyperedge oracle that returns whether a given subhypergraph contains an edge. The main result of this paper is a generalization of the DLM result for {\em output-sensitive} approximate counting, where the running time of the desired counting algorithm is inversely proportional to the number of witnesses. Our theorem is a reduction from approximately counting the (unknown) number of hyperedges in an implicit -partite hypergraph to a polylogarithmic number of calls to a hyperedge oracle called only on subhypergraphs with a small ``measure''. If a subhypergraph has nodes in the th node partition of the -partite hypergraph, then its measure is . Using the new general reduction and by efficiently implementing measure-bounded colorful independence oracles, we obtain new improved output-sensitive approximate counting algorithms for -clique, -dominating set and -sum. In graphs with -cliques, for instance, our algorithm -approximates the -clique count in time where is the exponent of by matrix multiplication. For large and , this is a substantial improvement over prior work, even if .

Paper Structure

This paper contains 49 sections, 48 theorems, 78 equations, 4 figures, 2 algorithms.

Key Result

Theorem 1

Let $G$ be a $k$-partite hypergraph with vertex set $V^{\mathrm{input}}=V^{\mathrm{input}}_1\sqcup V^{\mathrm{input}}_2\sqcup \ldots\sqcup V^{\mathrm{input}}_k$, where $\lvert*\rvert{V^{\mathrm{input}}}=n$, and a set $E$ of $m$ (unknown) hyperedges. There exists a randomized algorithm Count$(G,\vare

Figures (4)

  • Figure 1: The above figures illustrate the comparison between our work and prior work for $k=4$ and $k=5$ in the $k$-clique problem. The blue line depicts the running time of nesetril, which does not depend on $m$. The orange line depicts the running time of the sampling algorithm. The green line depicts our running time specified in \ref{['thms:clique']}, and the red line depicts our running time specified in \ref{['thms:k4']}. \ref{['fig:clique0', 'fig:ds0']} are plotted based on RenMM2021.
  • Figure 2: The above figures illustrate the comparison between our work and prior work for $k=3$ and $k=8$ in the $k$-DS problem. The blue line depicts the running time of eisenbrand2004complexity, which does not depend on $m$. The orange line depicts the running time of the sampling algorithm. The green line depicts our running time specified in \ref{['thms:DS']}, and the red line depicts our running time specified in \ref{['thms:ds3']}.
  • Figure 3: The above figures illustrate the comparison between our work and prior work for $k=3$ and $k=9$ in the $k$-sum problem. The blue line depicts the folklore running time of $n^{\lceil k/2\rceil}$, which does not depend on $m$. The orange line depicts the running time of the sampling algorithm. The green line depicts our running time specified in \ref{['thms:sum']}, and the red line depicts our running time specified in \ref{['thms:sum3']}.
  • Figure 4: This figure presents the call flow of the different algorithms. A directed edge from algorithm $A$ to algorithm $B$ indicates that algorithm $A$ makes a call to algorithm $B$. A value $x$ on such edge indicates an upper bound on the number of times that algorithm $A$ makes a call to algorithm $B$. This figure explains the structure of the recursion tree created by the algorithm $\mathbf{GuessApx}$. An algorithm with out-degree zero is a leaf in the recursion tree, where only leaves query the Hyperedge-Oracle.

Theorems & Definitions (63)

  • Theorem 1: Dell-Lapinskas-Meeks'22
  • Theorem 2: Main Theorem
  • Theorem 3: Clique (Simplified)
  • Theorem 4: $4$-clique
  • Theorem 5: $k$-DS (Simplified)
  • Theorem 6: $3$-dominating set
  • Theorem 7: $k$-Sum (Simplified)
  • Theorem 8: $3$-Sum
  • Theorem 9: DellLM22bhattacharya2024faster
  • Claim 10
  • ...and 53 more