Table of Contents
Fetching ...

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

Lorenzo Beretta, Deeparnab Chakrabarty, C. Seshadhri

TL;DR

This work develops sublinear algorithms for estimating the graph’s average degree across a spectrum of access models, leveraging random edges and structural queries to beat classic bounds. It introduces a hybrid framework combining heavy-vertex analysis, harmonic estimators, and density estimation over quasi-regular subgraphs to achieve tight upper and lower bounds: $ ilde{O}(n^{1/4})$ queries in the standard model with RandEdge and $ ilde{O}(n^{1/5})$ with full neighborhood access when $n$ is known, and corresponding tradeoffs in additive and unknown-$n$ settings. The results show that structural queries substantially improve efficiency when $n$ is known, while in unknown-$n$ scenarios some gains vanish unless powerful additiv e oracles are available. The work also provides nearly matching lower bounds and discusses the nuanced impact of unknown $n$ and additional query types on the fundamental limits of degree estimation in sublinear time.

Abstract

We revisit the problem of designing sublinear algorithms for estimating the average degree of an $n$-vertex graph. The standard access model for graphs allows for the following queries: sampling a uniform random vertex, the degree of a vertex, sampling a uniform random neighbor of a vertex, and ``pair queries'' which determine if a pair of vertices form an edge. In this model, original results [Goldreich-Ron, RSA 2008; Eden-Ron-Seshadhri, SIDMA 2019] on this problem prove that the complexity of getting $(1+\varepsilon)$-multiplicative approximations to the average degree, ignoring $\varepsilon$-dependencies, is $Θ(\sqrt{n})$. When random edges can be sampled, it is known that the average degree can estimated in $\widetilde{O}(n^{1/3})$ queries, even without pair queries [Motwani-Panigrahy-Xu, ICALP 2007; Beretta-Tetek, TALG 2024]. We give a nearly optimal algorithm in the standard access model with random edge samples. Our algorithm makes $\widetilde{O}(n^{1/4})$ queries exploiting the power of pair queries. We also analyze the ``full neighborhood access" model wherein the entire adjacency list of a vertex can be obtained with a single query; this model is relevant in many practical applications. In a weaker version of this model, we give an algorithm that makes $\widetilde{O}(n^{1/5})$ queries. Both these results underscore the power of {\em structural queries}, such as pair queries and full neighborhood access queries, for estimating the average degree. We give nearly matching lower bounds, ignoring $\varepsilon$-dependencies, for all our results. So far, almost all algorithms for estimating average degree assume that the number of vertices, $n$, is known. Inspired by [Beretta-Tetek, TALG 2024], we study this problem when $n$ is unknown and show that structural queries do not help in estimating average degree in this setting.

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

TL;DR

This work develops sublinear algorithms for estimating the graph’s average degree across a spectrum of access models, leveraging random edges and structural queries to beat classic bounds. It introduces a hybrid framework combining heavy-vertex analysis, harmonic estimators, and density estimation over quasi-regular subgraphs to achieve tight upper and lower bounds: queries in the standard model with RandEdge and with full neighborhood access when is known, and corresponding tradeoffs in additive and unknown- settings. The results show that structural queries substantially improve efficiency when is known, while in unknown- scenarios some gains vanish unless powerful additiv e oracles are available. The work also provides nearly matching lower bounds and discusses the nuanced impact of unknown and additional query types on the fundamental limits of degree estimation in sublinear time.

Abstract

We revisit the problem of designing sublinear algorithms for estimating the average degree of an -vertex graph. The standard access model for graphs allows for the following queries: sampling a uniform random vertex, the degree of a vertex, sampling a uniform random neighbor of a vertex, and ``pair queries'' which determine if a pair of vertices form an edge. In this model, original results [Goldreich-Ron, RSA 2008; Eden-Ron-Seshadhri, SIDMA 2019] on this problem prove that the complexity of getting -multiplicative approximations to the average degree, ignoring -dependencies, is . When random edges can be sampled, it is known that the average degree can estimated in queries, even without pair queries [Motwani-Panigrahy-Xu, ICALP 2007; Beretta-Tetek, TALG 2024]. We give a nearly optimal algorithm in the standard access model with random edge samples. Our algorithm makes queries exploiting the power of pair queries. We also analyze the ``full neighborhood access" model wherein the entire adjacency list of a vertex can be obtained with a single query; this model is relevant in many practical applications. In a weaker version of this model, we give an algorithm that makes queries. Both these results underscore the power of {\em structural queries}, such as pair queries and full neighborhood access queries, for estimating the average degree. We give nearly matching lower bounds, ignoring -dependencies, for all our results. So far, almost all algorithms for estimating average degree assume that the number of vertices, , is known. Inspired by [Beretta-Tetek, TALG 2024], we study this problem when is unknown and show that structural queries do not help in estimating average degree in this setting.

Paper Structure

This paper contains 20 sections, 29 theorems, 31 equations, 1 table, 12 algorithms.

Key Result

Theorem 1.1

Consider the standard model with $n$ known and $\mathbf{RandEdge}$ queries. (a) There is an algorithm that outputs a $(1 + \varepsilon)$-approximation to $d$ in $\widetilde{O}\left(\varepsilon^{-2}\min\left(d, \sqrt[3]{n/d}\right)\right)$ queries. In particular, we get an $(1 + \varepsilon)$-approxi

Theorems & Definitions (88)

  • Remark 1
  • Theorem 1.1
  • Theorem 1.2
  • Theorem 1.3
  • Theorem 1.4
  • Theorem 1.5
  • Theorem 1.6
  • Lemma 2.3
  • Lemma 2.4
  • proof
  • ...and 78 more