Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

Lorenzo Beretta; Deeparnab Chakrabarty; C. Seshadhri

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

Lorenzo Beretta, Deeparnab Chakrabarty, C. Seshadhri

TL;DR

This work develops sublinear algorithms for estimating the graph’s average degree across a spectrum of access models, leveraging random edges and structural queries to beat classic bounds. It introduces a hybrid framework combining heavy-vertex analysis, harmonic estimators, and density estimation over quasi-regular subgraphs to achieve tight upper and lower bounds: $ ilde{O}(n^{1/4})$ queries in the standard model with RandEdge and $ ilde{O}(n^{1/5})$ with full neighborhood access when $n$ is known, and corresponding tradeoffs in additive and unknown-$n$ settings. The results show that structural queries substantially improve efficiency when $n$ is known, while in unknown-$n$ scenarios some gains vanish unless powerful additiv e oracles are available. The work also provides nearly matching lower bounds and discusses the nuanced impact of unknown $n$ and additional query types on the fundamental limits of degree estimation in sublinear time.

Abstract

We revisit the problem of designing sublinear algorithms for estimating the average degree of an $n$-vertex graph. The standard access model for graphs allows for the following queries: sampling a uniform random vertex, the degree of a vertex, sampling a uniform random neighbor of a vertex, and ``pair queries'' which determine if a pair of vertices form an edge. In this model, original results [Goldreich-Ron, RSA 2008; Eden-Ron-Seshadhri, SIDMA 2019] on this problem prove that the complexity of getting $(1+\varepsilon)$-multiplicative approximations to the average degree, ignoring $\varepsilon$-dependencies, is $Θ(\sqrt{n})$. When random edges can be sampled, it is known that the average degree can estimated in $\widetilde{O}(n^{1/3})$ queries, even without pair queries [Motwani-Panigrahy-Xu, ICALP 2007; Beretta-Tetek, TALG 2024]. We give a nearly optimal algorithm in the standard access model with random edge samples. Our algorithm makes $\widetilde{O}(n^{1/4})$ queries exploiting the power of pair queries. We also analyze the ``full neighborhood access" model wherein the entire adjacency list of a vertex can be obtained with a single query; this model is relevant in many practical applications. In a weaker version of this model, we give an algorithm that makes $\widetilde{O}(n^{1/5})$ queries. Both these results underscore the power of {\em structural queries}, such as pair queries and full neighborhood access queries, for estimating the average degree. We give nearly matching lower bounds, ignoring $\varepsilon$-dependencies, for all our results. So far, almost all algorithms for estimating average degree assume that the number of vertices, $n$, is known. Inspired by [Beretta-Tetek, TALG 2024], we study this problem when $n$ is unknown and show that structural queries do not help in estimating average degree in this setting.

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

TL;DR

queries in the standard model with RandEdge and

with full neighborhood access when

is known, and corresponding tradeoffs in additive and unknown-

settings. The results show that structural queries substantially improve efficiency when

is known, while in unknown-

scenarios some gains vanish unless powerful additiv e oracles are available. The work also provides nearly matching lower bounds and discusses the nuanced impact of unknown

and additional query types on the fundamental limits of degree estimation in sublinear time.

Abstract

We revisit the problem of designing sublinear algorithms for estimating the average degree of an

-vertex graph. The standard access model for graphs allows for the following queries: sampling a uniform random vertex, the degree of a vertex, sampling a uniform random neighbor of a vertex, and ``pair queries'' which determine if a pair of vertices form an edge. In this model, original results [Goldreich-Ron, RSA 2008; Eden-Ron-Seshadhri, SIDMA 2019] on this problem prove that the complexity of getting

-multiplicative approximations to the average degree, ignoring

-dependencies, is

. When random edges can be sampled, it is known that the average degree can estimated in

queries, even without pair queries [Motwani-Panigrahy-Xu, ICALP 2007; Beretta-Tetek, TALG 2024]. We give a nearly optimal algorithm in the standard access model with random edge samples. Our algorithm makes

queries exploiting the power of pair queries. We also analyze the ``full neighborhood access" model wherein the entire adjacency list of a vertex can be obtained with a single query; this model is relevant in many practical applications. In a weaker version of this model, we give an algorithm that makes

queries. Both these results underscore the power of {\em structural queries}, such as pair queries and full neighborhood access queries, for estimating the average degree. We give nearly matching lower bounds, ignoring

-dependencies, for all our results. So far, almost all algorithms for estimating average degree assume that the number of vertices,

, is known. Inspired by [Beretta-Tetek, TALG 2024], we study this problem when

is unknown and show that structural queries do not help in estimating average degree in this setting.

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

TL;DR

Abstract

Faster Estimation of the Average Degree of a Graph Using Random Edges and Structural Queries

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (88)