Adversarially Robust Dense-Sparse Tradeoffs via Heavy-Hitters

David P. Woodruff; Samson Zhou

TL;DR

This work studies adversarially robust streaming algorithms for $L_p$ estimation on turnstile streams, addressing adaptivity-induced failures. It introduces a heavy-hitter component (RobustHH) that blends a deterministic heavy-hitter routine for small universes with a robust CountSketch variant for large universes, guided by an $L_0$ estimator, along with a residual-tail estimator (ResidualEst) whose additive guarantees depend only on the tail, not its size $k$. The combination yields an improved adversarially robust $L_p$ estimator with space $\tilde{O}\left(m^c\right)$ for $p\in(1,2)$ and $c<\frac{p}{2p+1}$, plus a special-case heavy-hitter bound and a residual-focused estimation framework with space $\mathrm{poly}(1/\varepsilon,\log n)$. Empirically, on the CAIDA dataset, the residual-based approach achieves substantially smaller flip numbers and practical space savings, demonstrating robustness and scalability under adaptive inputs.

Abstract

In the adversarial streaming model, the input is a sequence of adaptive updates that defines an underlying dataset, and the goal is to approximate, collect, or compute some statistic while using space sublinear in the size of the dataset. In 2022, Ben-Eliezer, Eden, and Onak showed a dense-sparse trade-off technique that elegantly combined sparse recovery with known techniques using differential privacy and sketch switching to achieve adversarially robust algorithms for $L_p$ estimation and other algorithms on turnstile streams. In this work, we first give an improved algorithm for adversarially robust $L_p$-heavy hitters, utilizing deterministic turnstile heavy-hitter algorithms with better tradeoffs. We then utilize our heavy-hitter algorithm to reduce the problem to estimating the frequency moment of the tail vector. We give a new algorithm for this problem in the classical streaming setting, which achieves additive error and uses space independent of the size of the tail. We then leverage these ingredients to give an improved algorithm for adversarially robust $L_p$ estimation on turnstile streams.
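The reduction to the tail vector rests on a simple identity: the frequency moment $F_p=\sum_i |f_i|^p$ splits into the contribution of the heaviest coordinates plus the moment of the residual (tail) vector. The sketch below only illustrates that identity with an exact, offline computation; it is not the paper's streaming algorithm, it is not adversarially robust, and it uses linear space. The function names `frequency_vector` and `split_moment`, the toy updates, and the choice $p=2$ (picked for integer arithmetic, although the main result concerns $p\in(1,2)$) are all assumptions made for illustration.

```python
from collections import defaultdict


def frequency_vector(updates):
    """Apply turnstile updates (index, delta) and return the nonzero frequencies.

    Hypothetical helper: stores the whole vector, so it is NOT sublinear space.
    """
    freq = defaultdict(int)
    for index, delta in updates:
        freq[index] += delta  # deltas may be negative in the turnstile model
    return {i: v for i, v in freq.items() if v != 0}


def split_moment(freq, p, k):
    """Split F_p = sum_i |f_i|^p into (heavy_part, tail_part).

    heavy_part covers the k largest-magnitude coordinates;
    tail_part is the p-th moment of the remaining (residual) vector.
    """
    ranked = sorted(freq.values(), key=abs, reverse=True)
    heavy_part = sum(abs(v) ** p for v in ranked[:k])
    tail_part = sum(abs(v) ** p for v in ranked[k:])
    return heavy_part, tail_part


# Toy turnstile stream (made-up data): insertions and deletions on 4 items.
updates = [(1, 10), (2, 3), (3, 1), (1, -2), (4, 1), (2, -1)]
freq = frequency_vector(updates)          # {1: 8, 2: 2, 3: 1, 4: 1}
heavy, tail = split_moment(freq, p=2, k=1)
total = sum(abs(v) ** 2 for v in freq.values())
print(heavy, tail, total)
```

In this toy example a single heavy coordinate carries most of $F_2$, so an estimator that handles the heavy part separately only needs to approximate the small tail contribution.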
