Adversarially-Robust Inference on Trees via Belief Propagation

Samuel B. Hopkins; Anqi Li

Adversarially-Robust Inference on Trees via Belief Propagation

Samuel B. Hopkins, Anqi Li

TL;DR

The paper analyzes posterior inference on depth-$t$, $d$-regular trees under adversarial leaf corruption. It shows a sharp distinction between worst-case and semirandom adversaries: a ρ-fraction adversary can render the root unidentifiable, while a semirandom adversary with ρ small enough allows accurate root inference when the signal-to-noise ratio $d\varepsilon^2$ exceeds a logarithmic factor above the KS threshold, with belief propagation achieving this robust recovery. The main technique is a contraction argument for belief propagation, carefully handling long-range adversarial dependencies by decomposing adversaries into local components and, in the large-ε regime, employing truncation and derivative bounds. The results also connect robustness to model misspecification, provide a two-stage algorithm for spread adversaries, and establish information-theoretic lower bounds detailing the limits of adversarial robustness. Overall, the work demonstrates information-theoretically optimal corruption tolerance up to a constant factor and identifies open questions about robustness exactly at or below the KS threshold, with implications for robust Bayesian inference on tree-structured models.

Abstract

We introduce and study the problem of posterior inference on tree-structured graphical models in the presence of a malicious adversary who can corrupt some observed nodes. In the well-studied broadcasting on trees model, corresponding to the ferromagnetic Ising model on a $d$-regular tree with zero external field, when a natural signal-to-noise ratio exceeds one (the celebrated Kesten-Stigum threshold), the posterior distribution of the root given the leaves is bounded away from $\mathrm{Ber}(1/2)$, and carries nontrivial information about the sign of the root. This posterior distribution can be computed exactly via dynamic programming, also known as belief propagation. We first confirm a folklore belief that a malicious adversary who can corrupt an inverse-polynomial fraction of the leaves of their choosing makes this inference impossible. Our main result is that accurate posterior inference about the root vertex given the leaves is possible when the adversary is constrained to make corruptions at a $ρ$-fraction of randomly-chosen leaf vertices, so long as the signal-to-noise ratio exceeds $O(\log d)$ and $ρ\leq c \varepsilon$ for some universal $c > 0$. Since inference becomes information-theoretically impossible when $ρ\gg \varepsilon$, this amounts to an information-theoretically optimal fraction of corruptions, up to a constant multiplicative factor. Furthermore, we show that the canonical belief propagation algorithm performs this inference.

Adversarially-Robust Inference on Trees via Belief Propagation

TL;DR

The paper analyzes posterior inference on depth-

-regular trees under adversarial leaf corruption. It shows a sharp distinction between worst-case and semirandom adversaries: a ρ-fraction adversary can render the root unidentifiable, while a semirandom adversary with ρ small enough allows accurate root inference when the signal-to-noise ratio

exceeds a logarithmic factor above the KS threshold, with belief propagation achieving this robust recovery. The main technique is a contraction argument for belief propagation, carefully handling long-range adversarial dependencies by decomposing adversaries into local components and, in the large-ε regime, employing truncation and derivative bounds. The results also connect robustness to model misspecification, provide a two-stage algorithm for spread adversaries, and establish information-theoretic lower bounds detailing the limits of adversarial robustness. Overall, the work demonstrates information-theoretically optimal corruption tolerance up to a constant factor and identifies open questions about robustness exactly at or below the KS threshold, with implications for robust Bayesian inference on tree-structured models.

Abstract

-regular tree with zero external field, when a natural signal-to-noise ratio exceeds one (the celebrated Kesten-Stigum threshold), the posterior distribution of the root given the leaves is bounded away from

, and carries nontrivial information about the sign of the root. This posterior distribution can be computed exactly via dynamic programming, also known as belief propagation. We first confirm a folklore belief that a malicious adversary who can corrupt an inverse-polynomial fraction of the leaves of their choosing makes this inference impossible. Our main result is that accurate posterior inference about the root vertex given the leaves is possible when the adversary is constrained to make corruptions at a

-fraction of randomly-chosen leaf vertices, so long as the signal-to-noise ratio exceeds

and

for some universal

. Since inference becomes information-theoretically impossible when

, this amounts to an information-theoretically optimal fraction of corruptions, up to a constant multiplicative factor. Furthermore, we show that the canonical belief propagation algorithm performs this inference.

Paper Structure (26 sections, 16 theorems, 106 equations, 1 figure)

This paper contains 26 sections, 16 theorems, 106 equations, 1 figure.

Introduction
Organization
Results
Open question: robustness down to the KS threshold
Adversarial Corruption versus Model Misspecification
Background: corruptions and misspecification in frequentist statistics
Misspecified Bayesian models
Interpreting our results as robustness to model misspecification
Related Work
Algorithmic Robust Statistics
Broadcasting on Trees
Robust Bayesian Inference
Overview of Proofs
Optimal recovery from $\rho$-semirandom adversary
Information-theoretic lower bound: Proof of Theorem \ref{['thm:lower-bound-intro']}
...and 11 more sections

Key Result

Theorem 1.2

There exists $\varepsilon_0 > 0$ such that for every $\rho$, $d$ and $\varepsilon < \varepsilon_0$, there exists a $\varepsilon^{O(t)}$-fraction adversary $A$ such that if $(\sigma_R,\sigma_L)$ is distributed according to the broadcast process with parameters $d,\varepsilon$,

Figures (1)

Figure 1: An illustration of the partition of the levels of the tree in the algorithm.

Theorems & Definitions (42)

Definition 1.1: Broadcasting and inference in regular trees
Theorem 1.2: Proof in Section \ref{['sec:info-theory']}
Definition 1.3: $\rho$-semirandom adversary
Theorem 1.4: Main theorem, follows from Lemma \ref{['lem:small-eps']} and Lemma \ref{['lem:contract-large']}
Definition 1.5
Corollary 1.6
Definition 3.1
Lemma 3.2
Definition 3.3
Lemma 3.4
...and 32 more

Adversarially-Robust Inference on Trees via Belief Propagation

TL;DR

Abstract

Adversarially-Robust Inference on Trees via Belief Propagation

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (42)