Table of Contents
Fetching ...

Optimal structure learning and conditional independence testing

Ming Gao, Yuhao Wang, Bryon Aragam

TL;DR

The paper establishes a fundamental link between minimax rates for conditional independence (CI) testing and structure learning for poly-forests in DAGs, showing that optimal structure-learning complexity is governed by CI-testing hardness via a general reduction. It derives explicit-rate results across Bernoulli, Gaussian, and nonparametric models, revealing how the CI-testing radius c scales with sample size n as c \asymp n^{-1/\alpha} and how the poly-forest learning cost scales as n \asymp \frac{\log d}{c^{\alpha}}. The authors characterize an efficient, PC-tree-based algorithm that attains these rates when supplied with an optimal CI test, and they validate the theory with experiments across distributional settings. This work provides a unified statistical framework linking CI testing and structure learning, with implications for sample-efficient DAG recovery and practical algorithm design. The results suggest promising extensions to general DAGs and highlight the central role of CI testing as a subroutine for scalable, minimax-optimal structure learning.

Abstract

We establish a fundamental connection between optimal structure learning and optimal conditional independence testing by showing that the minimax optimal rate for structure learning problems is determined by the minimax rate for conditional independence testing in these problems. This is accomplished by establishing a general reduction between these two problems in the case of poly-forests, and demonstrated by deriving optimal rates for several examples, including Bernoulli, Gaussian and nonparametric models. Furthermore, we show that the optimal algorithm in these settings is a suitable modification of the PC algorithm. This theoretical finding provides a unified framework for analyzing the statistical complexity of structure learning through the lens of minimax testing.

Optimal structure learning and conditional independence testing

TL;DR

The paper establishes a fundamental link between minimax rates for conditional independence (CI) testing and structure learning for poly-forests in DAGs, showing that optimal structure-learning complexity is governed by CI-testing hardness via a general reduction. It derives explicit-rate results across Bernoulli, Gaussian, and nonparametric models, revealing how the CI-testing radius c scales with sample size n as c \asymp n^{-1/\alpha} and how the poly-forest learning cost scales as n \asymp \frac{\log d}{c^{\alpha}}. The authors characterize an efficient, PC-tree-based algorithm that attains these rates when supplied with an optimal CI test, and they validate the theory with experiments across distributional settings. This work provides a unified statistical framework linking CI testing and structure learning, with implications for sample-efficient DAG recovery and practical algorithm design. The results suggest promising extensions to general DAGs and highlight the central role of CI testing as a subroutine for scalable, minimax-optimal structure learning.

Abstract

We establish a fundamental connection between optimal structure learning and optimal conditional independence testing by showing that the minimax optimal rate for structure learning problems is determined by the minimax rate for conditional independence testing in these problems. This is accomplished by establishing a general reduction between these two problems in the case of poly-forests, and demonstrated by deriving optimal rates for several examples, including Bernoulli, Gaussian and nonparametric models. Furthermore, we show that the optimal algorithm in these settings is a suitable modification of the PC algorithm. This theoretical finding provides a unified framework for analyzing the statistical complexity of structure learning through the lens of minimax testing.

Paper Structure

This paper contains 41 sections, 11 theorems, 84 equations, 2 figures, 2 algorithms.

Key Result

Theorem 3.1

Given a conditional independence testing problem $\mathcal{C}(\mathcal{P},m,c)$ with an optimal test $\psi$ achieving the minimax testing radius $c\asymp n^{-1/\alpha}$, if there exist hard instances $p_0\in\mathcal{H}_0$ and $p_1\in \mathcal{H}_1$ that are Markov and $c$-strong tree-faithful, then which is achieved by PC-tree with $\psi$.

Figures (2)

  • Figure 1: Structure Hamming distance (SHD) vs. sample size for poly-forest learning for Bernoulli, Gaussian, and nonparametric continuous distributions over varying number of nodes (indicated by colors). Error bars represent standard deviations. SHD consistently decreases toward zero as sample size increases across all experimental settings.
  • Figure 2: Precise Recovery Rate (PRR) vs. sample size for poly-forest learning for Bernoulli, Gaussian, and nonparametric continuous distributions over varying number of nodes (indicated by colors). PRR consistently increase toward 100% as sample size increases across all experimental settings.

Theorems & Definitions (24)

  • Definition 1: Dependence measure
  • Definition 2: $c$-strong tree-faithfulness
  • Example 1: Nonparametric models
  • Definition 3: Conditional independence testing
  • Definition 4: Poly-forest learning problem
  • Theorem 3.1
  • Theorem 4.1
  • Theorem 4.2
  • Theorem 5.1
  • Definition 5: Lipschitzness
  • ...and 14 more