Table of Contents
Fetching ...

A fast score-based search algorithm for maximal ancestral graphs using entropy

Zhongyi Hu, Robin Evans

TL;DR

This work tackles learning maximal ancestral graphs (MAGs) from data by replacing unstable BIC-based scoring with an entropy-based imset score grounded in the refined Markov property (ROMP). It develops a polynomial-time, greedy search over Markov equivalence classes using partial ancestral graphs (PAGs) and a novel branching strategy for discriminating paths, under Meek-like MAG conjecture assumptions. The proposed score S_G^r = -2N⟨δ_V − u_G^r, ŜH⟩ + d log N, together with ROMP and controlled head-size, yields a decomposable, consistent objective and scalable MEC navigation. Empirical results on simulated linear Gaussian MAGs show superior edge-accuracy and competitive BIC gaps compared to BIC-based and constraint-based baselines, illustrating practical benefits for causal discovery in the presence of latent confounders. The approach highlights a path toward efficient, principled MAG learning with principled handling of equivalence classes and finite-sample corrections.

Abstract

\emph{Maximal ancestral graph} (MAGs) is a class of graphical model that extend the famous \emph{directed acyclic graph} in the presence of latent confounders. Most score-based approaches to learn the unknown MAG from empirical data rely on BIC score which suffers from instability and heavy computations. We propose to use the framework of imsets \citep{studeny2006probabilistic} to score MAGs using empirical entropy estimation and the newly proposed \emph{refined Markov property} \citep{hu2023towards}. Our graphical search procedure is similar to \citet{claassen2022greedy} but improved from our theoretical results. We show that our search algorithm is polynomial in number of nodes by restricting degree, maximal head size and number of discriminating paths. In simulated experiment, our algorithm shows superior performance compared to other state of art MAG learning algorithms.

A fast score-based search algorithm for maximal ancestral graphs using entropy

TL;DR

This work tackles learning maximal ancestral graphs (MAGs) from data by replacing unstable BIC-based scoring with an entropy-based imset score grounded in the refined Markov property (ROMP). It develops a polynomial-time, greedy search over Markov equivalence classes using partial ancestral graphs (PAGs) and a novel branching strategy for discriminating paths, under Meek-like MAG conjecture assumptions. The proposed score S_G^r = -2N⟨δ_V − u_G^r, ŜH⟩ + d log N, together with ROMP and controlled head-size, yields a decomposable, consistent objective and scalable MEC navigation. Empirical results on simulated linear Gaussian MAGs show superior edge-accuracy and competitive BIC gaps compared to BIC-based and constraint-based baselines, illustrating practical benefits for causal discovery in the presence of latent confounders. The approach highlights a path toward efficient, principled MAG learning with principled handling of equivalence classes and finite-sample corrections.

Abstract

\emph{Maximal ancestral graph} (MAGs) is a class of graphical model that extend the famous \emph{directed acyclic graph} in the presence of latent confounders. Most score-based approaches to learn the unknown MAG from empirical data rely on BIC score which suffers from instability and heavy computations. We propose to use the framework of imsets \citep{studeny2006probabilistic} to score MAGs using empirical entropy estimation and the newly proposed \emph{refined Markov property} \citep{hu2023towards}. Our graphical search procedure is similar to \citet{claassen2022greedy} but improved from our theoretical results. We show that our search algorithm is polynomial in number of nodes by restricting degree, maximal head size and number of discriminating paths. In simulated experiment, our algorithm shows superior performance compared to other state of art MAG learning algorithms.
Paper Structure (43 sections, 15 theorems, 15 equations, 29 figures, 1 table, 1 algorithm)

This paper contains 43 sections, 15 theorems, 15 equations, 29 figures, 1 table, 1 algorithm.

Key Result

Theorem 2.1

Let ${\cal G}_{1}$ and ${\cal G}_{2}$ be two MAGs. Then ${\cal G}_{1}$ and ${\cal G}_{2}$ are Markov equivalent if and only if $\mathcal{S}({\cal G}_{1})=\mathcal{S}({\cal G}_{2})$

Figures (29)

  • Figure 1: (i) An ancestral graph that is not maximal. (ii) A maximal graph that is not ancestral. (iii) A maximal ancestral graph.
  • Figure 2: An example for Definition \ref{['one head to another head']}.
  • Figure 3: A DAG with 4 nodes
  • Figure 4: Examples for redundant triples
  • Figure 5: A PAG with a new adjacency
  • ...and 24 more figures

Theorems & Definitions (47)

  • Definition 2.1
  • Definition 2.2
  • Definition 2.3
  • Definition 2.4
  • Definition 2.5
  • Theorem 2.1
  • Proposition 2.2
  • Example 2.1
  • Definition 2.6
  • Lemma 2.3
  • ...and 37 more