Near-Optimal Experiment Design in Linear non-Gaussian Cyclic Models

Ehsan Sharifian; Saber Salehkaleybar; Negar Kiyavash

TL;DR

This paper addresses causal structure learning in cyclic linear non-Gaussian SCMs, where observational data identifies the graph only up to a permutation-equivalence class. The authors introduce a bipartite-matching representation of this class and prove that single-node interventions progressively reveal edges of the true matching, shrinking the class. They formulate adaptive experiment design as an adaptive submodular optimization problem with a sampling-based reward estimator, which yields a greedy strategy with a near-optimality guarantee. Experiments on synthetic graphs show performance close to the fundamental feedback vertex set (FVS) lower bound and robust behavior under finite-sample ICA; extensions to multi-node interventions and non-linear settings are also discussed.

Abstract

We study the problem of causal structure learning from a combination of observational and interventional data generated by a linear non-Gaussian structural equation model that might contain cycles. Recent results show that observational data alone identifies the causal graph only up to a permutation-equivalence class. We obtain a combinatorial characterization of this class by showing that each graph in an equivalence class corresponds to a perfect matching in a bipartite graph. This bipartite representation allows us to analyze how interventions modify or constrain the matchings. Specifically, we show that each atomic intervention reveals one edge of the true matching and eliminates all incompatible causal graphs. Consequently, we formalize the optimal experiment design task as an adaptive stochastic optimization problem over the set of equivalence classes, with a natural reward function that quantifies how many graphs an intervention eliminates from the equivalence class. We show that this reward function is adaptive submodular and provide a greedy policy with a provable near-optimal performance guarantee. A key technical challenge is to estimate the reward function efficiently without explicitly enumerating all the graphs in the equivalence class. We propose a sampling-based estimator using random matchings and analyze its bias and concentration behavior. Our simulation results show that performing a small number of interventions guided by our stochastic optimization framework recovers the true underlying causal structure.
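
The matching view described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes, hypothetically, that the bipartite graph is given as a 0/1 `support` matrix in which entry `support[r][i]` marks that row `r` may be matched to variable `i`, that the true matching is uniformly distributed over the remaining perfect matchings, and that intervening on a variable reveals its matched row. It also enumerates matchings by brute force, which is exactly what the paper's sampling-based estimator is designed to avoid, so it is only practical for tiny examples. The function names are illustrative.

```python
from collections import Counter
from itertools import permutations


def perfect_matchings(support):
    """List all perfect matchings as tuples m, where m[i] is the row matched to variable i.

    Brute force over permutations: illustration only, exponential in n.
    """
    n = len(support)
    return [
        p for p in permutations(range(n))
        if all(support[p[i]][i] for i in range(n))
    ]


def expected_eliminated(matchings, node):
    """Expected number of candidate graphs removed by intervening on `node`.

    Assumes a uniform prior over `matchings` (an assumption of this sketch).
    If the revealed row is shared by c candidates, the other (total - c) are eliminated.
    """
    total = len(matchings)
    counts = Counter(m[node] for m in matchings)
    return sum((c / total) * (total - c) for c in counts.values())


def greedy_node(matchings, candidates):
    """Pick the candidate node with the largest expected elimination (ties: first wins)."""
    return max(candidates, key=lambda v: expected_eliminated(matchings, v))


def update(matchings, node, revealed_row):
    """Keep only the candidates consistent with the revealed matching edge."""
    return [m for m in matchings if m[node] == revealed_row]


if __name__ == "__main__":
    # Hypothetical 3-variable example: variables 0 and 1 are ambiguous, variable 2 is not.
    support = [
        [1, 1, 0],
        [1, 1, 0],
        [0, 0, 1],
    ]
    candidates = perfect_matchings(support)
    node = greedy_node(candidates, range(3))
    print("candidates:", candidates)
    print("intervene on:", node)
    print("after revealing row 0:", update(candidates, node, 0))
```

In this toy example, intervening on variable 2 is worthless because every candidate already agrees on its matched row, while intervening on variable 0 or 1 splits the class in half. A greedy policy repeats the pick-and-update step until a single candidate remains.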
