Fast Proxy Experiment Design for Causal Effect Identification
Sepehr Elahi, Sina Akbari, Jalal Etesami, Negar Kiyavash, Patrick Thiran
TL;DR
This paper tackles identifying causal effects via proxy experiments by reframing the minimum-cost intervention design (MCID) problem for causal identification in ADMGs. It introduces reformulations of MCID as a weighted partial MAX-SAT (WPMAX-SAT) problem and as an ILP, along with submodular and reinforcement-learning perspectives, enabling faster exact solutions. A SAT-construction procedure and extensions to multiple districts yield practical, scalable solvers that significantly outperform the previous state of the art, with experimental results showing substantial reductions in runtime. Additionally, the authors develop a generalized-adjustment framework that provides a polynomial-time heuristic and a principled route to identify adjustment-based designs, enhancing applicability to real-world settings where costly interventions must be minimized while ensuring identifiability.
Abstract
Identifying causal effects is a key problem of interest across many disciplines. The two long-standing approaches to estimate causal effects are observational and experimental (randomized) studies. Observational studies can suffer from unmeasured confounding, which may render the causal effects unidentifiable. On the other hand, direct experiments on the target variable may be too costly or even infeasible to conduct. A middle ground between these two approaches is to estimate the causal effect of interest through proxy experiments, which are conducted on variables with a lower cost to intervene on compared to the main target. Akbari et al. [2022] studied this setting and demonstrated that the problem of designing the optimal (minimum-cost) experiment for causal effect identification is NP-complete and provided a naive algorithm that may require solving exponentially many NP-hard problems as a sub-routine in the worst case. In this work, we provide a few reformulations of the problem that allow for designing significantly more efficient algorithms to solve it as witnessed by our extensive simulations. Additionally, we study the closely-related problem of designing experiments that enable us to identify a given effect through valid adjustments sets.
