Local Causal Discovery with Background Knowledge
Qingyuan Zheng, Yue Liu, Yangbo He
TL;DR
This work develops a local causal discovery framework that uses background knowledge to refine a Markov equivalence class into a maximally partially directed acyclic graph (MPDAG) and to identify causal relations from the local neighborhood of a target variable. It introduces MB-by-MB in MPDAG to learn local structure under direct, non-ancestral, and ancestral knowledge, and derives necessary and sufficient criteria for classifying variables as definite descendants, definite non-descendants, or possible descendants of the target using only local information (including explicit vs. implicit causes). The paper also presents LABITER, a locally focused algorithm that integrates background knowledge to identify causal relations more accurately and efficiently than full-graph methods, with reported improvements in local-structure learning, causal identification, and counterfactual fairness. Applications to fair ML and real data (e.g., Bank Marketing) illustrate the practical impact: exploiting prior knowledge can yield more plausible causal inferences and reduce unfairness without sacrificing predictive performance.
Abstract
Causality plays a pivotal role in various fields of study. Based on the framework of causal graphical models, previous works have proposed identifying whether a variable is a cause or non-cause of a target in every Markov equivalent graph solely by learning a local structure. However, the presence of prior knowledge, often represented as a partially known causal graph, is common in many causal modeling applications. Leveraging this prior knowledge allows for the further identification of causal relationships. In this paper, we first propose a method for learning the local structure using all types of causal background knowledge, including direct causal information, non-ancestral information and ancestral information. Then we introduce criteria for identifying causal relationships based solely on the local structure in the presence of prior knowledge. We also apply our method to fair machine learning. Experiments involving local structure learning, causal relationship identification, and fair machine learning demonstrate that our method is both effective and efficient.
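To make the three categories concrete, the sketch below classifies variables as definite descendants, definite non-descendants, or possible descendants of a target by brute force. It is not the local criteria or the algorithm proposed in this paper: it enumerates every DAG that keeps the given directed edges, is acyclic, and has the same v-structures as the directed part of the input, which is only feasible for toy graphs. The function names and the four-variable chain C - X - A - B are assumptions made for illustration.

```python
from itertools import product


def descendants(edges, source):
    """Nodes reachable from `source` along directed edges (source excluded unless on a cycle)."""
    children = {}
    for a, b in edges:
        children.setdefault(a, set()).add(b)
    seen, stack = set(), [source]
    while stack:
        node = stack.pop()
        for child in children.get(node, ()):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def is_acyclic(edges, nodes):
    """A directed graph is acyclic iff no node can reach itself."""
    return all(n not in descendants(edges, n) for n in nodes)


def v_structures(edges, adjacent):
    """Colliders a -> b <- c whose endpoints a and c are non-adjacent."""
    parents = {}
    for a, b in edges:
        parents.setdefault(b, set()).add(a)
    return {
        (a, b, c)
        for b, ps in parents.items()
        for a in ps
        for c in ps
        if a < c and frozenset((a, c)) not in adjacent
    }


def classify(directed, undirected, target):
    """Brute-force classification relative to `target` (illustrative only).

    `directed` holds known edges (a, b) meaning a -> b, including background
    knowledge; `undirected` holds edges whose orientation is still unknown.
    """
    all_edges = list(directed) + list(undirected)
    nodes = {n for e in all_edges for n in e}
    adjacent = {frozenset(e) for e in all_edges}
    reference = v_structures(directed, adjacent)

    desc_sets = []
    for flips in product((False, True), repeat=len(undirected)):
        oriented = [(b, a) if f else (a, b) for (a, b), f in zip(undirected, flips)]
        dag = list(directed) + oriented
        # Keep only acyclic orientations that introduce no new v-structure.
        if is_acyclic(dag, nodes) and v_structures(dag, adjacent) == reference:
            desc_sets.append(descendants(dag, target))
    if not desc_sets:
        raise ValueError("no DAG is consistent with the given edges")

    labels = {}
    for n in sorted(nodes - {target}):
        hits = sum(n in d for d in desc_sets)
        if hits == len(desc_sets):
            labels[n] = "definite descendant"
        elif hits == 0:
            labels[n] = "definite non-descendant"
        else:
            labels[n] = "possible descendant"
    return labels


# No background knowledge: the chain C - X - A - B is fully undirected.
no_bk = classify([], [("C", "X"), ("X", "A"), ("A", "B")], "X")
# Direct causal knowledge X -> A.
with_bk = classify([("X", "A")], [("C", "X"), ("A", "B")], "X")
# Additional knowledge C -> X.
more_bk = classify([("C", "X"), ("X", "A")], [("A", "B")], "X")
```

In this toy example, every variable is only a possible descendant of X without background knowledge. Adding X -> A forces A -> B (the opposite orientation would create a new v-structure), so A and B become definite descendants while C stays undetermined. Adding C -> X then makes C a definite non-descendant.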
