The most likely common cause
A. Hovhannisyan, A. E. Allahverdyan
TL;DR
The paper tackles causal insufficiency by assuming a latent common cause $C$ for observed variables $A$ and $B$ when only the joint distribution $p(a,b)$ is observed, leading to non-identifiability. It introduces a generalized likelihood $L_\beta$ with $0<\beta<1$ that connects to the maximum entropy principle to infer the most likely $p(a,c)$ and $p(b|c)$ consistent with CCP, while addressing identifiability via a fixed $|C|$ and linking to free-energy concepts. In binary setups, it reveals a phase-transition-like non-analytic change in the inferred cause as correlations shift, and it extends the analysis to three variables where latent causes can induce structures beyond DAG representations (TODAGs). The work also compares generalized likelihood to predictive likelihood and minimum common entropy, arguing that GL provides a more consistent and flexible framework for latent-confounder inference, with implications for causal modeling and potential extensions to higher dimensions and continuous domains.
Abstract
The common cause principle for two random variables $A$ and $B$ is examined in the case of causal insufficiency, when their common cause $C$ is known to exist, but only the joint probability of $A$ and $B$ is observed. As a result, $C$ cannot be uniquely identified (the latent confounder problem). We show that the generalized maximum likelihood method can be applied to this situation and allows identification of $C$ that is consistent with the common cause principle. It closely relates to the maximum entropy principle. Investigation of the two binary symmetric variables reveals a non-analytic behavior of conditional probabilities reminiscent of a second-order phase transition. This occurs during the transition from correlation to anti-correlation in the observed probability distribution. The relation between the generalized likelihood approach and alternative methods, such as predictive likelihood and the minimum common cause entropy, is discussed. The consideration of the common cause for three observed variables (and one hidden cause) uncovers causal structures that defy representation through directed acyclic graphs with the Markov condition.
