Algorithmic Identification of Essential Exogenous Nodes for Causal Sufficiency in Brain Networks
Abdolmahdi Bagheri, Mahdi Dehshiri, Babak Nadjar Araabi, Alireza Akhondi Asl
TL;DR
This work tackles causal sufficiency in brain networks by proposing an algorithmic pipeline to identify essential exogenous nodes that must be included to avoid spurious causal links. The method combines PC-like independence testing, KS-based discrimination of candidate confounders across networks, and NF-iVAE with Correlation Coefficient index to confirm confounding variables, all in a population-based framework. Applied to the HCP movie-watching dataset with Schaefer-based parcellation plus subcortical regions, the approach identifies dorsal regions as confounders for visual networks and validates findings against neuroscientific expectations, with reliability shown across 30 NF-iVAE initializations. The results demonstrate that incorporating identified exogenous nodes enables more accurate causal discovery in larger brain networks and provides a practical pathway for applying existing causal methods to smaller, confounded subgraphs while preserving causal_sufficiency.
Abstract
In the investigation of any causal mechanisms, such as the brain's causal networks, the assumption of causal sufficiency plays a critical role. Notably, neglecting this assumption can result in significant errors, a fact that is often disregarded in the causal analysis of brain networks. In this study, we propose an algorithmic identification approach for determining essential exogenous nodes that satisfy the critical need for causal sufficiency to adhere to it in such inquiries. Our approach consists of three main steps: First, by capturing the essence of the Peter-Clark (PC) algorithm, we conduct independence tests for pairs of regions within a network, as well as for the same pairs conditioned on nodes from other networks. Next, we distinguish candidate confounders by analyzing the differences between the conditional and unconditional results, using the Kolmogorov-Smirnov test. Subsequently, we utilize Non-Factorized identifiable Variational Autoencoders (NF-iVAE) along with the Correlation Coefficient index (CCI) metric to identify the confounding variables within these candidate nodes. Applying our method to the Human Connectome Projects (HCP) movie-watching task data, we demonstrate that while interactions exist between dorsal and ventral regions, only dorsal regions serve as confounders for the visual networks, and vice versa. These findings align consistently with those resulting from the neuroscientific perspective. Finally, we show the reliability of our results by testing 30 independent runs for NF-iVAE initialization.
