Bayesian Networks for Causal Analysis in Socioecological Systems
Rafael Cabañas, Ana D. Maldonado, María Morales, Pedro A. Aguilera, Antonio Salmerón
TL;DR
This work advances causal analysis in socioecological systems by applying structural causal models (SCMs) to observational data and leveraging EMCC to bound counterfactual queries. It extends Bayesian networks with post-intervention and twin (counterfactual) models to quantify necessity and sufficiency relations among socioecological variables, demonstrated on a southern Spain case study of land-use and population dynamics. Key findings show immigration as a strong, often necessary and sufficient driver of population growth, and geography and density as critical factors shaping land-use outcomes, illustrating the added value of counterfactual reasoning over conventional BN analyses. The methodology enables policy-relevant insights where interventional data are unavailable, with broader applicability to ecosystem services, risk assessment, and adaptive management in environmental science.
Abstract
Causal and counterfactual reasoning are emerging directions in data science that allow us to reason about hypothetical scenarios. This is particularly useful in fields like environmental and ecological sciences, where interventional data are usually not available. Structural causal models are probabilistic models for causal analysis that simplify this kind of reasoning due to their graphical representation. They can be regarded as extensions of the so-called Bayesian networks, a well known modeling tool commonly used in environmental and ecological problems. The main contribution of this paper is to analyze the relations of necessity and sufficiency between the variables of a socioecological system using counterfactual reasoning with Bayesian networks. In particular, we consider a case study involving socioeconomic factors and land-uses in southern Spain. In addition, this paper aims to be a coherent overview of the fundamental concepts for applying counterfactual reasoning, so that environmental researchers with a background in Bayesian networks can easily take advantage of the structural causal model formalism.
