From Correlation to Causation: Understanding Climate Change through Causal Analysis and LLM Interpretations
Shan Shan
TL;DR
The paper tackles moving from correlation to causation in climate-change research by proposing a three-step causal inference framework that combines correlation analysis, ML-based causality discovery, and LLM-guided interpretations. It contributes a structured methodology that narrows the variable pool, constructs and prunes causal graphs via CAM pruning, and validates interpretations through LLM prompts aligned with a formal causal taxonomy and Pearl’s Causal Hierarchy. Key findings highlight strong causal effects of Access to Clean Fuels and Technologies for Cooking and Urban Population on per-capita carbon emissions, offering policy-relevant levers for emissions reduction. The work demonstrates a practical pathway for data-driven policymaking by integrating observational data with causal inference and interpretable, language-model-based insights.
Abstract
This research presents a three-step causal inference framework that integrates correlation analysis, machine learning-based causality discovery, and LLM-driven interpretations to identify socioeconomic factors influencing carbon emissions and contributing to climate change. The approach begins with identifying correlations, progresses to causal analysis, and enhances decision making through LLM-generated inquiries about the context of climate change. The proposed framework offers adaptable solutions that support data-driven policy-making and strategic decision-making in climate-related contexts, uncovering causal relationships within the climate change domain.
