Causal Entropy and Information Gain for Measuring Causal Control

Francisco Nunes Ferreira Quialheiro Simoes; Mehdi Dastani; Thijs van Ommen

Causal Entropy and Information Gain for Measuring Causal Control

Francisco Nunes Ferreira Quialheiro Simoes, Mehdi Dastani, Thijs van Ommen

TL;DR

The paper tackles the challenge of causal interpretability by extending entropy-based information theory to the interventional setting. It introduces causal entropy $H_c(Y\,|\, \\text{do}(X\sim X'))$ and causal information gain $I_c(Y\,|\, \\text{do}(X\sim X'))$, defined with intervention protocols, to quantify how much control a feature $X$ can exert over an outcome $Y$ and to relate this control to total causal effects. A key contribution is the formalization of these measures and the demonstration that they can correctly identify causal control in a structured example where standard mutual information misleads due to confounding. The work establishes a theoretical foundation for causally interpretable feature selection and motivates future estimators, extensions to continuous variables, and practical applications in domains with known or learnable causal structure.

Abstract

Artificial intelligence models and methods commonly lack causal interpretability. Despite the advancements in interpretable machine learning (IML) methods, they frequently assign importance to features which lack causal influence on the outcome variable. Selecting causally relevant features among those identified as relevant by these methods, or even before model training, would offer a solution. Feature selection methods utilizing information theoretical quantities have been successful in identifying statistically relevant features. However, the information theoretical quantities they are based on do not incorporate causality, rendering them unsuitable for such scenarios. To address this challenge, this article proposes information theoretical quantities that incorporate the causal structure of the system, which can be used to evaluate causal importance of features for some given outcome variable. Specifically, we introduce causal versions of entropy and mutual information, termed causal entropy and causal information gain, which are designed to assess how much control a feature provides over the outcome variable. These newly defined quantities capture changes in the entropy of a variable resulting from interventions on other variables. Fundamental results connecting these quantities to the existence of causal effects are derived. The use of causal information gain in feature selection is demonstrated, highlighting its superiority over standard mutual information in revealing which features provide control over a chosen outcome variable. Our investigation paves the way for the development of methods with improved interpretability in domains involving causation.

Causal Entropy and Information Gain for Measuring Causal Control

TL;DR

The paper tackles the challenge of causal interpretability by extending entropy-based information theory to the interventional setting. It introduces causal entropy

and causal information gain

, defined with intervention protocols, to quantify how much control a feature

can exert over an outcome

and to relate this control to total causal effects. A key contribution is the formalization of these measures and the demonstration that they can correctly identify causal control in a structured example where standard mutual information misleads due to confounding. The work establishes a theoretical foundation for causally interpretable feature selection and motivates future estimators, extensions to continuous variables, and practical applications in domains with known or learnable causal structure.

Abstract

Paper Structure (5 sections, 4 equations)

This paper contains 5 sections, 4 equations.

Introduction
Formal Setting
Structural Causal Models
Entropy and Mutual Information
Running Example - Comparing Control Over an Outcome

Theorems & Definitions (7)

definition thmcounterdefinition: Structural Causal Model
definition thmcounterdefinition: Entailed distribution
definition thmcounterdefinition: Atomic intervention
definition thmcounterdefinition: Total Causal Effect
definition thmcounterdefinition: Entropy and Conditional Entropy thomas2006information
remark thmcounterremark
definition thmcounterdefinition: Mutual Information thomas2006information

Causal Entropy and Information Gain for Measuring Causal Control

TL;DR

Abstract

Causal Entropy and Information Gain for Measuring Causal Control

Authors

TL;DR

Abstract

Table of Contents

Theorems & Definitions (7)