Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Zhixuan Chu, Mengxuan Hu, Qing Cui, Longfei Li, Sheng Li
TL;DR
The paper tackles the challenge of trustworthy risk prediction by addressing the lack of causal reasoning and the prevalence of class imbalance in traditional models. It introduces Task-Driven Causal Feature Distillation (TDCFD), which uses the Potential Outcome Framework to distill task-specific causal attributions for each feature via relational graph construction, adaptive group Lasso propensity score estimation, and causal feature attribution estimation, followed by risk prediction using a neural network on these distilled features. The authors provide theoretical guarantees for estimator consistency and unbiased causal estimation, and empirically demonstrate that TDCFD achieves superior precision and recall while offering causal interpretability on both synthetic and real-world datasets. This approach advances trustworthy AI by producing predictions that are not only accurate but also causally explainable, enabling better decision-making in finance, healthcare, and beyond.
Abstract
Since artificial intelligence has seen tremendous recent successes in many areas, it has sparked great interest in its potential for trustworthy and interpretable risk prediction. However, most models lack causal reasoning and struggle with class imbalance, leading to poor precision and recall. To address this, we propose a Task-Driven Causal Feature Distillation model (TDCFD) to transform original feature values into causal feature attributions for the specific risk prediction task. The causal feature attribution helps describe how much contribution the value of this feature can make to the risk prediction result. After the causal feature distillation, a deep neural network is applied to produce trustworthy prediction results with causal interpretability and high precision/recall. We evaluate the performance of our TDCFD method on several synthetic and real datasets, and the results demonstrate its superiority over the state-of-the-art methods regarding precision, recall, interpretability, and causality.
