DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
Yinjun Wu, Mayank Keoliya, Kan Chen, Neelay Velingker, Ziyang Li, Emily J Getzen, Qi Long, Mayur Naik, Ravi B Parikh, Eric Wong
TL;DR
DISCRET tackles faithful yet accurate ITE estimation by automatically synthesizing per-sample rule-based explanations that also serve to retrieve similar subgroups for local treatment effect estimation. It leverages a tailored deep Q-learning framework to produce conjunctive-disjunctive rules and guarantees local satisfiability for each target instance, yielding near‑perfect faithfulness (consistency) while maintaining accuracy on par with black-box models. The framework can regularize strong neural predictors by aligning their outputs with DISCRET, improving performance across tabular, image, and text domains, and it demonstrates substantial empirical gains over self‑interpretable baselines. The work contributes a novel combination of rule synthesis, causal ITE theory, and RL-based training, with practical implications for trustworthy treatment-effect decision-making in diverse data modalities.
Abstract
Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can serve dually as database queries to identify similar subgroups of samples. We provide a novel RL algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024.
