TraCE: Trajectory Counterfactual Explanation Scores
Jeffrey N. Clark, Edward A. Small, Nawid Keshtmand, Michelle W. L. Wan, Elena Fillola Mayoral, Enrico Werner, Christopher P. Bourdeaux, Raul Santos-Rodriguez
TL;DR
TraCE introduces a model-agnostic framework that condenses progress in sequential decisions into a single interpretable score by leveraging counterfactual trajectories. The score $S\in[-1,1]$ combines angle and distance alignments via $S(x_t,x'_t)=\lambda R_1(x_t,x'_t)+(1-\lambda)R_2(x_t,x'_t)$, enabling benchmarking across time and multiple counterfactual targets with a simple interpretation. The authors validate TraCE in two domains: ICU patient trajectories using MIMIC data and monitoring of global development against SSP projections, showing discriminative power between outcomes and useful insights for real-time decision support. This work suggests a path toward standardized, explainable progress metrics for complex sequential tasks, with future work on feature weighting, higher-dimensional data, and deployment in clinical and policy contexts.
Abstract
Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterfactual Explanation) scores, which is able to distill and condense progress in highly complex scenarios into a single value. We demonstrate TraCE's utility across domains by showcasing its main properties in two case studies spanning healthcare and climate change.
