Counterfactual Maps: What They Are and How to Find Them
Awa Khouna, Julien Ferry, Thibaut Vidal
TL;DR
This work tackles the challenge of generating globally optimal counterfactual explanations for tree ensembles, whose predictions are piecewise constant over axis-aligned hyperrectangles. It introduces counterfactual maps, a global, preprocessable representation that reduces recourse to nearest-region search and yields optimal counterfactuals by projecting onto the nearest hyperrectangle with an alternative label, $y'$. The method has a one-time preprocessing stage that extracts a faithful hyperrectangle partition and builds per-target-class volumetric KD-trees, followed by a query-time certified nearest-region search with optimality guarantees. Experiments on real tabular datasets demonstrate millisecond-scale query latency and superior performance to both exact (OCEAN) and heuristic baselines, with scalable preprocessing. Overall, counterfactual maps enable scalable, trustworthy recourse in interactive deployments and point to extensions to other piecewise-constant models.
Abstract
Counterfactual explanations are a central tool in interpretable machine learning, yet computing them exactly for complex models remains challenging. For tree ensembles, predictions are piecewise constant over a large collection of axis-aligned hyperrectangles, implying that an optimal counterfactual for a point corresponds to its projection onto the nearest rectangle with an alternative label under a chosen metric. Existing methods largely overlook this geometric structure, relying either on heuristics with no optimality guarantees or on mixed-integer programming formulations that do not scale to interactive use. In this work, we revisit counterfactual generation through the lens of nearest-region search and introduce counterfactual maps, a global representation of recourse for tree ensembles. Leveraging the fact that any tree ensemble can be compressed into an equivalent partition of labeled hyperrectangles, we cast counterfactual search as the problem of identifying the generalized Voronoi cell associated with the nearest rectangle of an alternative label. This leads to an exact, amortized algorithm based on volumetric k-dimensional (KD) trees, which performs branch-and-bound nearest-region queries with explicit optimality certificates and sublinear average query time after a one-time preprocessing phase. Our experimental analyses on several real datasets drawn from high-stakes application domains show that this approach delivers globally optimal counterfactual explanations with millisecond-level latency, achieving query times that are orders of magnitude faster than existing exact, cold-start optimization methods.
