Synthesising Counterfactual Explanations via Label-Conditional Gaussian Mixture Variational Autoencoders
Junqi Jiang, Francesco Leofante, Antonio Rago, Francesca Toni
TL;DR
The paper addresses generating counterfactual explanations that balance validity, proximity, plausibility, diversity, and robustness to perturbations. It introduces L-GMVAE, a label-conditioned Gaussian mixture VAE that assigns class-specific latent clusters, and LAPACE, which constructs CE paths by linearly interpolating from a input's latent code to fixed class centroids and decoding the path into the input space. This approach yields CEs that are robust to input changes, while offering a spectrum of recourses from near-proximate to highly robust, and supports actionability constraints via lightweight gradient updates. Empirical results across four tabular datasets demonstrate competitive metrics, perfect robustness to input changes in the best variant, strong plausibility, and fast inference, highlighting practical utility for model-agnostic recourse generation.
Abstract
Counterfactual explanations (CEs) provide recourse recommendations for individuals affected by algorithmic decisions. A key challenge is generating CEs that are robust against various perturbation types (e.g. input and model perturbations) while simultaneously satisfying other desirable properties. These include plausibility, ensuring CEs reside on the data manifold, and diversity, providing multiple distinct recourse options for single inputs. Existing methods, however, mostly struggle to address these multifaceted requirements in a unified, model-agnostic manner. We address these limitations by proposing a novel generative framework. First, we introduce the Label-conditional Gaussian Mixture Variational Autoencoder (L-GMVAE), a model trained to learn a structured latent space where each class label is represented by a set of Gaussian components with diverse, prototypical centroids. Building on this, we present LAPACE (LAtent PAth Counterfactual Explanations), a model-agnostic algorithm that synthesises entire paths of CE points by interpolating from inputs' latent representations to those learned latent centroids. This approach inherently ensures robustness to input changes, as all paths for a given target class converge to the same fixed centroids. Furthermore, the generated paths provide a spectrum of recourse options, allowing users to navigate the trade-off between proximity and plausibility while also encouraging robustness against model changes. In addition, user-specified actionability constraints can also be easily incorporated via lightweight gradient optimisation through the L-GMVAE's decoder. Comprehensive experiments show that LAPACE is computationally efficient and achieves competitive performance across eight quantitative metrics.
