Countering Overfitting with Counterfactual Examples

Flavio Giorgi; Fabiano Veglianti; Fabrizio Silvestri; Gabriele Tolomei

TL;DR

Overfitting degrades generalization. To counter it, the authors propose CF-Reg, a regularizer that enforces a margin between each training example and its counterfactual. Grounded in margin theory, CF-Reg is designed to be compatible with any differentiable counterfactual generator and is optimized jointly with the empirical risk. Across multiple datasets and architectures, CF-Reg often generalizes better than traditional regularizers and adversarial training, and it produces counterfactual explanations as a by-product. The work highlights a principled trade-off between generalization and explainability and points to efficiency improvements as a direction for practical deployment.

Abstract

Overfitting is a well-known issue in machine learning that occurs when a model struggles to generalize its predictions to new, unseen data beyond the scope of its training set. Traditional techniques to mitigate overfitting include early stopping, data augmentation, and regularization. In this work, we demonstrate that the degree of overfitting of a trained model is correlated with the ability to generate counterfactual examples. The higher the overfitting, the easier it will be to find a valid counterfactual example for a randomly chosen input data point. Therefore, we introduce CF-Reg, a novel regularization term in the training loss that controls overfitting by ensuring enough margin between each instance and its corresponding counterfactual. Experiments conducted across multiple datasets and models show that our counterfactual regularizer generally outperforms existing regularization techniques.
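
To make the idea of a margin between an instance and its counterfactual concrete, here is a minimal sketch and not the authors' formulation. It assumes a linear logistic classifier, for which the nearest counterfactual under the L2 norm has a closed form (the distance to the decision hyperplane), and it assumes a hinge-style penalty. The names `distance_to_boundary`, `regularized_loss`, `alpha`, and `margin` are hypothetical and chosen only for illustration.

```python
import math

def distance_to_boundary(w, b, x):
    """L2 distance from x to the hyperplane w.x + b = 0.

    For a linear classifier this equals the distance to the nearest
    counterfactual, i.e. the closest point whose predicted label flips.
    """
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    norm = math.sqrt(sum(wi * wi for wi in w))
    return abs(score) / norm

def bce(w, b, x, y):
    """Binary cross-entropy of a logistic model on one example, y in {0, 1}."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-score))
    eps = 1e-12  # guards against log(0)
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

def regularized_loss(w, b, data, alpha=0.5, margin=1.0):
    """Mean empirical risk plus an assumed hinge penalty on small distances.

    alpha and margin are illustrative hyperparameters, not values from the paper.
    """
    n = len(data)
    risk = sum(bce(w, b, x, y) for x, y in data) / n
    penalty = sum(
        max(0.0, margin - distance_to_boundary(w, b, x)) for x, _ in data
    ) / n
    return risk + alpha * penalty

# Toy usage: the third point sits very close to the boundary, so it is the
# only one that contributes to the penalty.
data = [([2.0, 0.0], 1), ([-2.0, 0.0], 0), ([0.1, 0.0], 1)]
w, b = [1.0, 0.0], 0.0
loss = regularized_loss(w, b, data, alpha=1.0, margin=1.0)
```

Under these assumptions, a point that can be flipped with a tiny perturbation raises the loss, so training is pushed toward decision boundaries that stay away from the data. For non-linear models the closed-form distance would have to be replaced by the output of a differentiable counterfactual generator, as the TL;DR notes.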
