iLOCO: Distribution-Free Inference for Feature Interactions
Camille Little, Lili Zheng, Genevera Allen
TL;DR
This work introduces iLOCO, a model-agnostic, distribution-free metric to quantify pairwise and higher-order feature interactions and to provide calibrated uncertainty via confidence intervals. By framing iLOCO within a functional ANOVA decomposition, it interprets joint interaction contributions as sums of squared ANOVA components and extends to higher-order terms. The authors develop two scalable inference approaches, data-splitting and minipatch ensembles, with rigorous asymptotic validity under mild moment conditions, demonstrated through simulations and real-data case studies. The results offer the first practical, inferential tool for detecting and quantifying feature interactions with uncertainty quantification, enabling more reliable interpretation and decision-making in complex models.
Abstract
Feature importance measures are widely studied and are essential for understanding model behavior, guiding feature selection, and enhancing interpretability. However, many machine learning fitted models involve complex interactions between features. Existing feature importance metrics fail to capture these pairwise or higher-order effects, while existing interaction metrics often suffer from limited applicability or excessive computation; no methods exist to conduct statistical inference for feature interactions. To bridge this gap, we first propose a new model-agnostic metric, interaction Leave-One-Covariate-Out (iLOCO), for measuring the importance of pairwise feature interactions, with extensions to higher-order interactions. Next, we leverage recent advances in LOCO inference to develop distribution-free and assumption-light confidence intervals for our iLOCO metric. To address computational challenges, we also introduce an ensemble learning method for calculating the iLOCO metric and confidence intervals that we show is both computationally and statistically efficient. We validate our iLOCO metric and our confidence intervals on both synthetic and real data sets, showing that our approach outperforms existing methods and provides the first inferential approach to detecting feature interactions.
