IIFE: Interaction Information Based Automated Feature Engineering
Tom Overman, Diego Klabjan, Jean Utke
TL;DR
The paper addresses automated feature engineering (AutoFE) by introducing IIFE, an iterative method that uses interaction information to identify synergistic feature pairs and construct high-quality features. It formalizes the interaction information as $\tau_{ij}=I(F_i,F_j,Y)=I(F_i,F_j|Y)-I(F_i,F_j)$ and uses this score to drive successive feature pairings, uni-, and bi-variate transformations, validated through cross-validated model evaluation. Empirical results show IIFE outperforms recent AutoFE baselines on multiple public datasets and on a large-scale private dataset, while also enabling acceleration of other expand-reduce AutoFE approaches and enabling combination with other methods for further gains. The work also critiques prevalent AutoFE evaluation practices, demonstrating inflated cross-validation gains and proposing inductive hold-out evaluation as a more realistic metric, with practical implications for reproducibility and fair comparisons.
Abstract
Automated feature engineering (AutoFE) is the process of automatically building and selecting new features that help improve downstream predictive performance. While traditional feature engineering requires significant domain expertise and time-consuming iterative testing, AutoFE strives to make feature engineering easy and accessible to all data science practitioners. We introduce a new AutoFE algorithm, IIFE, based on determining which feature pairs synergize well through an information-theoretic perspective called interaction information. We demonstrate the superior performance of IIFE over existing algorithms. We also show how interaction information can be used to improve existing AutoFE algorithms. Finally, we highlight several critical experimental setup issues in the existing AutoFE literature and their effects on performance.
