Interpretable Feature Interaction via Statistical Self-supervised Learning on Tabular Data
Xiaochen Zhang, Haoyi Xiong
TL;DR
Spofe introduces a statistically principled, self-supervised framework for interpretable feature interactions on tabular data by uniting kernel PCA with a sparse polynomial representation. It provides a rigorous theory for error control and false discovery rate (FDR) via a multi-objective knockoff procedure, coupled with p-value based feature significance testing. Empirically, Spofe outperforms KPCA, SKPCA, and several baselines in regression and classification tasks, and its interpretability is demonstrated through Higgs and superconductivity case studies. The approach enables reliable identification of interacting features with tangible, physics-aligned insights, making it suitable for high-stakes domains where transparency and statistical validity are essential.
Abstract
In high-dimensional and high-stakes contexts, ensuring both rigorous statistical guarantees and interpretability in feature extraction from complex tabular data remains a formidable challenge. Traditional methods such as Principal Component Analysis (PCA) reduce dimensionality and identify key features that explain the most variance, but are constrained by their reliance on linear assumptions. In contrast, neural networks offer assumption-free feature extraction through self-supervised learning techniques such as autoencoders, though their interpretability remains a challenge in fields requiring transparency. To address this gap, this paper introduces Spofe, a novel self-supervised machine learning pipeline that marries the power of kernel principal components for capturing nonlinear dependencies with a sparse and principled polynomial representation to achieve clear interpretability with statistical rigor. Underpinning our approach is a robust theoretical framework that delivers precise error bounds and rigorous false discovery rate (FDR) control via a multi-objective knockoff selection procedure; it effectively bridges the gap between data-driven complexity and statistical reliability via three stages: (1) generating self-supervised signals using kernel principal components to model complex patterns, (2) distilling these signals into sparse polynomial functions for improved interpretability, and (3) applying a multi-objective knockoff selection procedure with significance testing to rigorously identify important features. Extensive experiments on diverse real-world datasets demonstrate the effectiveness of Spofe, consistently surpassing KPCA, SKPCA, and other methods in feature selection for regression and classification tasks. Visualization and case studies highlight its ability to uncover key insights, enhancing interpretability and practical utility.
