xRFM: Accurate, scalable, and interpretable feature learning models for tabular data

Daniel Beaglehole; David Holzmüller; Adityanarayanan Radhakrishnan; Mikhail Belkin

TL;DR

xRFM addresses prediction on tabular data by combining feature-learning kernel machines with a tree-based partitioning scheme that captures local data structure. Features are learned locally within each leaf, while training time stays near-linear and inference time logarithmic, and the Average Gradient Outer Product provides native interpretability. Empirically, xRFM achieves the best performance across 100 tabular regression datasets and is competitive with the best methods across 200 classification datasets, where it outperforms GBDTs. The result is a scalable, interpretable method suited to large-scale tabular prediction tasks with heterogeneous structure.

Abstract

Inference from tabular data, collections of continuous and categorical variables organized into matrices, is a foundation for modern technology and science. Yet, in contrast to the explosive changes in the rest of AI, the best practice for these predictive tasks has remained relatively unchanged and is still primarily based on variations of Gradient Boosted Decision Trees (GBDTs). Very recently, there has been renewed interest in developing state-of-the-art methods for tabular data based on recent developments in neural networks and feature learning methods. In this work, we introduce xRFM, an algorithm that combines feature learning kernel machines with a tree structure to both adapt to the local structure of the data and scale to essentially unlimited amounts of training data. We show that, compared to $31$ other methods, including recently introduced tabular foundation models (TabPFNv2) and GBDTs, xRFM achieves the best performance across $100$ regression datasets and is competitive with the best methods across $200$ classification datasets, outperforming GBDTs. Additionally, xRFM provides interpretability natively through the Average Gradient Outer Product.
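
To make the interpretability claim concrete, the sketch below computes an Average Gradient Outer Product (AGOP) for a toy predictor, using the standard definition: the average over data points of the outer product of the predictor's gradient with itself. This is an illustration only and not the authors' implementation: the names `agop` and `toy_model`, the finite-difference gradient estimate, and the synthetic data are all assumptions introduced here.

```python
import numpy as np

def agop(predict, X, eps=1e-5):
    """Average Gradient Outer Product of `predict` over the rows of X.

    Hypothetical helper: gradients are estimated with central finite
    differences, so `predict` only needs to map an (n, d) array to an
    (n,) array of predictions.
    """
    n, d = X.shape
    grads = np.empty((n, d))
    for j in range(d):
        step = np.zeros(d)
        step[j] = eps
        # Central difference along coordinate j for all rows at once.
        grads[:, j] = (predict(X + step) - predict(X - step)) / (2 * eps)
    # (1/n) * sum_i grad_i grad_i^T, a symmetric (d, d) matrix.
    return grads.T @ grads / n

def toy_model(X):
    # Toy predictor (an assumption): depends on features 0 and 1, ignores 2.
    return np.sin(X[:, 0]) + 2.0 * X[:, 1]

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))

M = agop(toy_model, X)
importance = np.diag(M)  # per-feature sensitivity of the predictor
print(importance)
```

In this toy setting the diagonal of the matrix acts as a per-feature importance score: the ignored third feature receives a score of zero, and the off-diagonal entries indicate which features the predictor varies along jointly.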
