EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks
Michael Arbel, David Salinas, Frank Hutter
TL;DR
EquiTabPFN introduces a target-permutation equivariant architecture for prior-fitted networks to handle arbitrary class counts in tabular data. By integrating a target-equivariant encoder, alternating bi-attention across features and samples, and a non-parametric equivariant decoder, the model achieves robust in-context learning without fixed target dimensionality. Theoretical results show that the optimal pre-training objective naturally favors target-equivariant functions, and empirical results demonstrate strong performance and runtime efficiency on unseen-class benchmarks compared to existing PFN variants. This work provides a principled approach to leveraging symmetry in tabular data, reducing the need for costly ensembling while enabling scalable classification across diverse task sizes.
Abstract
Recent foundational models for tabular data, such as TabPFN, excel at adapting to new tasks via in-context learning, but remain constrained to a fixed, pre-defined number of target dimensions-often necessitating costly ensembling strategies. We trace this constraint to a deeper architectural shortcoming: these models lack target equivariance, so that permuting target dimension orderings alters their predictions. This deficiency gives rise to an irreducible "equivariance gap", an error term that introduces instability in predictions. We eliminate this gap by designing a fully target-equivariant architecture-ensuring permutation invariance via equivariant encoders, decoders, and a bi-attention mechanism. Empirical evaluation on standard classification benchmarks shows that, on datasets with more classes than those seen during pre-training, our model matches or surpasses existing methods while incurring lower computational overhead.
