Does TabPFN Understand Causal Structures?
Omar Swelam, Lennart Purucker, Jake Robertson, Hanne Raum, Joschka Boedecker, Frank Hutter
TL;DR
This work asks whether a tabular foundation model pretrained on synthetic causal data encodes causal structure in its representations. It introduces an adapter with a learnable dual-attention decoder and universal tokens to extract causal signals from TabPFN's frozen embeddings and decode them into adjacency matrices. The study shows that TabPFN embeddings contain causal information, concentrated in mid-range layers, and that the approach can outperform traditional causal discovery algorithms on synthetic benchmarks. Overall, the results point to a promising direction for using pre-trained tabular models to support interpretable and adaptable causal discovery across domains.
Abstract
Causal discovery is fundamental for multiple scientific domains, yet extracting causal information from real world data remains a significant challenge. Given the recent success on real data, we investigate whether TabPFN, a transformer-based tabular foundation model pre-trained on synthetic datasets generated from structural causal models, encodes causal information in its internal representations. We develop an adapter framework using a learnable decoder and causal tokens that extract causal signals from TabPFN's frozen embeddings and decode them into adjacency matrices for causal discovery. Our evaluations demonstrate that TabPFN's embeddings contain causal information, outperforming several traditional causal discovery algorithms, with such causal information being concentrated in mid-range layers. These findings establish a new direction for interpretable and adaptable foundation models and demonstrate the potential for leveraging pre-trained tabular models for causal discovery.
