FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks
Syed Asad Rizvi, Nazreen Pallikkavaliyaveetil, David Zhang, Zhuoyang Lyu, Nhi Nguyen, Haoran Lyu, Benjamin Christensen, Josue Ortega Caro, Antonio H. O. Fonseca, Emanuele Zappala, Maryam Bagherian, Christopher Averill, Chadi G. Abdallah, Amin Karbasi, Rex Ying, Maria Brbic, Rahul Madhav Dhodapkar, David van Dijk
TL;DR
This work introduces Foundation-Informed Message Passing (FIMP), a framework that repurposes the self-attention mechanisms of pretrained non-textual foundation models to drive cross-node message passing in graphs. By unifying node tokenization and performing cross-node attention over token sequences, FIMP enables messages to be generated from powerful pretrained models such as ViT, scGPT, and BrainLM, or learned from scratch in a base configuration. Across image networks, spatial transcriptomics, and brain fMRI data, FIMP demonstrates consistent performance gains over traditional GNNs and related baselines, including strong zero-shot embeddings with pretrained ViT on image graphs. The results underscore the potential of cross-domain foundation models to enhance graph-based tasks and open avenues for multimodal and zero-shot graph learning, with future work addressing broader domains and computational efficiency.
Abstract
Foundation models have achieved remarkable success across many domains, relying on pretraining over vast amounts of data. Graph-structured data often lacks the same scale as unstructured data, making the development of graph foundation models challenging. In this work, we propose Foundation-Informed Message Passing (FIMP), a Graph Neural Network (GNN) message-passing framework that leverages pretrained non-textual foundation models in graph-based tasks. We show that the self-attention layers of foundation models can effectively be repurposed on graphs to perform cross-node attention-based message-passing. Our model is evaluated on a real-world image network dataset and two biological applications (single-cell RNA sequencing data and fMRI brain activity recordings) in both finetuned and zero-shot settings. FIMP outperforms strong baselines, demonstrating that it can effectively leverage state-of-the-art foundation models in graph tasks.
