BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs
Zifeng Wang, Zichen Wang, Balasubramaniam Srinivasan, Vassilis N. Ioannidis, Huzefa Rangwala, Rishita Anubhai
TL;DR
BioBridge addresses the challenge of unlocking multimodal reasoning for biomedical foundation models by introducing a KG-guided bridge that connects frozen unimodal embeddings. It learns a relation-aware additive transformation via a bridge module and modality projections, optimized with a contrastive InfoNCE objective $\mathcal{L}_{ij}$, while the base FMs remain fixed. The approach yields strong cross-modal retrieval, semantic alignment, and out-of-domain generalization, and supports retrieval-augmented multimodal generation for drug discovery prompts, achieving substantial improvements over traditional KG embeddings. This work offers a data-efficient path to multimodal biomedical AI and suggests extensions to connect pre-trained FMs across domains through knowledge graphs.
Abstract
Foundation models (FMs) are able to leverage large volumes of unlabeled data to demonstrate superior performance across a wide range of tasks. However, FMs developed for biomedical domains have largely remained unimodal, i.e., independently trained and used for tasks on protein sequences alone, small molecule structures alone, or clinical data alone. To overcome this limitation of biomedical FMs, we present BioBridge, a novel parameter-efficient learning framework, to bridge independently trained unimodal FMs to establish multimodal behavior. BioBridge achieves it by utilizing Knowledge Graphs (KG) to learn transformations between one unimodal FM and another without fine-tuning any underlying unimodal FMs. Our empirical results demonstrate that BioBridge can beat the best baseline KG embedding methods (on average by around 76.3%) in cross-modal retrieval tasks. We also identify BioBridge demonstrates out-of-domain generalization ability by extrapolating to unseen modalities or relations. Additionally, we also show that BioBridge presents itself as a general purpose retriever that can aid biomedical multimodal question answering as well as enhance the guided generation of novel drugs.
