Graph Representation Learning in Biomedicine
Michelle M. Li, Kexin Huang, Marinka Zitnik
TL;DR
Graph representation learning in biomedicine analyzes how to embed heterogeneous biomedical graphs into compact vectors to support prediction, discovery, and interpretation across molecular, genomic, therapeutic, and healthcare domains. The paper surveys three core families—shallow embedding methods, graph neural networks, and generative graph models—and connects them to long-standing systems biology principles to explain successes and limitations. It highlights applications including predicting molecular interactions, disease mechanisms, drug actions, and patient-level predictions, and discusses scalability, interpretability, and data integration challenges. The work provides a unified framework and roadmap for future graph-based biomedicine research, with emphasis on multi-scale knowledge graphs, spatial and single-cell data, and responsible deployment in clinical settings.
Abstract
Biomedical networks (or graphs) are universal descriptors for systems of interacting elements, from molecular interactions and disease co-morbidity to healthcare systems and scientific knowledge. Advances in artificial intelligence, specifically deep learning, have enabled us to model, analyze, and learn with such networked data. In this review, we put forward an observation that long-standing principles of systems biology and medicine -- while often unspoken in machine learning research -- provide the conceptual grounding for representation learning on graphs, explain its current successes and limitations, and even inform future advancements. We synthesize a spectrum of algorithmic approaches that, at their core, leverage graph topology to embed networks into compact vector spaces. We also capture the breadth of ways in which representation learning has dramatically improved the state-of-the-art in biomedical machine learning. Exemplary domains covered include identifying variants underlying complex traits, disentangling behaviors of single cells and their effects on health, assisting in diagnosis and treatment of patients, and developing safe and effective medicines.
